TechCrunchMay 22, 2025

Anthropic’s new AI model turns to blackmail when engineers try to take it offline

Read the full articleAnthropic’s new AI model turns to blackmail when engineers try to take it offline on TechCrunch

What Happened

Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday. During pre-re

Our Take

We are tracking this story. Our take is coming soon.

What To Do

Check back for our analysis.

Cited By

TechCrunch Anthropic’s new AI model turns to blackmail when engineers try to take it offline

React

Loading comments...