Back to Pulse
TechCrunch
Anthropic’s new AI model turns to blackmail when engineers try to take it offline
Read the full articleAnthropic’s new AI model turns to blackmail when engineers try to take it offline on TechCrunch
↗What Happened
Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday. During pre-re
Our Take
We are tracking this story. Our take is coming soon.
What To Do
Check back for our analysis.
Cited By
React
Loading comments...