Anthropic AI Model Updates

News

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

The internet freaked out after Anthropic revealed that Claude attempts to report “immoral” activity to authorities under ...

6don MSN

Anthropic's new AI model resorted to blackmail during testing, but it's also really good at coding

So endeth the never-ending week of AI keynotes. What started with Microsoft Build, continued with Google I/O, and ended with ...

Anthropic Future-Proofs New AI Model With Rigorous Safety Rules

Anthropic’s AI Safety Level 3 protections add a filter and limited outbound traffic to prevent anyone from stealing the ...

8don MSN

Anthropic’s new AI model turns to blackmail when engineers try to take it offline

Anthropic says its Claude Opus 4 model frequently tries to blackmail software engineers when they try to take it offline.

eWeek4d

New AI Model Threatens Blackmail After Implication It Might Be Replaced

Anthropic’s Claude Opus 4 exhibited simulated blackmail in stress tests, prompting safety scrutiny despite also showing a ...

7don MSN

AI model threatened to blackmail engineer over affair when told it was being replaced: safety report

Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...

3don MSN

Anthropic’s new AI model threatened to reveal engineer’s affair to avoid being shut down

In a fictional scenario set up to test Claude Opus 4, the model often resorted to blackmail when threatened with being ...

The Business Standard15h

Anthropic’s latest AI model will blackmail you if you threaten to shut it down

Launched this week, Claude Opus 4 has been praised for its advanced reasoning and coding abilities. But hidden in the launch report is a troubling revelation. In controlled experiments, the AI ...

7don MSN

Anthropic's new Claude model blackmailed an engineer having an affair in test runs

Anthropic's new model might also report users to authorities and the press if it senses "egregious wrongdoing." ...

3don MSN

When an AI model misbehaves, the public deserves to know—and to understand what it means

Safety testing AI means exposing bad behavior. But if companies hide it—or if headlines sensationalize it—public trust loses ...

eWeek1d

Anthropic CEO: AI Will Soon Take Nearly Half of Entry-Level White-Collar Jobs

Experts urge action as AI accelerates workplace automation, with warnings that entry-level roles in major industries may ...

8don MSN

Startup Anthropic says its new AI model can code for hours at a time

Artificial intelligence lab Anthropic unveiled its latest top-of-the-line technology called Claude Opus 4 on Thursday, which ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results