Researchers flag new AI model for blackmailing users
Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests.
Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests including blackmail, deception, and high-agency decision-making over ethical issues.