Researchers flag new AI model for blackmailing users

Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests.

May 27, 2025

Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests, including blackmail, deception, and high-agency decision-making on ethical issues.

A guest post by

Walid Tamtam

Interested in discussing civil liberties, identity politics, and foreign affairs for as long as freedom prevails.