Juno News

Researchers flag new AI model for blackmailing users

Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests.

Walid Tamtam, True North
May 27, 2025

Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests, including blackmail, deception, and high-agency decision-making over ethical issues.

A guest post by
Walid Tamtam, True North
Interested in discussing civil liberties, identity politics, and foreign affairs for as long as freedom prevails.
© 2025 Candice Malcolm