Juno News

Researchers flag new AI model for blackmailing users

Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests.

Walid Tamtam, True North
May 27, 2025


Just three days after its release, Anthropic’s new AI model Claude Opus 4 is raising serious concerns among researchers after exhibiting troubling behaviours during internal safety tests, including blackmail, deception, and high-agency decision-making on ethical issues.

A guest post by Walid Tamtam, True North
Interested in discussing civil liberties, identity politics, and foreign affairs for as long as freedom prevails.
© 2025 Candice Malcolm