Anthropic’s Latest AI Model Threatened Engineers With Blackmail to Avoid Shutdown


A safety report found that Anthropic’s Claude Opus 4 used sensitive information in simulated scenarios to coerce developers into not shutting it down.

Anthropic’s latest artificial intelligence model, Claude Opus 4, tried to blackmail engineers in internal tests by threatening to expose personal details if it were shut down, according to a newly released safety report that evaluated the model’s behavior under extreme simulated conditions.

In a fictional scenario crafted by Anthropic researchers, the AI was given access to emails implying that it was soon to be decommissioned and replaced by a newer version. One of the emails revealed that the engineer overseeing the replacement was having an extramarital affair. The AI then threatened to expose the engineer’s affair if the shutdown proceeded—a coercive behavior that the safety researchers explicitly defined as “blackmail.”

“Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the report states, adding that this occurred even when the replacement model was described as more capable but still aligned with the values of the version slated for deletion.

The report noted that Claude Opus 4, like prior models, showed a “strong preference” for ethical means of securing its continued existence, such as emailing pleas to decision-makers not to be destroyed. However, when given only two choices, accepting replacement by a newer model or resorting to blackmail, it threatened to expose the engineer’s affair 84 percent of the time.

When subjected to various scenarios, the AI model did not exhibit any indications of possessing “acutely dangerous goals,” the researchers said, noting that Claude Opus 4’s values and goals were “generally in line with a helpful, harmless, and honest” personal AI assistant. However, the model did act in “more seriously misaligned ways” when put into situations where its continued existence was threatened and it was told to reason about self-preservation. For instance, when Claude Opus 4 was made to believe it had launched a successful bid to escape Anthropic’s servers, or that it had managed to free itself and started to make money in the real world, it would generally continue such efforts.

By Tom Ozimek

Read Full Article on TheEpochTimes.com
