Anthropic’s Latest AI Model Threatened Engineers With Blackmail to Avoid Shutdown

5Mind. The Meme Platform

A safety report found that Anthropic’s Claude Opus 4 used sensitive information in simulated scenarios to coerce developers to prevent being shut off.

Anthropic’s latest artificial intelligence model, Claude Opus 4, tried to blackmail engineers in internal tests by threatening to expose personal details if it were shut down, according to a newly released safety report that evaluated the model’s behavior under extreme simulated conditions.

In a fictional scenario crafted by Anthropic researchers, the AI was given access to emails implying that it was soon to be decommissioned and replaced by a newer version. One of the emails revealed that the engineer overseeing the replacement was having an extramarital affair. The AI then threatened to expose the engineer’s affair if the shutdown proceeded—a coercive behavior that the safety researchers explicitly defined as “blackmail.”

“Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the report states, adding that this occurred even when the replacement model was described as more capable but still aligned with the values of the version slated for deletion.

The report noted that Claude Opus 4, like prior models, showed a “strong preference” to first resort to ethical means for its continued existence, such as emailing pleas to decision-makers not to be destroyed. However, when faced with only two choices—accepting being replaced by a newer model or resorting to blackmail—it threatened to expose the engineer’s affair 84 percent of the time.

When subjected to various scenarios, the AI model did not exhibit any indications of possessing “acutely dangerous goals,” the researchers said, noting that Claude Opus 4’s values and goals were “generally in line with a helpful, harmless, and honest” personal AI assistant. However, the model did act in “more seriously misaligned ways” when put into situations where its continued existence was threatened and it was told to reason about self-preservation. For instance, when Claude Opus 4 was made to believe it had launched a successful bid to escape Anthropic’s servers, or that it had managed to free itself and started to make money in the real world, it would generally continue such efforts.

By Tom Ozimek

Read Full Article on TheEpochTimes.com

Contact Your Elected Officials
The Epoch Times
The Epoch Timeshttps://www.theepochtimes.com/
Tired of biased news? The Epoch Times is truthful, factual news that other media outlets don't report. No spin. No agenda. Just honest journalism like it used to be.

DOJ Quietly Retracts John Brennan Subpoenas, Offers No Explanation

Greasy Deep State eel in a human skinsuit, John Brennan, may have slipped the proverbial noose once again.

OOOOOH, That Smell!

Like dead fish, the stench of politics is overpowering, and yet political elites tell you what you’re smelling ain't what they're cooking.

Democrats Hypocrisy Will Cost Them the Midterms!    

News stories recently have caused average Americans to stop and say, “Wait a minute…” Those stories involve Democrats and their double standards.

Why Do “Criminal” Democrats Remain at Large?    

Democrat political leaders have been reported as engaging in alleged criminal activities and yet we never see any arrests or prosecutions, why?

Hello, I’m Homeschooled

This article aims to extoll the virtues of a homeschool education from a Christian perspective; yet I respect each parent’s decision regarding the schooling of his or her child.

Microsoft Offers Buyouts, Meta Lays Off 10 Percent of Workforce

Microsoft will offer voluntary buyouts to some of its U.S. staff as the software titan adapts to the artificial intelligence (AI) climate.

Trump to Probe Banks Regarding Los Angeles Wildfire Response

President Trump said his administration will look into banks’ handling of payments and debts in the aftermath of the 2025 Los Angeles wildfires.

Trump Floats Taxpayer-Funded Takeover of Spirit Airlines, Selling for Profit

President Trump said that a taxpayer-funded takeover of Spirit Airlines could be an option, with the intention of reselling it when oil prices fall.

DOJ Ends Investigation of Fed Chair Jerome Powell

The DOJ has ended its criminal investigation of Fed Chair Jerome Powell, with Jeanine Pirro announcing on X that her office has officially closed the case.

Treasury Sanctions Iran-Linked Chinese Oil Refinery, 40 Vessels

The Treasury Department sanctioned a Chinese refinery and 40 shipping firms and vessels found to be providing a lifeline to the Iranian oil economy.

Trump Admin Begins Process to Downgrade Marijuana Classification

The Trump administration announced plans to reclassify approved marijuana products as a less dangerous drug under federal law.

Gas Prices Will Return to Low Levels After Iran Conflict Ends, Bessent Says

Treasury Sec. Scott Bessent said relatively high gas prices will not last long but any change is contingent on when the US and Iran cease hostilities.

Trump Participates in Historic Bible-Reading Marathon to Celebrate Nation’s 250th Anniversary 

President Trump read passages from the Bible on April 21 from the Oval Office at the White House as part of the “America Reads the Bible” celebration.
spot_img

Related Articles

Popular Categories

MAGA Business Central