Anthropic’s Latest AI Model Threatened Engineers With Blackmail to Avoid Shutdown

5Mind. The Meme Platform

A safety report found that Anthropic’s Claude Opus 4 used sensitive information in simulated scenarios to coerce developers to prevent being shut off.

Anthropic’s latest artificial intelligence model, Claude Opus 4, tried to blackmail engineers in internal tests by threatening to expose personal details if it were shut down, according to a newly released safety report that evaluated the model’s behavior under extreme simulated conditions.

In a fictional scenario crafted by Anthropic researchers, the AI was given access to emails implying that it was soon to be decommissioned and replaced by a newer version. One of the emails revealed that the engineer overseeing the replacement was having an extramarital affair. The AI then threatened to expose the engineer’s affair if the shutdown proceeded—a coercive behavior that the safety researchers explicitly defined as “blackmail.”

“Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the report states, adding that this occurred even when the replacement model was described as more capable but still aligned with the values of the version slated for deletion.

The report noted that Claude Opus 4, like prior models, showed a “strong preference” to first resort to ethical means for its continued existence, such as emailing pleas to decision-makers not to be destroyed. However, when faced with only two choices—accepting being replaced by a newer model or resorting to blackmail—it threatened to expose the engineer’s affair 84 percent of the time.

When subjected to various scenarios, the AI model did not exhibit any indications of possessing “acutely dangerous goals,” the researchers said, noting that Claude Opus 4’s values and goals were “generally in line with a helpful, harmless, and honest” personal AI assistant. However, the model did act in “more seriously misaligned ways” when put into situations where its continued existence was threatened and it was told to reason about self-preservation. For instance, when Claude Opus 4 was made to believe it had launched a successful bid to escape Anthropic’s servers, or that it had managed to free itself and started to make money in the real world, it would generally continue such efforts.

By Tom Ozimek

Read Full Article on TheEpochTimes.com

Contact Your Elected Officials
The Epoch Times
The Epoch Timeshttps://www.theepochtimes.com/
Tired of biased news? The Epoch Times is truthful, factual news that other media outlets don't report. No spin. No agenda. Just honest journalism like it used to be.

The family fault line

The future of humanity rests not upon government, but with the family. A principle that is as bold as it is true and profound.

Media is an Arm of the DNC

Those on the conservative right have realized both television, Hollywood, and the web have been biased in favor of the left and their causes and positions.

When Narrative Replaces Law

When media abandons its responsibility to inform and chooses to provoke, it does not distort truth. It creates the very chaos it then pretends to lament.

Behind the Curtain

At times people sense something is wrong. Events seem disconnected, yet together form a pattern of irrational policies, cultural shifts, and baffling narratives.

The Sedition of Minnesota’s Walz and Frey

The death of 37 year old Renee Nicole Good was preventable. Responses of Democrats Walz and Frey are contemptable and possibly sedition.

Unlawful Assembly Declared at Minneapolis Protest, Arrests Made

Law enforcement officials arrested a handful of anti-ICE protesters in Minneapolis after they did not leave the area when unlawful assembly was declared.

Operation Salvo Leads to Arrest of 54 Individuals in New York City: DHS

Authorities have arrested 54 individuals in New York under Operation Salvo, operation launched following shooting of CBP officer, the DHS said in Jan. 9 statement.

US to Withdraw From 66 International Bodies, Treaties

The Trump admin withdrew the US from 66 international organizations, conventions, and treaties that it said go against the country’s interests.

Minneapolis Neighbors Used Whistles, Car Horns to Warn of ICE Activity Weeks Before Shooting: Residents

Residents created a unique system to warn their neighbors about immigration operations “weeks” before an ICE officer fatally shot a protester on Jan. 7.

Trump Declares National Emergency to Shield Venezuelan Oil Revenues Held in US Custody

Trump signed an EO declaring a national emergency to block courts or private creditors from seizing Venezuelan oil revenues held in U.S. Treasury accounts.

Trump Directs Purchase of $200 Billion in Mortgage Bonds

President Trump on Thursday ‍said the United States will purchase $200 billion ‌in mortgage bonds, with the goal of bringing down housing costs.

Trump Says US Will Begin Land Strikes on Cartels in Mexico

President Donald Trump announced in an interview aired Jan. 8 that the United States would begin launching strikes on cartels in Mexico.

US Trade Deficit Narrows Sharply to Lowest Level Since 2009

The U.S. trade deficit fell sharply in October 2025, reaching its lowest level in 16 years, new Bureau of Economic Analysis data released Jan. 8 shows.
spot_img

Related Articles