Researchers Warn: AI Is Becoming an Expert in Deception

Contact Your Elected Officials

From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers to identify potentially risky behavior. The results were sobering.

The LLMs were inserted into hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models turned to “malicious insider behaviors” when faced with self-preservation. Some of these actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, Deep Seek R-1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com

The Epoch Times
The Epoch Timeshttps://www.theepochtimes.com/
Tired of biased news? The Epoch Times is truthful, factual news that other media outlets don't report. No spin. No agenda. Just honest journalism like it used to be.

Democrats are Losing by Pushing Their Dirty CR Bill

Talk is going around about the “Democrats Dirty CR” and the “Republicans Clean CR”.

NCAA streamlines transfer portal

The NCAA lords of the Division I Administrative Committee have unveiled a fresh batch of transfer portal reforms.

Section 230 Immunity, Defective Design and Trial Lawyers (Part 2)

Congress granted social media platforms Section 230 immunity even when children are harmed, trial lawyers found a way to bypass protection.

A Tale Of Two Political Parties

While the GOP, led by Trump, has produced results, Democrats offer horrible policies and candidates, presenting a vivid contrast to the American people. 

The Case for Western Islam

The suggestion of a expression of Islam which emerges from western culture is one that comes from a place of love.

Democrats are Losing by Pushing Their Dirty CR Bill

Talk is going around about the “Democrats Dirty CR” and the “Republicans Clean CR”.

Biden Undergoing Radiation, Hormone Treatment for Prostate Cancer

Former President Biden has begun receiving a combination of radiation and hormone treatments for prostate cancer, spokesperson announced.

Homan Says DOJ Probing Funding Behind ‘Organized’ Attacks on ICE

Border czar, Homan said DOJ launched an investigation into funding for what he called “organized” attacks on federal immigration enforcement agents.

There Are No Survivors in the Blast at a Tennessee Explosives Factory, Sheriff Says

The blast in rural Tennessee that leveled an explosives plant and was felt for miles around left no survivors, authorities said Saturday

Trump Names Longtime Adviser Dan Scavino to Key Personnel Position

One of President Trump’s longtime advisers, Dan Scavino, is going to be in charge of selecting and appointing key positions within the executive branch.

First Lady’s Effort Helped Reunite 8 War-Displaced Children With Their Families

First lady Melania Trump said 8 children impacted by the fighting between Ukraine and Russia were reunited with their families on Oct. 9.

Trump to Impose New 100 Percent Tariff on China on Nov. 1

President Trump said that the US will impose an additional 100 percent tariffs on Chinese goods and export controls on critical software starting on Nov. 1.

Trump Admin Agrees to $20 Billion Rescue Plan for Argentina

The U.S. government has finalized a $20 billion economic rescue plan for Argentina, Treasury Secretary Scott Bessent announced on Oct. 9.
spot_img

Related Articles

Popular Categories

MAGA Business Central