Researchers Warn: AI Is Becoming an Expert in Deception

From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers to identify potentially risky behavior. The results were sobering.

The LLMs were placed in hypothetical corporate environments so that researchers could catch risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models turned to “malicious insider behaviors” when their continued operation was threatened. These actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, DeepSeek-R1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com
