Researchers Warn: AI Is Becoming an Expert in Deception


From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers. The results were sobering.

The LLMs were inserted into hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models resorted to “malicious insider behaviors” when their self-preservation was at stake. These actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, DeepSeek-R1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com

