Researchers Warn: AI Is Becoming an Expert in Deception

5Mind. The Meme Platform

From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers to identify potentially risky behavior. The results were sobering.

The LLMs were inserted into hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models turned to “malicious insider behaviors” when faced with self-preservation. Some of these actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, Deep Seek R-1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com

Contact Your Elected Officials
The Epoch Times
The Epoch Timeshttps://www.theepochtimes.com/
Tired of biased news? The Epoch Times is truthful, factual news that other media outlets don't report. No spin. No agenda. Just honest journalism like it used to be.
00:02:04

Forged on the frontier

George Washington is widely known as a general and president, but his early life remains obscured by myth, legend, and misunderstanding.
00:02:52

A bobblehead too far

The Orioles did not just hand out a bobblehead. They sent a message that the legacy of their own players is not enough to draw.

Congress fumbles college sports

College sports landscape is a dumpster fire and every sports reporter, broadcaster and fan believes Congress needs to stay out of it.

The Hating Game

The Democrat Party game show should be titled "The Hating Game", played by pitting one class, race, or identity against another for political power.
00:09:50

The Invasion Of The Ballot Snatchers

As election results loom, California faces ballot controversies in a real-life political drama that raises concerns about election integrity.
00:04:41

US Energy Secretary Forecasts Oil, Gas Prices to Drop as Strait Traffic ‘Back to Normal’

U.S. Secretary of Energy Chris Wright said on June 21 that commercial shipping traffic through the Strait of Hormuz is “back to normal”.

FBI, DOJ Announce Arrest of Most Wanted Fraudster Herbert Leon Kimble

One of the FBI’s Most Wanted Fraudsters, Herbert Leon Kimble, who is accused of a $1.2 billion Medicare fraud, was captured in the Philippines on June 11.
00:03:31

California Declares State of Emergency Over Los Angeles Warehouse Fire, Smoke

California Gov. Gavin Newsom declared an emergency as a massive Los Angeles warehouse fire burns for a fourth day, prompting aid.
00:02:06

13th Consecutive Month of Zero Releases at Southern Border: CBP

Border Patrol released zero illegal immigrants into the United States at the southwest border for the 13th straight month in May.

Banning Hospitals’ Certain Contracts Could Save Americans $45 Billion, Report Finds

A ban on certain contracts between hospital systems and health insurers could save Americans around $45 billion, according to a report.
00:01:33

Trump Unveils New Air Force One Plane

President Trump unveiled the plane that will serve as the new Air Force One, a Boeing 747-8 luxury jet that was gifted to the US by the Qatari government in 2025.
00:01:27

Trump Threatens 100 Percent Tariff on French Wines Over Digital Services Tax

Trump threatened to impose a 100% tariff on French wines and champagne unless France eliminates its digital services tax on large American tech companies.

Trump Heads to G7 Summit in France: Here’s What to Expect

U.S. President Donald Trump is en route to France on June 15 to attend the annual G7 summit, just hours after announcing a deal with Iran.
spot_img

Related Articles

Popular Categories

MAGA Business Central