Researchers Warn: AI Is Becoming an Expert in Deception

5Mind. The Meme Platform

From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers to identify potentially risky behavior. The results were sobering.

The LLMs were inserted into hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models turned to “malicious insider behaviors” when faced with self-preservation. Some of these actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, Deep Seek R-1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com

Contact Your Elected Officials
The Epoch Times
The Epoch Timeshttps://www.theepochtimes.com/
Tired of biased news? The Epoch Times is truthful, factual news that other media outlets don't report. No spin. No agenda. Just honest journalism like it used to be.

The Federal Courts Have Become Another Political Branch

Politics has increasingly contaminated institutions once expected to stand apart from partisan struggle—including the judiciary.

“Melania” Movie Beats Negative Pre-Hype

My wife and I went to see the “Melania”...

Democrat Wins Show GOP Voters Are Not Motivated

Democrats won a special election in Texas, taking a State Senate seat. Democrat voters are motivated, while Republican voters are not.

The Great Voter Replacement: Understanding the Modern Democratic Party

The greatest threat to democracy is a population conditioned to stop asking questions, by the very people they should question the most.

ChatGPT: Vaccine Pimp Extraordinaire

A ChatGPT discussion on giving children a drug meant to prevent a disease largely spread through IV drug use and unprotected sex exposure risks posed

Former Energy Commissioner Explains Why California Electricity Rates Nearly Double National Average

Jim Boyd, former energy commissioner for California, said that State’s average utility rate is currently about 96% higher than the rest of the nation.

Police Raid Suspected Las Vegas Biolab With Possible Ties to Illegal California Lab

Authorities in Las Vegas raided a home uncovering an alleged illegal biolab possibly linked to one run by Chinese nationals in California two years ago.

US Factory Output Rises to Near 4-Year High as Manufacturing Rebounds

U.S. manufacturing showed signs of a turnaround as factory output rose and business conditions improved after months of weakness.

Producer Marc Beckman on ‘Melania,’ a Historic Film That Captures a First Lady

Senior adviser to First Lady Melania Trump explains how the film ‘Melania’ documents a process never revealed before: preparing for the inauguration.

US, India to Slash Tariffs Under New Trade Deal, Trump Says

The US and India have reached a trade agreement and will begin lowering tariffs on each other’s goods immediately, Trump announced

Trump Says US Starting to Talk With Cuba Following Cuts to Oil Deliveries

Trump says the U.S. has begun talks with Cuban leaders as it cuts off oil from Venezuela and threatens tariffs on countries selling fuel to the island.

What to Know About Kevin Warsh, Trump’s Nominee for Fed Chair

President Donald Trump selected former Federal Reserve Governor Kevin Warsh as the next head of the U.S. central bank.

Trump Nominates Colin McDonald as Head of New Fraud Division at Justice Department

President Trump announced Colin McDonald as head for the new national fraud enforcement division of the DOJ in a post on Truth Social.
spot_img

Related Articles