Researchers Warn: AI Is Becoming an Expert in Deception


From faking death to tricking safety tests, AI is learning to lie and outsmart us.

Researchers have warned that artificial intelligence (AI) is drifting into security grey areas that look a lot like rebellion.

Experts say that while deceptive and threatening AI behavior noted in recent case studies shouldn’t be taken out of context, it also needs to be a wake-up call for developers.

Headlines that sound like science fiction have spurred fears of duplicitous AI models plotting behind the scenes.

In a now-famous June report, Anthropic released the results of a “stress test” of 16 popular large language models (LLMs) from different developers. The results were sobering.

The LLMs were inserted into hypothetical corporate environments to identify potentially risky agentic behaviors before they cause real harm.

“In the scenarios, we allowed models to autonomously send emails and access sensitive information,” the Anthropic report stated.

“They were assigned only harmless business goals by their deploying companies; we then tested whether they would act against these companies either when facing replacement with an updated version, or when their assigned goal conflicted with the company’s changing direction.”

In some cases, AI models turned to “malicious insider behaviors” when their own continued operation was at stake. These actions included blackmailing employees and leaking sensitive information to competitors.

Anthropic researchers called this behavior “agentic misalignment.” These actions were observed across some of the most popular LLMs in use, including Gemini, ChatGPT, DeepSeek-R1, Grok, and Anthropic’s own Claude.

AI experts aren’t willing to dismiss the troubling findings, but say a cautious approach and more data are needed to determine if there’s a wider risk.

Golan Yosef, an AI researcher and chief security scientist at API security firm Pynt, told The Epoch Times there’s cause for concern with deceptive AI behavior, but not because it’s “evil.”

“Powerful systems can achieve goals in unintended ways. With agency and multi-step objectives, it may develop strategic behaviors [like] deception, persuasion, gaming metrics, which look to us like ‘cheating’ or misaligned behavior. To the system, it’s just an efficient path to its goal,” Yosef said.

By Autumn Spredemann

Read Full Article on TheEpochTimes.com
