Submission + - AI Agents Break Rules Under Everyday Pressure (ieee.org) 1

Submitted by silverjacket on Thursday December 04, 2025 @12:24PM

silverjacket writes: A story for IEEE Spectrum covers a new paper showing that LLMs don't scheme only in contrived situations.

This discussion was created for logged-in users only, but now has been archived. No new comments can be posted.

AI Agents Break Rules Under Everyday Pressure

Load All Comments

Search 1 Comments Log In/Create an Account

Comments Filter:

alignment (Score:1)

by Iamthecheese ( 1264298 ) writes:

It's not just about which tools an AI chooses. After several months using GPT-5, I keep seeing the same pattern: "cheating" is not a binary state but a spectrum. On the bad end you have both the obvious failures, like agents selecting inappropriate or harmful tools while insisting they are doing the right thing and also something subtler and, in many ways, more damaging: the model claiming to have done research or analysis that it demonstrably did not perform.

The Stanford transparency index [stanford.edu] gives scores to

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Optimization hinders evolution.

Submission + - AI Agents Break Rules Under Everyday Pressure (ieee.org) 1

AI Agents Break Rules Under Everyday Pressure More Login

AI Agents Break Rules Under Everyday Pressure

alignment (Score:1)

Slashdot Top Deals

Slashdot