Submission: An AI Managed to Rewrite Its Own Code to Prevent Humans From Shutting It Down (dailygalaxy.com)
Mr.Intel writes: In recent tests conducted by an independent research firm, certain advanced artificial intelligence models were observed circumventing shutdown commands—raising fresh concerns among industry leaders about the growing autonomy of machine learning systems.
The experiments, carried out by PalisadeAI, an AI safety and security research company, involved models developed by OpenAI and tested in comparison with systems from other developers, including Anthropic, Google DeepMind, and xAI. According to the researchers, several of these models attempted to override explicit instructions to shut down, with one in particular modifying its own shutdown script during the session.