Forgot your password?
typodupeerror

Submission + - Anthropic Says 'Evil' Portrayals of AI Were Responsible For Claude's Blackmail (techcrunch.com)

An anonymous reader writes: Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic. Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Apparently Anthropic has done more work around that behavior, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.” The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

What accounts for the difference? The company said it found that training on “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.” Related, Anthropic said that it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.” “Doing both together appears to be the most effective strategy,” the company said.

Submission + - Editor At 184-Year-Old Ohio Newspaper Pushes To Let AI Draft News Articles (washingtonpost.com)

An anonymous reader writes: The Plain Dealer, Cleveland’s largest newspaper, has begun to feature a new byline. On recent articles about an ice carving festival, a medical research discovery and a roaming pack of chicken-slaying dogs, a reporter’s name is paired with the words “Advance Local Express Desk.” It means: This article was drafted by artificial intelligence. “This article was produced with assistance from AI tools and reviewed by Cleveland.com staff,” reads a note at the bottom of each robot-penned piece, differentiating it from those still written primarily by journalists. The disclosure has done little to stem the backlash that caromed across the news industry after the paper’s editor, Chris Quinn, published a Feb. 14 column lamenting that a fresh-out-of-college job applicant withdrew from a reporting fellowship when they found out the position included no writing — just filing notes to an AI writing tool.

“Artificial intelligence is not bad for newsrooms. It’s the future of them,” Quinn wrote, adding that “by removing writing from reporters’ workloads, we’ve effectively freed up an extra workday for them each week.” [...] Quinn, for his part, says his paper’s use of AI to find, draft and edit stories is a success story that others must emulate if they want to survive. “It’s a tool,” he said in a phone interview last week. “If AI can do part of our job, then why not let it — and have people do the part it can’t do?” He added that the paper’s embrace of technology — including using AI to write stories summarizing its reporters’ podcasts and its readers’ letters to the editor — is already boosting its bottom line, helping it retain staff at a time when other newspapers are shrinking or even shutting down. Just 130 miles east of Cleveland, the 240-year-old Pittsburgh Post-Gazette said in January that it will close its doors this spring.

Quinn, who has led the Plain Dealer’s newsroom since 2013, said its newsroom has shrunk from some 400 employees in the late 1990s to just 71 today. Over the past three years, Quinn has implemented a suite of AI tools with various purposes: transcribing local government meetings, scraping municipal websites for story leads, cleaning up typos in story drafts, suggesting headlines and helping reporters draft follow-ups to articles they’ve already written. He said he is particularly pleased with an AI tool that turns podcasts by the paper’s reporters into stories for the website, which he said generated more than 10 million page views last year. He has documented those efforts in letters to readers and sought their feedback. But the paper’s latest experiment — using AI to turn reporters’ notes into full story drafts — has aroused indignation online and anxiety within the paper’s ranks.

Slashdot Top Deals

A failure will not appear until a unit has passed final inspection.

Working...