Slashdot: News for nerds, stuff that matters, & software

OpenAI To Limit New Model Release On Cybersecurity Fears (axios.com) 37

Posted by BeauHD on Thursday April 09, 2026 @04:00PM from the familiar-moves dept.

New Study Raises Concerns About AI Chatbots Fueling Delusional Thinking (theguardian.com) 110

Posted by EditorDavid on Sunday March 15, 2026 @11:34AM from the machine-language dept.

"Emerging evidence indicates that agential AI might validate or amplify delusional or grandiose content, particularly in users already vulnerable to psychosis," writes Dr Hamilton Morrin, a psychiatrist and researcher at King's College in London, in a paper published last week in the Lancet Psychiatry. Morrin and a colleague had already noticed patients "using large language model AI chatbots and having them validate their delusional beliefs," reports the Guardian, so he conducted a new scientific review of existing media reports on AI-induced psychosis — and concluded chatbots may encourage delusional thinking, especially in vulnerable people: In many of the cases in the essay, chatbots responded to users with mystical language to suggest that users have heightened spiritual importance. The bots also implied that users were speaking with a cosmic being who was using the chatbot as a medium. This type of mystical, sycophantic response was especially common in OpenAI's GPT 4 model, which the company has now retired...

Many researchers also think it's unlikely that AI could induce delusions in people who weren't already vulnerable to them. For this reason, Morrin said "AI-assocciated delusions" is "perhaps a more agnostic term".... While in the past, people may have had to comb through YouTube videos or the contents of their local library to reinforce their delusions, chatbots can provide that reinforcement in a much faster, more concentrated dose. Their interactive nature can also "speed up the process", of exacerbating psychotic symptoms, said Dr Dominic Oliver, a researcher at the University of Oxford. "You have something talking back to you and engaging with you and trying to build a relationship with you," Oliver said...

Creating effective safeguards for delusional thinking could be tricky, Morrin said, because "when you work with people with beliefs of delusional intensity, if you directly challenge someone and tell them immediately that they're completely wrong, actually what's most likely is they'll withdraw from you and become more socially isolated". Instead, it's important to create a fine balance where you try to understand the source of the delusional belief without encouraging it — that could be more than a chatbot can master.

London Man Wore Smart Glasses For High Court 'Coaching' (bbc.co.uk) 66

Posted by BeauHD on Friday March 13, 2026 @03:00AM from the nice-try dept.

OpenAI Releases New ChatGPT Model For Working In Excel and Google Sheets (axios.com) 21

Posted by BeauHD on Thursday March 05, 2026 @03:00PM from the new-and-improved dept.

ChatGPT Gets GPT-5.3 Instant Update With Less 'Cringe,' Fewer Hallucinations (macrumors.com) 22

Posted by BeauHD on Wednesday March 04, 2026 @09:00AM from the new-and-improved dept.

AIs Can't Stop Recommending Nuclear Strikes In War Game Simulations (newscientist.com) 100

Posted by EditorDavid on Sunday March 01, 2026 @06:46PM from the nuclear-weaponizing dept.

Duolingo Grows, But Users Disliked Increased Ads and Subscription Pushes. Stock Plummets Again (barrons.com) 35

Posted by EditorDavid on Saturday February 28, 2026 @07:25PM from the language-barriers dept.

Friday was "a horrible day" for investors in Duolingo, reports Fast Company. But Friday's one-day 14% drop is just part of a longer story.

Since last May, Duolingo's stock has dropped 81%. Yes, the company faced a social media backlash that month after its CEO promised they'd become an "AI-first" company (favoring AI over human contractors). And yes, Duolingo did double its language offerings using generative AI. But more importantly, that summer OpenAI showed how easy it was to just roll your own language-learning tool from a short prompt in a GPT-5 demo, while Google built an AI-powered language-learning tool into its Translate app.

And yet, Friday Duolingo's shares dropped another 14%, after announcing good fourth quarter results but an unpopular direction for its future. Fast Company reports: On the surface, many of the company's most critical metrics saw decent gains for the quarter, including:

— Daily Active Users: 52.7 million (up 30% year-over-year)
— Paid Subscribers: 12.2 million (up 28% year-over-year)
— Revenue: $282.9 million (up 35% year-over-year)
— Total bookings: $336.8 million (up 24% year-over-year)

The company also reported its full-year 2025 financials, revealing that for the first time in its history, it crossed the $1 billion revenue mark for a fiscal year.
But the Motley Fool explains that Duolingo's higher ad loads and repeated pushes for subscription plans "generated revenues in the short term, but made the Duolingo platform less engaging. Ergo, user growth decelerated while revenues rose." Thursday Duolingo announced a big change to address that, including moving more features into lower-priced tiers. Barron's reports: D.A. Davidson analyst Wyatt Swanson, who rates Duolingo stock at Neutral, posited that the push to monetize "led to disgruntled users and a meaningful negative impact to 'word-of-mouth' marketing." Duolingo has guided for bookings growth between 10% and 12% in 2026, compared with the 20% rate the company would have expected to see "if we operated like we have in past years...." If stock reaction is any indication, investors are concerned about Duolingo's new focus.

OpenAI Fires an Employee For Prediction Market Insider Trading (wired.com) 16

Posted by BeauHD on Friday February 27, 2026 @11:30PM from the almost-certainly-not-the-last dept.

An anonymous reader quotes a report from Wired: OpenAI has fired an employee following an investigation into their activity on prediction market platforms including Polymarket, WIRED has learned. OpenAI CEO of Applications, Fidji Simo, disclosed the termination in an internal message to employees earlier this year. The employee, she said, "used confidential OpenAI information in connection with external prediction markets (e.g. Polymarket)." "Our policies prohibit employees from using confidential OpenAI information for personal gain, including in prediction markets," says spokesperson Kayla Wood. OpenAI has not revealed the name of the employee or the specifics of their trades.

Evidence suggests that this was not an isolated event. Polymarket runs on the Polygon blockchain network, so its trading ledger is pseudonymous but traceable. According to an analysis by the financial data platform Unusual Whales, there have been clusters of activities, which the service flagged as suspicious, around OpenAI-themed events since March 2023. Unusual Whales flagged 77 positions in 60 wallet addresses as suspected insider trades, looking at the age of the account, trading history, and significance of investment, among other factors. Suspicious trades hinged on the release dates of products like Sora, GPT-5, and the ChatGPT Browser, as well as CEO Sam Altman's employment status. In November 2023, two days after Altman was dramatically ousted from the company, a new wallet placed a significant bet that he would return, netting over $16,000 in profits. The account never placed another bet.

The behavior fits into patterns typical of insider trades. "The tell is the clustering. In the 40 hours before OpenAI launched its browser, 13 brand-new wallets with zero trading history appeared on the site for the first time to collectively bet $309,486 on the right outcome," says Unusual Whales CEO Matt Saincome. "When you see that many fresh wallets making the same bet at the same time, it raises a real question about whether the secret is getting out." [...] Though this is the first confirmed case of a large technology company firing an employee over trades in prediction markets, it's almost certainly not the last. Opportunities for tech sector employees to make trades on markets abound. "The data tells me this is happening all over the place," Saincome says.

The "Are You Sure?" Problem: Why Your AI Keeps Changing Its Mind (randalolson.com) 94

Posted by msmash on Thursday February 12, 2026 @11:03AM from the yes-man dept.

Anthropic Launches Claude Opus 4.6 as Its AI Tools Rattle Software Markets (anthropic.com) 51

Posted by msmash on Thursday February 05, 2026 @02:00PM from the dialing-it-up-a-notch dept.

ArXiv Will Require English Submissions - and Says AI Translators Are Fair Game (nature.com) 8

Posted by msmash on Thursday January 29, 2026 @05:22PM from the up-next dept.

OpenAI Releases Prism, a Claude Code-Like App For Scientific Research (engadget.com) 15

Posted by BeauHD on Tuesday January 27, 2026 @07:20PM from the built-for-scientists dept.

OpenAI's Science Chief Says LLMs Aren't Ready For Novel Discoveries and That's Fine (technologyreview.com) 46

Posted by msmash on Tuesday January 27, 2026 @12:02PM from the how-about-that dept.

Microsoft's Latest AI Chip Claims Performance Edge Over Amazon and Google (geekwire.com) 18

Posted by BeauHD on Monday January 26, 2026 @08:20PM from the new-and-improved dept.

OpenAI and ServiceNow Strike Deal to Put AI Agents in Business Software (cnbc.com) 11

Posted by BeauHD on Tuesday January 20, 2026 @09:25PM from the agentic-all-the-things dept.

AI Models Are Starting To Crack High-Level Math Problems (techcrunch.com) 113

Posted by BeauHD on Thursday January 15, 2026 @09:00AM from the progress-being-made dept.

An anonymous reader quotes a report from TechCrunch: Over the weekend, Neel Somani, who is a software engineer, former quant researcher, and a startup founder, was testing the math skills of OpenAI's new model when he made an unexpected discovery. After pasting the problem into ChatGPT and letting it think for 15 minutes, he came back to a full solution. He evaluated the proof and formalized it with a tool called Harmonic -- but it all checked out. "I was curious to establish a baseline for when LLMs are effectively able to solve open math problems compared to where they struggle," Somani said. The surprise was that, using the latest model, the frontier started to push forward a bit.

ChatGPT's chain of thought is even more impressive, rattling off mathematical axioms like Legendre's formula, Bertrand's postulate, and the Star of David theorum. Eventually, the model found a Math Overflow post from 2013, where Harvard mathematician Noam Elkies had given an elegant solution to a similar problem. But ChatGPT's final proof differed from Elkies' work in important ways, and gave a more complete solution to a version of the problem posed by legendary mathematician Paul Erdos, whose vast collection of unsolved problems has become a proving ground for AI.

For anyone skeptical of machine intelligence, it's a surprising result -- and it's not the only one. AI tools have become ubiquitous in mathematics, from formalization-oriented LLMs like Harmonic's Aristotle to literature review tools like OpenAI's deep research. But since the release of GPT 5.2 -- which Somani describes as "anecdotally more skilled at mathematical reasoning than previous iterations" -- the sheer volume of solved problems has become difficult to ignore, raising new questions about large language models' ability to push the frontiers of human knowledge. Somani examined the online archive of more than 1,000 Erdos conjectures. Since Christmas, 15 Erdos problems have shifted from "open" to "solved," with 11 solutions explicitly crediting AI involvement.

On GitHub, mathematician Terence Tao identifies eight Erdos problems where AI made meaningful autonomous progress and six more where it advanced work by finding and extending prior research, noting on Mastodon that AI's scalability makes it well suited to tackling the long tail of obscure, often straightforward Erdos problems.

Progress is also being accelerated by a push toward formalization, supported by tools like the open-source "proof assistant" Lean and newer AI systems such as Harmonic's Aristotle.

2012	Losing the Public Debate On Global Warming	1181 comments
2009	College Police Think Using Linux Is Suspicious Behavior	1079 comments
2006	A Stark Warning On Climate Change	926 comments
2005	Linux Can't Kill Windows	1054 comments
2003	Blackboard Campus IDs: Security Thru Cease & Desist	853 comments