Comment Comedy gold (Score 4, Insightful) 39
"It’s worth noting that most working professionals do a lot more than submit research reports to their boss, which is all that GDPval-v0 tests for."
"OpenAI says that it believes Claude scored so high because of its tendency to make pleasing graphics, rather than sheer performance."
So by wide range of jobs, they mean jobs consisting of only submitting research reports with pleasing graphics. And based on recent measurements, about 1/4 incorrect or entirely hallucinated.
Comment My first login in years, just to express... (Score 5, Funny) 58
Comment "external Azure customers" is the real story (Score 2) 62
The key piece from this article is:
“With the incredible demand Azure is seeing and the growth of our platform, we’ve decided to pause our planned migration of LinkedIn to allocate resources to external Azure customers,” Hiremagalur wrote in his memo.
At least a couple of ways to interpret this comment. Azure either doesn't have the capacity required to support LinkedIn or the cost of running it "on-prem" is so much better than in Azure vs paying customers it doesn't make sense...or a combination of the two.
Submission + - OpenAI announces leadership transition - Sam Altman is out (openai.com)
https://openai.com/blog/openai...
Comment Re: Elon and CJ have different timelines (Score 1) 110
The Falcon heavy was also often promised and then delivered way behind schedule.
I guess the difference is no one paid for a Falcon Heavy lift in 2011.
Comment Former customer (Score 5, Funny) 25
Microsoft Partners With Docker 104
MIT Combines Carbon Foam and Graphite Flakes For Efficient Solar Steam Generati 110
Comment PCI Compliance (Score 1) 199
Video A Conversation with Ubuntu's Jono Bacon (Video) 53
Comment Re:correlation does not prove causation (Score 1) 137
Chinese Chang'e-3 Lunar Rover On Its Way After Successful Launch 101
Unlocked Firefox OS ZTE Open Is Now Available On eBay For For $80 122