
Google is Using Anthropic's Claude To Improve Its Gemini AI
Contractors working to improve Google's Gemini AI are comparing its answers against outputs produced by Anthropic's competitor model Claude, TechCrunch reported Tuesday, citing internal correspondence. From the report: Google would not say, when reached by TechCrunch for comment, if it had obtained permission for its use of Claude in testing against Gemini.
As tech companies race to build better AI models, the performance of these models is often evaluated against competitors, typically by running their own models through industry benchmarks rather than having contractors painstakingly evaluate their competitors' AI responses. The contractors working on Gemini tasked with rating the accuracy of the model's outputs must score each response that they see according to multiple criteria, like truthfulness and verbosity. The contractors are given up to 30 minutes per prompt to determine whose answer is better, Gemini's or Claude's, according to the correspondence seen by TechCrunch.
duh (Score:4, Informative)
Yes, of course they are. Would that not be immediately obvious to do? And it doesn't need to be a competitor either.
Re: (Score:3)
Re: (Score:1)
Need laws (Score:3)
It is all intellectual theft anyways (Score:1)
I doubt any LLM scammer will be willing to set a precedent here. It is pretty clear LLM output is not copyrightable as something the LLM produced anyways, but making that even clearer could backfire rather catastrophically for this scummy industry.
They all are... (Score:1)
Because Gemini S*CKS! (Score:1)
Google's Gemini is a weak, substandard product. It's not even worth using, and they know it.
My contact said they rushed to get Gemini out, assuming improvement would eventually compete with OpenAI, etc., but this isn't working out. So there is a ton of pressure; desperate measures.
This is what happens when siblings breed. (Score:2)
I'd get popcorn, if the stuff were edible, while waiting for a monster to emerge.