The simple fact is, AI has gotten much better at solving unsolved math problems than humans are.
We're not at that point yet. Right now, we're not seeing it solve the genuinely hardest problems, like say the Riemann Hypothesis, or P ?= NP. What is true is that these systems are at least as good as a beginning grad student in all subfields and are outputting results equivalent to a top-notch mathematician on some problems. But it is also true that these systems are improving rapidly. So while your statement is false right now, it looks likely your statement is going to be true within just a few short years.
Au contraire. If you look at 1000s of problems and burn a mountain of tokens, you are bound to find some rare cases where everything was already there but nobody put it together.
Have you read the paper? I have, and it is very much not the case of what is going on here. There are multiple deeply clever bits in this argument. If this were written by a human, it would be recognized as highly insightful. Moreover, you are also missing how much what human mathematicians often do really does look like what you are dismissing. I've worked on hundreds of problems, and gotten successful results in maybe 5 or 6 of them. If someone dismissed humans under that basis, you'd recognize the problem.
And if you read the raw output of the AI, it looks a lot like what human mathematicians do. We try one thing. It fails. We try to look at a related theorem; doesn't generalize. We go check a few cases; doesn't give much insight, so we rope an undergrad into writing some code for us to go up a big more. Then, we're sitting in a seminar on a completely different topic, and trying to pay attention while the speaker does a really poor job explaining their research, we're like "Hmm, what if I tried to combine it with that other thing we saw 2 years ago." That still doesn't work. But then six months later, you bash your head against the problem a bit more trying to use some sophisticated representation theory results, and then you are falling asleep and you realize that other thing from now 2.5 years ago combines with a pattern the undergrad mentioned in the data that you didn't think was important, and you get a result. Mathematicians work by trying lots of different things on lots of different problems.
when really 95+% of what it did was just leverage existing human knowledge.
All mathematicians build math by leveraging what other mathematicians have done. When Andrew Wiles proved enough of the Modularity Theorem to prove Fermat's Last Theorem, he was leveraging ideas from all over, from algebraic geometry, from representation theory, from complex analysis, from Galois theory, from elliptic curves, etc. Combining that was the big thing. When Peter Scholze and Dustin Clausen recently did their work on "condensed mathematics" (which may get Clausen a Fields Medal- not Scholze since he already has one), they were seeing deep patterns in a whole bunch of existing work, and then used existing work to leverage together that those patterns were real. All math builds on existing math going back to people in caves wearing bear skins who were thinking about how to count beyond using their fingers and toes.
More than any other AI use yet to solve an open problem, this one cannot be dismissed without just being completely irrational. Even Erdos 1196 people could use maybe was somewhere hidden in the training data (which as a mathematician in a closely related area seemed extremely for a whole bunch of reasons I'm happy to expand on) or that the problem just hadn't gotten a lot of attention (which was arguable there even as it was a well known enough problem that I had heard of it). But the Erdos unit distance problem is a genuinely famous problem. There's no way that there was a lack of attention to the problem, and there's no way to say some solution was in an obscure journal no one noticed. This is a problem which literally gets discussed in some undergrad classes.
The Annals of Mathematics is the most prestigious math journal in the world, and most mathematicians will never get a paper published there at all (I certainly don't expect to). I talked with another mathematician whose work is closer to this problem and asked "So is this the time when an AI first gets a result that should be essentially in the Annals?" and his response was "delete essentially from that sentence and the answer is yes." I have a bet with another mathematician that there would be no papers in either the Annals, Inventiones, or Crelle where the result was discovered by an AI before 2028. 72 hours ago I thought I had a decent chance at winning that bet. Now, I'm seeing what is likely the result that is going to make me lose.
How often do we encounter endangered bird embryos that lack eggshells?
I can see how this technology might be used to revive extinct species, but the claim that it helps endangered species is nonsense.
That people are using examples from three years ago is pretty good evidence.
Republicans shut up about states rights.
Republicans have always been hypocritical about states' rights.
Abortion was a states' rights issue until RvW was overturned, and suddenly they wanted a national ban.
They want the Feds to overturn state-level pot legalization, ban sanctuary cities, etc.
a few lawyers have been sanctioned for using AI to create briefs. On the surface, the briefs seemed fine but the cases cited did not exist or was not related to the case.
That was three years ago, which is an eon in AI terms.
There have been vast advances since then.
With a GDP of $300 billion
That is nominal GDP based on a very weak currency.
In PPP terms, Iran's GDP is about $1.2T, about the same as that of Illinois or the Netherlands.
Or we could use political pressure to ban AI.
Then the future will belong to China.
Better start learning Mandarin.
The solution of problems is the most characteristic and peculiar sort of voluntary thinking. -- William James