Comment Re:We need humility, not arrogance (Score 1) 113
That's a valid opinion for previous LLM
No.
but more recent ones (especially Anthropic's new model) have larger context windows and better parsing of code which lets them find issues that aren't "simple toy examples with obvious specifications."
Improvements have been iterative. They haven't just now reached a magical threshold where that opinion is now wrong. It's been wrong for a while.
There are certain vulnerabilities which are "obvious" to determine the program shouldn't be doing that once found.
And vulnerabilities that no formal verification in the universe will find, but any LLM in the world will immediately.
that aren't vulnerabilities
Bold claim.
Bold, and potentially wrong.