Comment Re:Claude rules (Score 1) 47
I have not found this to be true at all, especially once you factor in cost and speed. Claude Opus 4.6 is absolutely one of the best for large, complex tasks involving many agents and files, for now. If I am refactoring or reviewing code, Codex is significantly better on cost, speed, and accuracy. Gemini 3 Flash is great for quick, smaller, more directed sections of code at very low cost.
I hate these tools that lock you into a single model. I use a tool set that lets me pick and choose based on what I am doing. Heck, use Codex to generate your large plan and Claude Opus to execute it, and you get great results if you are doing something very large. I honestly don't think there will be one model to rule them all, and I hope that soon a true multi-model tool will come out that automates picking the appropriate model (cross-vendor, not just between Anthropic models like Claude Code does) and brings some sanity back to all of this.
The other providers are not standing still at all, and there will constantly be a new 'best'. The differences between them will keep shrinking, and hopefully better and better SLMs will come out that we can host ourselves on more reasonable machines for targeted tasks.