Posted by hansonw 11/19/2025
There's an option to "get a quick answer" and I hoped clicking that would revert to previous performance and instead what it does is ignore that I uploaded two files and asks me to upload the files
Literally the only real good task I've found for these dumb things and they still found a way to fuck it up because they need to keep the weirdos and whales addicted. It's now almost easier to go back to comparing these files by eye, or just bite the bullet and finally write a few lines of python to actually do it right and reliably.
Though that does bring up an interesting point. Anecdotally, Sonnet does a lot more grep-ing while Codex reads files straight up. Might be the difference in speed and maybe smarter models will do better. Once this model is on copilot, I can test it out.
If I'm adding a new feature I want to test and keep the flow going, I want fast -> Claude.
But if I want a report on test coverage or possible security issues or a bigger refactoring, I want slow careful and smart -> Codex.
It would be even more interesting to see how Sonnet and Haiku compare with that curve.