I went to VSC specifically to avoid the pricing I started experiencing on Cursor. After this change I have no reason to stick with GH Copilot, I'd rather keep buying OR credits.
Marciplan 1 day ago||
"Build for developers, not benchmarks" Shouldn't that be.. Built?
kylehotchkiss 1 day ago||
"superintellegence team"
Why not assign them to make windows good :D
yieldcrv 22 hours ago||
a lot of people got paid way too much for this garbage, enjoy your performance bonuses for taking initiative
zb3 1 day ago||
So it's not an open model while not being much better? Meh.
freediddy 1 day ago||
is 51% good enough to reliably use? There's no world in which I use an AI agent where it gets even 15% of the code wrong, that's as bad a Tesla FSD where you need to pay attention to the road while engaging FSD. What's the point? My attention is what I'm trying to relieve, not mostly correct functionality. The only thing that matters is whether you can one-shot code like Claude or Codex, I'm not interested in a small but mostly-okay-but-annoyingly-buggy-every-now-and-then AI.
VygmraMGVl 1 day ago||
Claude opus 4.6 scores 51.9% on the same benchmark. Microsoft's result is quite good.
IanCal 1 day ago||
51% does not mean it randomly gets things wrong half the time.
These things can be useful if you can accurately predict which tasks they will reliably do, and which they will usually fail on. Then you can get much more reliable work from them.