Claude Code is suddenly everywhere inside Microsoft

Posted by Anon84 4 days ago

Claude Code is suddenly everywhere inside Microsoft(www.theverge.com)

406 points | 522 comments

kemotep 4 days ago|

Microsoft really needs to get a better handle with the naming conventions.

There is Microsoft Copilot, which replaced Bing Chat, Cortana and uses OpenAI’s GPT-4 and 5 models.

There is Github Copilot, the coding autocomplete tool.

There is Microsoft 365 Copilot, what they now call Office with built in GenAI stuff.

There is also a Copilot cli that lets you use whatever agent/model backend you want too?

Everything is Copilot. Laptops sell with Copilot buttons now.

It is not immediately clear what version of Copilot someone is talking about. 99% of my experience is with the Office and it 100% fails to do the thing it was advertised to do 2 years ago when work initially got the subscription. Point it a SharePoint/OneDrive location, a handful of excel spreadsheets and pdfs/word docs and tell it to make a PowerPoint presentation based on that information.

It cannot do this. It will spit out nonsense. You have to hold it by the hand tell it everything to do step by step to the point that making the PowerPoint presentation yourself is significantly faster because you don’t have to type out a bunch of prompts and edit it’s garbage output.

And now it’s clear they aren’t even dogfooding their own LLM products so why should anyone pay for Copilot?

marssaxman 4 days ago||

> Microsoft really needs to get a better handle with the naming conventions.

They really won't, though; Microsoft just does this kind of thing, over and over and over. Before everything was named "365", it was all "One", before that it was "Live"... 20 years ago, everything was called ".NET" whether it had anything to do with the Internet or not. Back in the '90s they went crazy for a while calling everything "Active".

hightrix 4 days ago|||

To further your argument, look at the XBOX. It is impossible to tell which is the latest model by name alone. Where the playstation is simple, the latest is the 5, the previous was the 4, and the one before that was the 3.

joegibbs 4 days ago|||

The Xbox One.

"Oh you mean the original one?"

No the one that came after the 360.

"The third one?"

No that was the second one, the One was the third.

"OK what are they on now?"

The Series series.

"The Series series?"

Yeah the X and S. Don't confuse that with the Xbox One X or S, or the 360 S.

"Right but what's the difference?"

The X is better than the S because X is a bigger letter. But they run the same games, but they're different. They're the same though.

ted_bunny 4 days ago|||

Nailed the "Who's on first?" vibe.

chocochunks 3 days ago|||

Oh and your Xbox Controller S won't work with your Xbox One S but it will work with your original Xbox.

coffeebeqn 4 days ago||||

Oh no I just realized the next generation will be called Microsoft 365 Xbox Copilot

pipes 4 days ago|||

ROG Ally X vs. ROG Xbox Ally X.

Also, it is possibly the worst console name of all time.

I don't even know what Xbox is now, is it a service, is it a console, I'm not even joking really.

Also visual studio code Vs full fat visual studio. Thanks Microsoft you just made it more difficult to web search both products.

Full fat .Net Vs dotnet core Vs standard or is that .net.

lovich 4 days ago||

There was also Visual Studio, the Mac app that got renamed and wasn’t the same as Visual Studio, the windows app

Sharlin 4 days ago||||

There will be an integrated voice LLM that hallucinates advice and "useful" tips as you play and cannot be turned off

CubityFirst 4 days ago|||

You jest.

https://www.xbox.com/en-US/gaming-copilot

Sharlin 4 days ago||

Sigh… Irony truly is dead.

dahcryn 3 days ago|||

"All you have to do is follow the damn train, user"

debugnik 4 days ago||||

They'll make cheating bots a first party feature just to sell Copilot.

banku_brougham 4 days ago|||

Microsoft 365 Xbox X Copilot S

jnaina 4 days ago|||

It's over 19 years old, but this video is a brutal but hilarious commentary on Microsoft's inherent dysfunction when it comes to product naming and packaging. Still on point decades later.

https://www.youtube.com/watch?v=EUXnJraKM3k

semi-extrinsic 4 days ago|||

It will come with 6 GB of RAM, unless you get the Microsoft 365 Xbox X Copilot Dynamics Pro with 13 GB RAM.

pezezin 4 days ago||||

To be fair, only Sony follows a consistent naming convention. Nintendo's console names also defy any logic, as did Sega back in the day.

OkayPhysicist 4 days ago|||

Nintendo's strategy isn't the absolute worst. They mostly just give new names to new console designs, with modifiers to specify next-gen-without-major-changes. So the SNES was a next-gen NES, the N64 was its own thing, the GameCube was its own thing, the Gameboy, Gameboy Color, and Gameboy Advanced were iterations on the same thing, DS, DSi, 3DS were all generation steps. WiiU was a next-gen Wii, Switch 2 is a next-gen Switch.

They probably should have called the WiiU the Super Wii or Wii 2 or something, but on the whole they've got a mostly coherent naming convention.

somat 4 days ago|||

I would put the would put the wii firmly in the gamecube family line. it's a uprated gamecube with a weird controller.

    nes:snes = 6502
    n64 = mips
    gamecube:wii:wiiu = powerpc
    switch:switch2 = arm

pezezin 3 days ago||

Yes, the Wii is essentially an overclocked GameCube with a bit more RAM and as you mentioned, a weird controller.

davidron 1 day ago||

As a regular human who plays games and doesn't know about chip architectures, one woud probably lump the wii and the switch closer together than the game cube based on the modes in which one can interact with the systems.

Wii is a game cube with a funny controller. Or, wii is a tv-only olde switch.

I appreciate that it has its own name due to being a transitional experience.

Macha 4 days ago|||

I don’t think Nintendo’s scheme was ever that great as it blurred the difference between variant form factors (Game Boy Pocket vs Game Boy, Game Boy Micro/SP vs Game Boy Advance, DS Lite, 2/3DS XL, Wii Mini), pro models with limited exclusives (Game Boy Color, DSi, New 2/3DS), and full on new generations (Game Boy Advance, 3DS, Wii U).

elzbardico 4 days ago||||

In terms of naming, no other entity in computing will ever be able to surpass IBM solipsistic naming habits:

System 360 OS/2 DB 2 MQ series. PC

It is like IBM just refused to entertain the idea of having competitors, why should it them name a database by any other name than DB?

adityeah 4 days ago||

That is exactly what IBM thought too when they allowed Bill Gates to license the new OS they were supposed to be making for IBM. They had no competition, who are these kids going to sell their OS to?

Fire-Dragon-DoL 3 days ago||||

You are forgetting Valve!

We got the Steam Controller and the new... Steam Controller.

We also got the Steam machine, as well as the new Steam machine.

Lol

longnighthn 3 days ago|||

Sony's naming style for its mobile phone series is also terrible.

binsquare 4 days ago||||

Some musings from someone who has not worked in microsoft but has in big tech.

This often happens because the people inside are incentivized to build their own empire.

If someone comes and wants to get promoted/become an exec, there's a ceiling if they work under the an existing umberlla + dealing the politics of introducing a feature which requires dealing with an existing org.

So they build something new. And the next person does the same. And so you have 365, One, Live, .Net, etc

josephg 4 days ago||

Google Plus was the same. Lots of unrelated google products were temporarily branded as part of google plus for some reason, including your google account and google hangouts (meet).

atombender 4 days ago||

That was a very intentional strategy. In hindsight, not a good one, of course, but Plus and its integration across the whole company was blessed by Page and Brin, who were quietly panicking that Facebook could eat Google's lunch by becoming the "start page of the Internet" the moment they integrated search. Which they never did and never appear to have wanted.

canucker2016 4 days ago||||

The Dev Tools division had Quick- prefix for some tools before settling on Visual- once VB took off.

Then there's DirectX and its subs - though Direct3D had more room for expanded feature set compared to DXSound or DXInput so now they're up to D3D v12.

arwhatever 4 days ago||||

Do your teams use the teams feature in Teams?

MengerSponge 10 hours ago||||

"The only problem with Microsoft is they just have no taste. They have absolutely no taste. And I don't mean that in a small way, I mean that in a big way, in the sense that they don't think of original ideas, and they don't bring much culture into their products." -Steve Jobs

moomin 4 days ago||||

There’s got to be solid reasons why they do this and have done so for so damn long. At the very least institutional reasons. At best, actual research that suggests they make more money this way. But as a consumer, I hate it.

estimator7292 4 days ago|||

Marketing has too much power. They get some hairbrained scheme to goose the numbers and just slam a mandate all the way down the org. Is "Copilot" not getting enough clicks? Make every button say "copilot", problem solved. Marketing doesn't know or care what was there before, someone needs numbers up to get their promotion.

phkahler 4 days ago|||

>> Is "Copilot" not getting enough clicks? Make every button say "copilot", problem solved. Marketing doesn't know or care what was there before, someone needs numbers up to get their promotion.

So Microsoft isn't bringing copilot to all these applications? It's just bringing a copilot label to them? So glad I don't use this garbage at home.

Sharlin 4 days ago||

Yes and no, because "Copilot" isn't any single thing, but can mean whatever they want it to mean in different contexts.

fluidcruft 4 days ago|||

It's marketing but I think they want everything to seem like an integrated platform so they can sell you on creeping into bundles.

Nevermark 4 days ago||||

Perhaps the "consistent" naming helps them shove more through the Enterprise door.

If a large company has bought into "Co-Pilot", they want it all right? Or not, but let's not make carving anything out easy.

Just a thought.

3acctforcom 4 days ago||

This is actually one of their smart decisions. "Copilot" is currently going through the corporate regulators, who know nothing about technology, but I can't buy it until they say everything is Legal.

So once we have signoff then my counterpart in Sharepoint/M365 land gets his "Copilot" for Office, while my reporting and analytics group gets "Copilot" for Power BI, while my coding team gets "Copilot" for llm assisted development in GitHub.

In the meantime everybody just plugs everything into ChatGPT and everybody pretends it isn't happening. It's not unlawful if they lawyers can't see it!

Nevermark 4 days ago||

Thanks for validating my intuition!

> In the meantime everybody just plugs everything into ChatGPT

I believe you meant "everyone plugs everything into ChatGPT for Co-Pilot"! A statement with its own useful ambiguities.

It is comical, but I can now make a serious addition to Sun Tzu's maxims.

“All warfare is based on deception.”

“To subdue the enemy without fighting is the acme of skill.”

"Approval is best co-opted with a polysemous brand envelope."

cornonthecobra 4 days ago|||

It's because Microsoft isn't a software company. They're a marketing company that happens to make software and a few other bits.

We're now on the back end of that, where Microsoft must again make products with independent substance, but are instead drowning in their own infrastructural muck.

wasmainiac 4 days ago|||

NT..? Good comment made me laugh.

pixl97 4 days ago|||

>Microsoft really needs to get a better handle with the naming conventions

Microsoft cannot and will not ever get better at naming things. It is said the universe will split open and and eldritch beast will consume the stars the day Microsoft stops using inconsistent and overlapping names for different and conflicting products.

Isn't that right .Net/dotnet

HPsquared 4 days ago|||

"Microsoft Re-Designs the iPod Packaging" (2006)

https://www.youtube.com/watch?v=EUXnJraKM3k

christophilus 4 days ago|||

I’d forgotten all about this gem. I think it was made by some Microsoft employees, too, which makes it even funnier to me.

butlike 4 days ago||||

That was deeply funny. I can almost smell the inside of CompUSA watching that.

imglorp 4 days ago|||

Nineteen years ago. Nothing has changed.

anonymars 4 days ago||||

Many will never know the joy of trying to search for it back in the days when punctuation was ignored (C# says hello too)

estimator7292 4 days ago||

Try going on LinkedIn and searching for C# and .net jobs.

Completely impossible. The search is bad to begin with, but it explicitly ignores anything that isn't a-9.

anonymars 4 days ago||

Ha, I stand corrected. Maybe Microsoft could reach out to the owners of LinkedIn to convince them to improve it. Oh, wait...

stackghost 4 days ago||

>LinkedIn

You mean Microsoft Career Copilot 365?

vlowther 4 days ago|||

Microsoft Active Career Copilot 365 ONE, thankyouverymuch.

josephg 4 days ago||

Grow your .NET-work!

semi-extrinsic 4 days ago||||

No, I think that's Microsoft Viva Career.

anonymars 4 days ago|||

Shh, they'll hear you!

ksec 4 days ago||||

Exactly. In the 50 years history of Microsoft, Office ( Year ) was perhaps the best they did.

Nadella might have fixed a few things, but Microsoft still have massive room for improvement in many areas.

adventured 4 days ago|||

Nadella has the golden ship taking on water right now. He has entirely botched AI top to bottom. He has screwed that up to such a degree that it would be difficult to overstate. If he doesn't correct these mistakes extremely soon, he'll unravel much of the progress he made for Microsoft and they'll miss this generation of advancement (which will be the end of their $3 trillion market cap - as the market has recently perked up to).

There is no tech giant that is more vulnerable than Microsoft is at this moment.

Most document originations will begin out of or adjacent to of LLM sessions in the near future, as everything will blur in terms of collaborating with AI agents. Microsoft has no footing (or worse, their position is terrible courtesy of copilot) and is vulnerable to death by inflection point. Windows 11 is garbage and Google + Linux may finally be coming for their desktop (no different than what AMD has managed in unwinding the former Intel monopoly in PCs).

Someone should be charging at them with a new take on Office, right now. This is where you slice them in half. Take down Office and take down Windows. They're so stupid at present that they've opened the gates to Office being destroyed, which has been their moat for 30 years.

wing-_-nuts 4 days ago|||

I am no big fan of MS, and especially not a fan of W11, but you're operating under the false assumption that their users are still their most important customers.

MS's bottom line doesn't depend on how happy users are with W11, especially not power users like ourselves. W11 is just a means of selling subscriptions (office, ai, etc). The question isn't 'are users happy' it's 'will OEMs and business continue to push it?'. The answer to that is almost certainly yes. OEMs aren't going to be selling most pcs with ubuntu included any time soon. Businesses are not going to support libreoffice when MS office is the established standard.

Maybe apple could make inroads here, but they don't seem willing to give up their profit margins on overpriced hardware, and I don't think I've ever seen them release anything 'office' related that was anywhere near feature parity with MSO, and especially not cross platform.

shawnz 4 days ago||

If their whole business is based around being an established standard and making users happy is not a relevant goal, then why do anything at all? They already are an established standard, so why would they bother taking any further actions whatsoever, making any changes or rolling out any new products? Clearly they are trying to achieve something, right? So what is it?

bluGill 4 days ago|||

It is about making specific high value users happy. If the rest of us are unhappy - we don't matter. They know for most people ubuntu or whatever isn't a realistic option and so they can take whatever money they can get from those people. Sure a few people like me will run *BSD or linux, but we are a footnote not worth their time.

The only danger is every once in a while one of those little footnotes becomes large enough to be a problem and you lose the market of those who do matter as well. While there are many obvious examples of where that happened, there are also a lot of cases where it didn't.

2snakes 4 days ago|||

It used to be empowering everyone to achieve more.

JoBrad 3 days ago||||

Agree that someone should double down on competing with Office, but that alone won’t take them down. MS has spent decades listening to what SMB to large companies need, and has layers of absolute domination to bolster their lead. As an example, they have a stranglehold on catering to the Governance, Risk, and Compliance market. The most die-hard Linux folks I know turn to AD the minute they need to manage users and devices at scale. Need visibility into what is happening both remotely and locally, across your enterprise? MS has the typing and a smart sales deck that explains how you really just need another comparatively small investment to make your board sleep better at night. And that’s not even going into the license shenanigans they play to make Azure competitive against other clouds, for hosting MS-owned tools like Windows and SQL Server.

canucker2016 4 days ago||||

Excel is the lynchpin. But you need to have a story for handling the other Office apps functionality. That's table stakes these days.

rayiner 4 days ago|||

Did they put the Teams people in charge of AI?

HPsquared 4 days ago|||

You mean Microsoft® Office™

Paradigma11 4 days ago||||

My peak experience so far was trying to search if there was an extension of dotnet interactive for visual studio or only for visual studio code.

simplyinfinity 4 days ago||

the interactive console is built into Visual Studio, no extension needed

Paradigma11 3 days ago||

https://github.com/dotnet/interactive

DrTung 4 days ago||||

I've heard the next version will be called "Visual Active NET Copilot".

anal_reactor 4 days ago||||

I'm "I don't know what Xbox is" years old.

neogodless 4 days ago|||

It's a music app. I thought that much was obvious.

Tempest1981 4 days ago||

Can I pair it with my Zune?

throw20251220 4 days ago|||

yes, directly through the Windows Phone using a Silverlight 1.0-enabled appliance

estimator7292 4 days ago|||

Seeing the name Silverlight in the wild did untold psychic damage. Excuse me while I crumble to dust.

akiselev 4 days ago|||

I think it’s an Adobe Flex app now.

neogodless 4 days ago|||

Pretty sure you can just ask Cortana to pair Xbox Music with your Zune.

anonymars 4 days ago||||

Do you mean Xbox One? Or Xbox One X? Or Xbox Series X? Or maybe Xbox Series S?

Seriously, how?

lazzurs 4 days ago|||

About a year ago I had to buy a new Xbox. It took me time to figure out what model I had and what the new models are. It’s the least intuitive marketing on the market.

anal_reactor 4 days ago||||

ROG Xbox Ally X.

But I actually had in mind the Windows app named "Xbox".

phkahler 4 days ago|||

Don't forget Xbox 360, which precluded everything 365.

josephg 4 days ago||

Well, it is a smaller number. I can't wait for the Xbox 370, just to one-up office.

commakozzi 3 days ago||

I disagree. They need an Xbox 180... get it?

gosub100 4 days ago|||

It's the successor to IXBox

i80and 4 days ago||||

I remember when everything was "Sign in with .NET Passport" as a yoot and just being like "what the hell are you talking about"

twisteriffic 4 days ago||||

Cries in dapper dapr

pradeeproark 4 days ago||||

Are we talking about .NET standard?

jslaby 4 days ago||

No, we're talking about copilot core, not copilot framework

nobodyandproud 4 days ago|||

Somewhere and in some universe there was a Microsoft that did so, wreaking havoc across the multiverse.

whobre 4 days ago|||

Not that I disagree, but this is nothing compared to the ".NET" craze in the early 2000s. Everything had to have ".NET" in its name even if it had absolutely nothing to do with the actual .NET technology.

There was also "Active" before that, but .NET was next level crazy...

anonymars 4 days ago||

I don't know, it seems comparable

Office.com is now "Welcome to Microsoft 365 Copilot"

isk517 4 days ago|||

>>Point it a SharePoint/OneDrive location, a handful of excel spreadsheets and pdfs/word docs and tell it to make a PowerPoint presentation based on that information. It cannot do this. It will spit out nonsense. You have to hold it by the hand tell it everything to do step by step to the point that making the PowerPoint presentation yourself is significantly faster because you don’t have to type out a bunch of prompts and edit it’s garbage output.

Everyone I know who use AI day-to-day is just using Copilot to mostly do things like add a transition animation to a Powerpoint slide or format a word document to look nice. The only problem these LLM products seem to solve is giving normal people a easy way to interact with terrible software processes and GUIs. And better solution to that problem would be for developers to actually observe how the average use interacts with both a computer and their program in particular.

jug 4 days ago|||

The craziest thing was how Microsoft took the super established brand from decades, and renamed Microsoft Office to Microsoft 365.

I'm not sure if it's named Microsoft 365 Copilot nowadays, or if that's an optional AI addon? I thought it was renamed once more, but they themselves claim simply "Microsoft 365" (in a few various tiers) sans-Copilot. https://www.microsoft.com/microsoft-365/buy/compare-all-micr...

timr 4 days ago|||

> There is Github Copilot, the coding autocomplete tool.

No, there is Github Copilot, the AI agent tool that also has autocomplete, and a chat UI.

I understand your point about naming, but it's always helpful to know what the products do.

mgkimsal 4 days ago|||

> No, there is Github Copilot, the AI agent tool that also has autocomplete, and a chat UI.

When it came out, Github Copilot was an autocomplete tool. That's it. That may be what the OP was originally using. That's what I used... 2 years ago. That they change the capabilities but don't change the name, yet change names on services that don't change capabilities further illustrates the OP's point, I would say.

HarHarVeryFunny 4 days ago|||

To be fair, Github Copilot (itself a horrible name) has followed the same arc as Cursor, from AI-enhanced editor with smart autocomplete, to more of an IDE that now supports agentic "vibe coding" and "vibe editing" as well.

I do agree that conceptually there is a big difference between an editor, even with smart autocomplete, and an agentic coding tool, as typified by Claude Code and other CLI tools, where there is not necessarily any editor involved at all.

almosthere 4 days ago||

all of these companies are going to follow each other's UX patterns for the rest of time.

georgeven 4 days ago||||

It was actually nearly 5 years ago!

mgkimsal 3 days ago||

Thanks... 2 years felt a bit too recent. I think I was trialing copilot in late 2022, and then got turned on to ... codeium/windsurf in late 2023. The years are merging together now. :/

timr 4 days ago|||

That's silly. Gmail is a wildly different product than it was when it launched, but I guess it doesn't count since the name is the same?

Microsoft may or may not have a "problem" with naming, but if you're going to criticize a product, it's always a good starting place to know what you're criticizing.

adastra22 4 days ago|||

Gmail is basically the same today as when I signed up for the beta. It’s a mail app.

timr 3 days ago||

Copilot is basically the same today as when I signed up for the beta. It’s a coding app.

kortilla 4 days ago||||

Gmail is almost identical today as it was when it first launched. It just has fancier JavaScript

falsemyrmidon 4 days ago|||

[dead]

jacquesm 4 days ago||||

GPs point is that it is confusing, I guess point well made?

timr 4 days ago||

Only if the naming confusion kept them from actually bothering to understand what the product is?

kemotep 4 days ago|||

The confusion is when I say “I have a terrible time using Copilot, I don’t recommend using it” and someone chimes in with how great their experience with Github Copilot is, a completely different product and how I must be “holding it wrong” when that is not the same Copilot. That Microsoft has like 5 different products all using Copilot in the name, even people in this very comment section are only saying “Copilot” so it is hard to know what product they are talking about!

timr 4 days ago||

I mean, sure. But aside from the fact that everything in AI gets reduced to a single word ("Gemini", "ChatGPT", "Claude") [1], it's clearly not an excuse for misrepresenting the functionality of the product when you're writing a post broadly claiming that their AI products don't work.

Github Copilot is actually a pretty good tool.

[1] Not just AI. This is true for any major software product line, and why subordinate branding exists.

kemotep 4 days ago||

I specifically mention that my experience is with the Office 365 Copilot and how terrible that is and in online discussions I mention this and then people jump out of the woodwork to talk about how great Github Copilot is so thank you for demonstrating that exact experience I have every time I mention Copilot :)

nananana9 4 days ago||||

Naming confusion is a pretty good predictor that it's not worth understanding what the product is.

jacquesm 4 days ago|||

Apparently, so yes.

timr 4 days ago||

Seems like there's another option.

Retric 4 days ago||

Yep, don’t use any of the products in the first place.

Leaving Microsoft’s ecosystem a few years ago has been a great productivity boost, saved quite a bit of cash, and dramatically reduced my frustration.

mirekrusin 4 days ago|||

...it gets better:

GitHub Copilot is a service, you can buy subscription from here https://github.com/features/copilot.

GitHub Copilot is available from website https://github.com/copilot together with services like Spark (not available from other places), Spaces, Agents etc.

GitHub Copilot is VSCode extension which you can download at https://marketplace.visualstudio.com/items?itemName=GitHub.c... and use from VSCode.

New version has native "Claude Code" integration for Anthropic models served via GitHub Copilot.

You can also use your own ie. local llama.cpp based provider (if your github copilot subscription has it enabled / allows it at enterprise level).

Github Copilot CLI is available for download here https://github.com/features/copilot/cli and it's command line interface.

Copilot for Pull Requests https://githubnext.com/projects/copilot-for-pull-requests

Copilot Next Edit Suggestion https://githubnext.com/projects/copilot-next-edit-suggestion...

Copilot Workspace https://githubnext.com/projects/copilot-workspace/

Copilot for Docs https://githubnext.com/projects/copilot-for-docs/

Copilot Completions CLI https://githubnext.com/projects/copilot-completions-cli/

Copilot Voice https://githubnext.com/projects/copilot-voice/

GitHub Copilot Radar https://githubnext.com/projects/copilot-radar/

Copilot View https://githubnext.com/projects/copilot-view/

Copilot Labs https://githubnext.com/projects/copilot-labs/

This list doesn't include project names without Copilot in them like "Spark" or "Testpilot" https://githubnext.com/projects/testpilot etc.

eulers_secret 4 days ago|||

Since we're talking about GitHub Copilot I'll lodge my biggest complaint about it here! The context window is stuck at 128k for all models (except maybe Codex): https://github.com/microsoft/vscode/issues/264153 and https://github.com/anomalyco/opencode/issues/5993

This absolutely sucks, especially since tool calling uses tokens really really fast sometimes. Feels like a not-so-gentle nudge to using their 'official' tooling (read: vscode); even though there was a recent announcement about how GHCP works with opencode: https://github.blog/changelog/2026-01-16-github-copilot-now-...

No mention of it being severely gimped by the context limit in that press release, of course (tbf, why would they lol).

However, if you go back to aider, 128K tokens is a lot, same with web chat... not a total killer, but I wouldn't spend my money on that particular service with there being better options!

perryh2 4 days ago||

This is the first time I've read about this. Thank you. I never noticed because OpenCode just shows you the context window usage as a %.

Octoth0rpe 4 days ago||||

I'm currently using GitHub copilot via Zed and tbh I have no idea which of these this relates to. Perhaps a combination of

> GitHub Copilot is a service

and maybe, the api behind

> GitHub Copilot is VSCode extension

???

What an absolute mess.

almosthere 4 days ago||||

Might be a good time to start a Copilot Copilot company that manages all your copilots.

3acctforcom 4 days ago|||

You want a mess?

Put together a nice and clean price list for your friends in the purchasing department.

I dare you.

jacquesm 3 days ago||

That's a feature, not a bug.

mikkupikku 4 days ago|||

> so why should anyone pay for Copilot?

The execs buying Microsoft products are presumed to be as clueless as the execs naming Microsoft products.

dec0dedab0de 4 days ago|||

You are describing everything Microsoft has done since at least the late 90s.

bluedino 4 days ago||

Things were named fine back then. Small Business Server, Office, Frontpage, Internet Information Server, Visual Studio...

kalleboo 4 days ago|||

That was back when they were going wild naming everything "Active". Active Desktop made sense, Active Directory? What made that "Active". ActiveMovie? It's just a video playing framework... ActiveX?? X?? ActiveSync, I don't want my sync to be active. ActiveStore was apparently a thing?

rvnx 4 days ago|||

It's like ChatGPT, that goes with "Sora", instead of "Image Generation", which would have been very clear

rubslopes 4 days ago|||

> It cannot do this. It will spit out nonsense.

It's unbelievable how bad they failed at this. If you do the same with Claude or ChatGPT via simple web interface, they get miles ahead.

itissid 4 days ago|||

My colleague works in a functional role for a medium sized SaaS company(1000-5000 employees), working with banks, family offices, hedge funds. They use teams and copilot, they all hate it.

One thing that I don't know about is if they have an AI product that can work on combining unstructured and databases to give better insights on any new conversation? e.g. like say the LLM knows how to convert user queries to the domain model of tables and extract information? What companies are doing such things?

This would be something that can be deployed on-prem/ their own private cloud that is controlled by the company, because the data is quite sensitive.

ajcp 4 days ago||

Databricks Genie is excellent from my experience, and provides for all your listed requirements.

raincole 4 days ago|||

> Microsoft really needs to get a better handle with the naming conventions.

AI really should be a freaking feature, not the identity of their products. What MS is doing now is like renaming Photoshop to Photoshop Neural Filter.

beart 4 days ago||

That's a great analogy, but could be taken one step further. Because Adobe would also have to rename the rest of their products to come close to what MS is doing.

  - Adobe Neural Filter Acrobat
  - Adobe Neural Filter App (previously photoshop)
  - Adobe Neural Filter Illustrator
  - Adobe 720 Neural Filter app
  - etc.

By the way, why is app lowercase in "the Microsoft 365 Copilot app"? Is it not part of the trademark but even they couldn't deal with how confusing that was?

ikr678 4 days ago||

'app' isnt part of that trademark, but on other products (Windows App) it is.

Searching the store or a company portal for one of these rebraned apps returns dozens of hits because 'windows', 'copilot', '365' and 'app' are all common words in most application descriptions.

skfiehcusjcn 4 days ago|||

People already do pay for it: office 365. It’s just like getting cloud storage with the subscription. OneDrive has been one of the better cloud storage options for consumers.

Also, a great use is Microsoft Forms I was surprised with the AI features. At first I just used it to get some qualitative feedback but ended up using copilot to enter questions Claude helped me create and it converted them into the appropriate forms for my surveys!

Objectives -> Claude -> Surveys (markdown) -> Copilot -> MS Forms -> Emailed.

Insights and analysis can use copilot too.

Main thing to remember is the models behind the scenes will change and evolve, Copilot is the branding. In fact, we can expect most companies will use multiple AI solutions/pipelines moving forward.

rustyhancock 4 days ago||

Yes. Similarly I have Gemini through having 2TB space on Google Photos.

I have 2TB with OneDrive too via a Family Office account and I've got no good reason to have the large gapps space.

A ChatGPT account and pay for two Claude accounts.

Netflix, Disney+, Prime.

How did this happen to me?

Perhaps I should sign up to one of those companies that will help me close accounts I keep seeing advertised on YouTube?

nunez 3 days ago|||

It's the same with Gemini for Google Workspace.

I asked it to create a slide deck for me, within Slides, based on a block of notes I wrote. It wouldn't do it. The chat assistant at gemini.google.com wouldn't do it either. They told me how to do it step by step though...which I knew how to do already. Useless.

I also tried the `AI()` Sheets function to fill a range in based on some other data in the sheet. It doesn't accept other ranges, even if you use the &CELL_REF& notation.

9x39 3 days ago|||

For being supposedly the same models with how close MS and OpenAI supposedly are, it's bizarrely bad.

I was trying to just get an Excel function dialed in with some IFs and formatting weirdness. The licensed Microsoft 365 Copilot built into Excel tried several times and failed miserably. One screenshot to ChatGPT (5.1?) and it was one-shot.

I'm not even sure it's the same models any more. It feels years behind. Maybe they limit it or cripple it somehow.

GaryBluto 4 days ago|||

>There is Github Copilot, the coding autocomplete tool. There is also Github Copilot, the subscription, that lets you use Anthropic, OpenAI and Google models.

dramm 2 days ago|||

It's the horrible "IBM Watson" marketing of our time, and I expect will do as badly for Microsoft as Watson did for IBM.

basch 4 days ago|||

They will never get better until naming things is a C-Suite level respected position.

I think I could clean up their existing mess if they want help.

Jedd outlines my credentials well here https://news.ycombinator.com/item?id=17522649#17522861

hbn 4 days ago|||

> There is Github Copilot, the coding autocomplete tool.

It's also an LLM chat UI, I don't know if it's because of my work but it lets me select models from all of the major players (GPT, Claude, Gemini)

https://github.com/copilot

JumpCrisscross 4 days ago|||

I’m not a gamer. But it still strikes me as wild that they let go of the Cortana moniker.

0xbadcafebee 4 days ago|||

This isn't a Microsoft thing, it's a big dumb corporation thing. Most big corporations are run by dumb executives who are 100% out of touch with the customer (though even if they were in touch, they wouldn't care). Their only consideration is the stock price. If adding new names to things, chanting the magic spell "AI" over and over, and claiming the new name will make them more money can cause the stock price to increase, that's what they'll do. (Making customers happy doesn't make the stock price rise; if it did, we'd all be a lot less depressed and a lot richer)

krzyk 4 days ago|||

> There is Github Copilot, the coding autocomplete tool.

It is a coding everything, autocomplete, ask, edit files and an agent (claude code like).

p0w3n3d 4 days ago|||

Sorry for a question out of topic, but is there still an offline office license available? So you buy it and have it locally?

felixg3 4 days ago|||

There is also Security Copilot, which I accidentally enabled and got a 3000€ invoice by azure. Thanks, M$!

sandos 3 days ago|||

Github copilot is not just autocompletion, its a chat/agent thing in VS Code.

adamrezich 4 days ago|||

> Laptops sell with Copilot buttons now.

Is it the context menu key? Or did they do another Ctrl+Alt+Shift+Win+L thing?

lionkor 4 days ago||

Oh no it's a new key and it's on your keyboard.

adamrezich 4 days ago|||

Apparently it's Win+Shift+F23.

1718627440 3 days ago|||

The issue is less that it is a new key and more that the menu key is missing instead.

Foobar8568 4 days ago|||

It reminds me of IBM and Watson, most likely the same brain rot at the top.

boredatoms 4 days ago|||

Its long term microsoft culture to be horrific at external naming

rdsubhas 4 days ago|||

So I guess the same situation as with Google Gemini.

hoppp 4 days ago|||

They are vibing the naming probably

racl101 4 days ago|||

Have you seen their Xbox line up?

codethief 4 days ago|||

Don't forget Azure Copilot :)

solumunus 3 days ago|||

Classic Microsoft.

NuclearPM 4 days ago|||

Xbox one series X

They sucks at names.

NuclearPM 2 days ago||

And I can’t edit this mistake…

bandrami 4 days ago|||

Can somebody give me a TLDR on what the "copilot button" does? I've never had one of those laptops and have never understood that. Does it just start the AI front-end? Does it power up the NPU?

sebazzz 4 days ago||

It is just a button. Default it starts the Copilot app which is really the Office app that already existed but now with the copilot tab preselected. Also that Copilot runs in the cloud and doesn't use your NPU.

The only thing until now I've found using the NPU are the built in blur, auto frame and eye focus modes for the webcam.

bandrami 4 days ago||

I got one of those Asus ROG laptops that bragged about having an NPU and I kept trying to figure out how to access it until I realized they just meant the integrated Ryzen GPU (which is also responsible for getting anything rendered by the discrete Nvidia GPU to the actual display). Also its alleged 16G of video memory included a mapped 8G of system RAM. I miss when vendors at least pretended to be honest.

dobin 4 days ago|||

Like Microsoft Defender, which is now Defender Antivirus, or Defender for Endpoint if you have a real license. You will also get Defender for Identity, and maybe Defender for Office 365, which is probably not ASR. And Defender for Cloud, not to be confused with Defender for Cloud Apps.

Oras 4 days ago|||

You need to see how many times they changed AI related services in Azure. It’s a shit show.

major505 4 days ago||

This is what happens when you let all decision to the marketing team without any supervision. They became full retarded.

Marketing need as much supervision as a toddler in a cristal store.

tylerchilds 4 days ago||

This is funny because everyone’s AI strategy should have been

“What do we actually need to be productive?”

Which is how Anthropic pulled ahead of Microsoft, that prioritized

checks notes

Taking screenshots of every windows user’s desktop every few seconds. For productivity.

halapro 4 days ago||

Fun fact: I used to automatically screenshot my desktop every few minutes eons ago. This would occasionally save me when I lost some work and could go back to check the screenshots.

I only gave it up because it felt like a liability and, ahem, it was awkward to review screenshots and delete inopportune ones.

Sharlin 4 days ago||

Long time ago I had a script that would regularly screenshot my desktop… and display the latest screenshot on a page in my `public_html`, on the public web. Just because I thought it would be fun.

tommica 4 days ago||

You were a brave, brave person!

bobsmooth 4 days ago|||

Recall actually sounds like it could be useful but there's a snowball's chance in hell that I would trust Microsoft to not spy on me.

jacquesm 4 days ago|||

On the contrary, you could trust it 100% to spy on you. That's the whole reason that functionality exists.

Nevermark 4 days ago||

Always trust people. Trust people to be themselves.

For some reason, people have great cognitive difficulty with defensive trust. Charlie Brown, Sally.

dangus 4 days ago|||

I don’t plan on using the feature and I don’t plan on using Windows much longer in the first place, but I find that going beyond the ragebait headlines and looking at the actual offering and its privacy policy and security documentation makes it look a lot more reasonable.

Microsoft is very explicit in detailing how the data stays on device and goes to great lengths to detail exactly how it works to keep data private, as well as having a lot of sensible exceptions (e.g., disabled for incognito web browsing sessions) and a high degree of control (users can disable it per app).

On top of all this it’s 100% optional and all of Microsoft’s AI features have global on/off switches.

Dusseldorf 4 days ago|||

Until those switches come in the crosshairs of someone's KPIs, and then magically they get flipped in whatever direction makes the engagement line go up. Unfortunately we live in a world where all of these companies have done this exact thing, over and over again. These headlines aren't ragebait, they're prescient.

dangus 4 days ago||

Well, now you’re just doing the same exact thing I described. You’re basically making up hypothetical things that could happen in the future.

I’ll agree with you the moment Microsoft does that. But they haven’t done it. And again, I’m not their champion, I’m actively migrating away from Microsoft products. I just don’t think this type of philosophy is helpful. It’s basically cynicism for cynicism’s sake.

tylerchilds 4 days ago|||

Here are the settlements from Apple and Google regarding “how phones totally aren’t listening to you and selling the data to advertisers”

https://www.cbsnews.com/news/google-voice-assistant-lawsuit-...

https://www.cbsnews.com/news/lopez-voice-assistant-payout-se...

dangus 4 days ago||

1. Not related to the issue at hand, a completely different system implemented in a completely different way.

2. Settlements are just that: settlements. You can be sued frivolously and still decide to settle because it’s cheaper/less risky.

tylerchilds 4 days ago||

1. Any kernel level vulnerability nullify any formal protections Microsoft guarantees as the first party

https://www.bcs.org/articles-opinion-and-research/crowdstrik...

2. Settlements also avoid discovery because the impact is likely way worse than checks notes less than one day of profits per company, respectively.

dangus 3 days ago||

1. More irrelevant stuff. A kernel level vulnerability can nullify all sorts of good faith security design.

2. I could sue you today for, well, pretty much anything. I don’t have a good case but I can file that lawsuit right now. Would you rather take my settlement offer of $50 or pay a lawyer to go to trial and potentially spend the next months/years of your life in court? You can’t make a blanket statement to say that every company that decides to settle has something to hide, or, similarly, that everyone who exercises their 4th amendment rights has something to hide. I will also point out that companies that make lots of money are huge lawsuit targets, e.g., patent trolls sue large corporations all the time.

Don’t forget we are here talking about a fully optional feature that isn’t even turned on by default. I’m not telling you to love Windows Recall, turn it off or switch to Linux if you don’t love it. My only point is that it’s gotten a lot of incorrect news and social media coverage that is factually untrue and designed to get clicks and reinforce feelings.

tylerchilds 3 days ago||

1. Most people don’t realize kernel hacks undermine their entire mental model of security— tbh, only after crowdstrike did I learn it was possible to mass blue screen a population by a security vendor

2. I’m very much already on Linux, most of my threat model is: “if it’s technically possible, it’s probable” and I adjust my technology choices accordingly

I’m just saying a max cap of $60 for Apple’s settlement sets precedence for future mass surveillance wrist slaps and maybe it would be worth the discovery process to uncover the actual global impact

paxys 4 days ago|||

Anthropic has a model. Microsoft doesn't.

satvikpendem 4 days ago|||

Microsoft can use OpenAI models but it's not the model that's the problem, it's the application of them. Anthropic simply knows how to execute better.

doomslayer999 4 days ago|||

Anthropic's Models are better though. It may not "perform" as well on the LLM task benchmarks, but its the only one that actual gives semi-intelligent responses and seems aligned with human wants. And yes, they definitely have much better execution. It's the only one I considered shelling out 20 bucks for.

satvikpendem 4 days ago||

GPT 5.2 Codex is often better and more thorough than Opus 4.5, it's just slower.

bhadass 4 days ago||||

they should just acquire one of the many agent code harnesses. Something like opencode works just as well as claude-code and has only been around half of the time.

w0m 4 days ago||

I used opencode happily for a while before switching to copilot cli. Been a minute , but I don't detect a major quality difference since they added Plan mode. Seems pretty solid, and first party if that matters to your org.

formerly_proven 4 days ago|||

As evidenced by Anthropic models not performing well in github presents copilot.

speedgoose 4 days ago||

I read that a few times but from my personal observations, Claude Opus 4.5 is not significantly different in GitHub Copilot. The maximum context size is smaller for sure, but I don’t think the model remembers that well when the context is huge.

pixl97 4 days ago||||

Microsoft has a model nearly as old as the company.

Attempt to build a product... Fail.

Buy someone else's product/steal someone else's product... Succeed.

icedchai 4 days ago|||

We love to hate on Microsoft here, but the fact is they are one of the most diversified tech companies out there. I would say they are probably the most diversified, actually. Operating systems, dev tools, business applications, cloud, consumer apps, SaaS, gaming, hardware. They are everywhere in the stack.

Octoth0rpe 4 days ago|||

That's a "business" model, not a language model, which I believe is what the poster is referring to. In any case though, MS does have a number of models, most notably Phi. I don't think anyone is using them for significant work though.

pixl97 4 days ago||

It's a word play, if their LLM model sucks too much they'll get someone else's.

I mean they fought the browser war for years, then just used Chrome.

torginus 4 days ago|||

Which is kind of a bummer - it'd have helped the standards based web to have an actual powerful entity maintain a distinct implementation. Firefox is on life-support and is basically taking code from Blink wholesale, and Webkit isn't really interested in making a browser thats particularly compliant to web standards.

MS's calculus was obvious - why spend insane amounts of engineering effort to make a browser engine that nobody uses - which is too bad, because if I remember correctly they were not too far behind Chrome in either perf or compatibility for a while.

Nevermark 4 days ago||

It would have helped the standards based web, if the standards based web wasn't a fermenting spaghetti monster.

torginus 4 days ago||

From what I've heard a W3C standards meeting is basically a Zoom call between Blink and Webkit engineers.

canucker2016 4 days ago|||

Well, they fought hard until IE6.

Then they took their eyes off the ball - whether it was protecting the Windows fort (why create an app that has all the functionality of an OS that you give away for free - mostly on Windows, some Mac versions, but no Linux support) when people are paying for Windows OR they just diverted the IE devs to some other "hot" product, browser progress stagnated, even with XMLHttpRequest.

jug 4 days ago||||

They do have some in-house LLM's (Phi) but they seem to either have issues with, or not thinking it's worth it, to develop large flagship ones.

tylerchilds 4 days ago||||

One has existed since the 80s, when was the other founded?

Gud 4 days ago||

What does it matter? And Microsoft was founded in the 70s..

iAMkenough 4 days ago||

I think they're implying Microsoft is having a Kodak moment

bee_rider 4 days ago|||

A large language model, or a business model?

zamadatix 4 days ago|||

Recall is great for bashing but relatively inconsequential to anything Microsoft has been doing in this space outside that.

rustyhancock 4 days ago||

Although it seems in Europe we might all end up with recall style screenshots and scanning of what we're looking at.

Part of me wonders if Microsoft knew it would appeal to governments.

https://arstechnica.com/tech-policy/2025/12/uk-to-encourage-...

tylerchilds 4 days ago||

I mean. Ask any gamer if the original Xbox One announcement needing a Kinect and persistent internet connection was a feature request from them or a three letter org.

As someone that was there, we saved the Xbox brand by bullying Microsoft out of normalizing spying on kids and their whole families.

luddit3 4 days ago||

You were robbed last night. No way Jelly Roll should have won.

tylerchilds 4 days ago||

I love you for this reference lol

I hate how I’ve had a web site with my name on it since 2008 and when you google my name verbatim it says “did you mean Tyler Childers”

Such shade from the algorithm, I get it, I get it, software is lamer than music.

paxys 4 days ago||

For one reason or another everyone seems to be sleeping on Gemini. I have been exclusively using Gemini 3 Flash to code these days and it stands up right alongside Opus and others while having a much smaller, faster and cheaper footprint. Combine it with Antigravity and you're basically using a cheat code.

TheAceOfHearts 4 days ago||

This comment is a bit confusing and surprising to me because I tried Antigravity three weeks ago and it was very undercooked. Claude was actually able to identify bugs and get the bigger picture of the project, while Gemini 3 with Antigravity often kept focusing on unimportant details.

My default everyday model is still Gemimi 3 in AI Studio, even for programming related problems. But for agentic work Antigravity felt very early-stages beta-ware when I tried it.

I will say that at least Gemimi 3 is usually able to converge on a correct solution after a few iterations. I tried Grok for a medium complexity task and it quickly got stuck trying to change minor details without being able to get itself out.

Do you have any advice on how to use Antigravity more effectively? I'm open to trying it again.

paxys 4 days ago|||

Ask it to verify stuff in the browser. It can open a special Chrome instance, browse URLs, click and scroll around, inspect the DOM, and generally do whatever it takes to verify that the problem is actually solved, or it will go back and iterate more. That feedback loop IMO makes it very powerful for client-side or client-server development.

sherlock_h 3 days ago||

How does that work? Do you have a link to documentation?

Analemma_ 4 days ago||||

I've mentioned this before, but I think Gemini is the smartest raw model for answering programming questions in chatbot mode, but these CC/Codex/gemini-cli tools need more than just the model, the harness has to be architected intelligently and I think that's where Google is behind for the moment.

8note 4 days ago|||

gemini flash in their claude code compatitor does pretty well if you give it alternative tools.

the tools its built with seem to suck, but it can cook with serena mcp.

the flash models seem to get better results than the pro ones as far as ive seen, but theres not a big difference

ralusek 4 days ago|||

I think Gemini is an excellent model, it's just not a particularly great agent. One of the reasons is that its code output is often structured in a way that looks like it's answering a question, rather than generating production code. It leaves comments everywhere, which are often numbered (which not only is annoying, but also only makes sense if the numbering starts within the frame of reference of the "question" it's "answering").

It's also just not as good at being self-directed and doing all of the rest of the agent-like behaviors we expect, i.e. breaking down into todolists, determining the appropriate scope of work to accomplish, proper tool calling, etc.

freedomben 4 days ago|||

Yeah, you may have nailed it. Gemini is a good model, but in the Gemini CLI with a prompt like, "I'd like to add <feature x> support. What are my options? Don't write any code yet" it will proceed to skip right past telling me my options and will go ahead an implement whatever it feels like. Afterward it will print out a list of possible approaches and then tell you why it did the one it did.

Codex is the best at following instructions IME. Claude is pretty good too but is a little more "creative" than codex at trying to re-interpret my prompt to get at what I "probably" meant rather than what I actually said.

phainopepla2 4 days ago|||

Try the conductor extension for gemini-cli: https://github.com/gemini-cli-extensions/conductor

It won't make any changes until a detailed plan is generated and approved.

michaelcampbell 4 days ago||||

Can you (or anyone) explain how this might be? The "agent" is just a passthrough for the model, no? How is one CLI/TUI tool better than any other, given the same model that it's passing your user input to?

I am familiar with copilot cli (using models from different providers), OpenCode doing the same, and Claude with just the \A models, but if I ask all 3 the same thing using the same \A model, I SHOULD be getting roughly the same output, modulo LLM nondeterminism, right?

taylorius 4 days ago||

maybe different preparatory "system" prompts?

PantaloonFlames 4 days ago|||

I've had the exact opposite experience. After including in my prompt "don't write any code yet" (or similar brief phrase), Gemini responds without writing code.

Using Gemini 2.5 or 3, flash.

sutterd 4 days ago|||

My go-to models have been Claude and Gemini for a long time. I have been using Gemini for discussions and Claude for coding and now as an agent. Claude has been the best at doing what I want to do and not doing what I don’t want to do. And then my confidence in it took a quantum leap with Opus 4.5. Gemini seems like it has gotten even worse at doing what I want with new releases.

jckahn 4 days ago|||

Yeah I don't understand why everyone seems to have forgotten about the Gemini options. Antigravity, Jules, and Gemini CLI are as good as the alternatives but are way more cost effective. I want for nothing with my $20/mo Google AI plan.

paxys 4 days ago|||

Yeah I'm on the $20/mo Google plan and have been rate limited maybe twice in 2 months. Tried the equivalent Claude plan for a similar workload and lasted maybe 40 minutes before it asked me to upgrade to Max to continue.

lelanthran 3 days ago||

> Yeah I'm on the $20/mo Google plan and have been rate limited maybe twice in 2 months. Tried the equivalent Claude plan for a similar workload and lasted maybe 40 minutes before it asked me to upgrade to Max to continue.

The TLDR: The $20/40m cost is more reflective of what inference actually costs, including the amortised cost of the Capex, together with the Opex.

The Long Read:

I think the reason is because Anthropic is attempting to run inference at a profit and Google isn't.

Another reason could be that they don't own their cost centers (GPUs are from Nvidia, Cloud instances are from AWS, data centers from AWS, etc); they own only the model but rent everything else needed for inference so pay a margin for all those rented cost centers.

Google owns their entire vertical (GPUs are google-made, Cloud instances and datacenters are Google-owned, etc) and can apply vertical cost optimisations, so their final cost of inference is going to be much cheaper anyway even if they were not subsidising inference with their profits from unrelated business units.

jckahn 3 days ago||

Well said.

It's for exactly this reason that I believe Google will win the AI race.

codazoda 4 days ago||||

It's crazy that we're having such different experiences. I purchased the Google AI plan as an alternative to my ChatGPT (Codex) daily driver. I use Gemini a fair amount at work, so I thought it would be a good choice to use personally. I used it a few times but ran into limits the first few projects I worked on. As a result I switched to Claude and so, far, I haven't hit any limits.

riku_iki 4 days ago|||

Google has uncertain privacy settings, there is no declaration they won't train their LLM on your personal/commercial code.

Zopieux 4 days ago||

https://macaron.im/blog/ai-assistant-privacy-comparison#:~:t...

All providers are opt-out. The moat is the data, don't pretend like you don't know.

riku_iki 4 days ago||

per my previous research there is no opt out for gemini cli.

jug 4 days ago|||

I've heard Opus 4.5 might have an edge especially in long running agentic coding scenarios (?) but personally yes Gemini 3 series is what I was expecting GPT-5 to be.

I'm also mostly on Gemini 3 Flash. Not because I've compared them all and I found it the best bar none, but because it fulfills my needs and then some, and Google has a surprisingly little noted family plan for it. Unlike OpenAI, unlike Anthropic. IIRC it's something like 5 shared Gemini Pro subs for the price of 1. Even being just a couple sharing it, it's a fantastic deal. My wife uses it during studies, I professionally with coding and I've never run into limits.

pRusya 4 days ago|||

It's the opposite experience for me. Gemini mostly produces made up and outdated stuff.

whalee 4 days ago|||

I think counter to the assumption of myself (and many), for long form agent coding tasks, models are not as easily hot swappable as I thought.

I have developed decent intuition on what kinds of problems Codex, Claude, Cursor(& sub-variants), Composer etc. will or will not be able to do well across different axes of speed, correctness, architectural taste, ...

If I had to reflect on why I still don't use Gemini, it's because they were late to the party and I would now have to be intentional about spending time learning yet another set of intuitions about those models.

codazoda 4 days ago||

I feel like "prompting language" doesn't translate over perfectly either. It's like we become experts at operating a particular AI agent.

I've been experimenting with small local models and the types of prompts you use with these are very different than the ones you use with Claude Code. It seems less different between Claude, Codex, and Gemini but there are differences.

It's hard to articulate those differences but I think that I kind of get in a groove after using models for a while.

codazoda 4 days ago|||

I've used Gemini CLI a fair amount as well—it's included with our subscription at work. I like it okay, but it tends to produce "lies" a bit too often. It tends to produce language that reads as over confident that it's found a problem or solution. This causes me extra work to verify or causes me extra time because I believed it. In my experience Claude Code does this quite a bit less.

OsrsNeedsf2P 4 days ago|||

For all the hype I see about Gemini, we integrated it with our product (an AI agent) and it consistently performs worse[0] than Claude Sonnet, Opus, and ChatGPT 5.2

[0] based on user Thumbs up/Thumbs down voting

qaq 4 days ago|||

Maybe it's the types of projects I work on but Gemini is basically unusable to me. Settled on Claude Code for actual work and Codex for checking Claude's work. If I try to mix in Gemini it will hallucinate issues that do not exist in code at very high rate. Claude and Codex are way more accurate at finding issues that actually exist.

CuriouslyC 4 days ago|||

Oddly enough, as impressive as Gemini 3 is, I find myself using it infrequently. The thing Gemini 2.5 had over the other models was dominance in long context, but GPT5.2-codex-max and Opus 4.5 Thinking are decent at long context now, and collectively they're better at all the use cases I care about.

notatoad 4 days ago|||

I can think of one major reason why Microsoft and Apple would prefer to feed their codebases into Claude than to Gemini.

psyclobe 4 days ago|||

I tried to use it, kept saying it was at max capacity and nothing would happen. I gave it a good day before giving up.

zrn900 3 days ago|||

What everyone is really sleeping on is Deepseek paid API with Cline and VSCode. An agent that can refactor entire codebases with a 128.0k context window that costs dimes. It generates entire blocks of code and tests them for $0.02 a pop. Deepseek paid API brings the low cost large context window and memory. VSCode the interface, CLine the agent.

aantix 4 days ago|||

Not my experience at all.

It fails to be pro-active. "Why didn't you run the tests you created?"

I want it to tell me if the implementation is working.

Feels lazy. And it hallucinates solutions frequently.

It pales in comparison to CC/Opus.

zhengyi13 4 days ago||

I feel like this is exactly the use case for things like Hooks and Skills. Which, if you don't want to write them yourself, I get it. But I do think we can get the tool to do it; sounds like you want it doing that a little more actively/out-of-the-box?

bastawhiz 4 days ago|||

I've never, ever had a good experience with Gemini (3 Pro). It's been embarrassingly bad every time I've tried it, and I've tried it lots of times. It overcomplicates almost everything, hallucinates with impressive frequency, and needs to be repeatedly nudged to get the task fully completed. I have no reason to continue attempting to use it.

JoshMandel 4 days ago||

Same. Sometimes even repeated nudges don't help. The underlying 3.0 Pro model is great to talk and ideate with, but its inability to deliver within the Gemini CLI harness is ... almost comical.

mfro 4 days ago|||

For me it just depends on the project. Sometimes one or the other performs better. If I am digging into something tough and I think it's hallucinating or misunderstanding, I will typically try another model.

TZubiri 4 days ago|||

I don't think anyone is sleeping on it.

It's on the top of most leaderboards on lmarena.ai

jonathanstrange 4 days ago|||

I'm also using Gemini and it's the only option that consistently works for me so far. I'm using it in chat mode with copy&paste and it's pleasant to work with.

Both Claude and ChatGPT were unbearable, not primarily because of lack of technical abilities but because of their conversational tone. Obviously, it's pointless to take things personally with LLMs but they were so passive-aggressive and sometimes maliciously compliant that they started to get to me even though I was conscious of it and know very well how LLMs work. If they had been new hires, I had fired both of them within 2 weeks. In contrast, Gemini Pro just "talks" normally, task-oriented and brief. It also doesn't reply with files that contain changes in completely unrelated places (including changing comments somewhere), which is the worst such a tool could possibly do.

Edit: Reading some other comments here I have to add that the 1., 2. ,3. numbering of comments can be annoying. It's helpful for answers but should be an option/parameterization.

lelanthran 3 days ago|||

> Both Claude and ChatGPT were unbearable, not primarily because of lack of technical abilities but because of their conversational tone.

It's pretty much trial and error.

I tried using ChatGPT via the webchat interface on Sunday and it was so terse and to the point that it was basically useless. I had to repeatedly prompt for all the hidden details that I basically gave up and used a different webchat LLM (I regularly switch between ChatGPT, Claude, Grok and Gemini).

When I used it a month ago, it would point out potential footguns, flaws, etc. I suppose it just reinforces the point that "experience" gained using LLMs is mostly pointless, your experience gets invalidated the minute a model changes, or a system prompt changes, etc.

For most purposes, they are all mostly the same i.e. produce output so similar you won't notice a difference.

bonesss 4 days ago|||

I think you’re highlighting an aspect of agentic coding that’s undervalued: what to do once trust is breached… ?

With humans you can categorically say ‘this guy lies in his comments and copy pastes bullshit everywhere’ and treat them consistently from there out. An LLM is guessing at everything all the time. Sometimes it’s copying flawless next-level code from Hacker News readers, sometimes it’s sabotaging your build by making unit tests forever green. Eternal vigilance is the opposite of how I think of development.

satvikpendem 4 days ago|||

Eh, it's not near Opus at all, closer to Sonnet. It is nice though with Antigravity because it's free versus being paid in other IDEs like Cursor.

causal 4 days ago||

Yeah use Flash 3 for easy + fast stuff, but it can't hold the plot like Opus or Codex 5

tiangewu 4 days ago|||

[dead]

catlover76 4 days ago|||

It's ok, but it too frequently edits WAY more than it needs to in order to accomplish the task at hand.

GPT-5.2 sometimes does this too. Opus-4.5 is the best at understanding what you actually want, though it is ofc not perfect.

dingnuts 4 days ago||

[dead]

paxys 4 days ago||

Crazy to think that Github Copilot was the first mainstream AI coding tool. It had all the hype and momentum in the world, and Microsoft decided to do...absolutely nothing with it.

leoedin 4 days ago||

I use Copilot in VSCode at work, and it's pretty effective. You can choose from quite a few models, and it has the agentic editing you'd expect from an IDE based AI development tool. I don't know if it does things like browser integration because I don't do frontend work. It's definitely improved over the last 6 months.

There's also all the other Copilot branded stuff which has varying use. The web based chat is OK, but I'm not sure which model powers it. Whatever it is it can be very verbose and doesn't handle images very well. The Office stuff seems to be completely useless so far.

Sammi 4 days ago|||

Have you tried any other popular agentic coding tool? Like Claude Code, Cursor, Opencode, or Codex or something else? Because I've used all of these and Copilot in anger in the last three months, and Copilot wasn't even in the same league as the others. Comparatively it just plain sucked. Slow and gave poor results. All the others I mentioned are withing spitting distance of each other from what I can tell from my usage.

another_twist 4 days ago|||

I have found copilot to be very noisy to the point where I have had to turn it off and then uninstall on both IntelliJ and VSCode. I have found generating code via your favourite coding agent and then reviewing the output to be less taxing since the review burden is reduced. For agentic, you often have to review a bunch of code but its usually very close to the spec. Reviews are then easier.

0xbadcafebee 4 days ago|||

They launched GitHub Codespaces, a free containerized dev environment with VScode & Copilot, and it's broken six ways from Sunday. VScode/Copilot extensions are constantly breaking and changing. The GitHub web interface is now much harder to use, to the point I've just stopped browsing it. Nobody over there cares if these things work. (But weirdly, the Copilot CLI works 4x better than the Copilot VSCode extension at actually writing code)

eloisant 4 days ago|||

It was kinda cool for a demo, but Claude Code really was the first game changer in AI coding.

ecshafer 4 days ago|||

Microsoft is still Microsoft.

LeoPanthera 4 days ago|||

The "smart autocomplete" part of Github Copilot is still the most useful AI coding thing for me at the moment. I continue to subscribe to it just for that.

llm_nerd 4 days ago||

Did it have all the hype and momentum, though? It was pretty widely viewed as a low- to negative-value addition, and honestly when I see someone on here talking about how useless AI is for coding, I assume they were tainted by Github copilot and never bothered updating their priors.

freedomben 4 days ago|||

just my experience of course, but it had a lot of hype. It got into a lot of people's workflow and really had a strong first mover advantage. The fact that they supported neovim as a first-class editor surely helped a ton. But then they released their next set of features without neovim support and only (IIRC) support VS Code. That took a lot of wind out of the sails. Then combined with them for some reason being on older models (or with thinking turned down or whatever), the results got less and less useful. If Co-pilot had made their agent stuff work with neovim and with a CLI, I think they'd be the clear leader.

interestpiqued 4 days ago|||

My first experience was with cursor and my entire team went through a honeymoon period before it got kind of sidelined. Average usage was giving an agent a couple shots at a problem but usually solving it ourselves ultimately. Internal demos were lackluster. Team was firmware though so might not be a great topic for GenAI yet.

softwaredoug 4 days ago||

It really says something that MS/Github has been trying to shovel Copilot down our throats for years, and Anthropic just builds a tool in a short period of time and it takes off.

It's interesting to think back, what did Copilot do wrong? Why didn't it become Claude Code?

It seems for one thing its ambition might have been too small. Second, it was tightly coupled to VS Code / Github. Third, a lot of dumb big org Microsoft politics / stakeholders overly focused on enterprise over developers? But what else?

moregrist 4 days ago||

I think the answer is pretty simple.

It's pretty clear that Microsoft had "Everything must have Copilot" dictated from the top (or pretty close). They wanted to be all-in on AI but didn't start with any actual problems to solve. If you're an SWE or a PM or whatever and suddenly your employment/promotion/etc prospects depend on a conspicuously implemented Copilot thing, you do the best you can and implement a chat bot (and other shit) that no one asked for or wants.

I don't know Anthropic's process but it produced a tool that clearly solves a specific problem: essentially write code faster. I would guess that the solution grew organically given that the UI isn't remotely close to what you'd expect a product manager to want. We don't know how many internal false-starts there were or how many people were working on other solutions to this problem, but what emerged clearly solved that problem, and can generalize to other problems.

In other words, Microsoft seems to have focused on a technology buzzword. Anthropic let people solve their own problems and it led to an actual product. The kind that people want. The difference is like night and day.

Who knows what else might have happened in the last 12 months if C-suites were focused more on telling SWEs to be productive and less on forcing specific technology buzzwords because they were told it's the future.

softwaredoug 4 days ago||

Having worked in large orgs, I can totally imagine someone having an idea like Claude Code and it getting quietly shelved because it

(A) doesn’t align to some important persons vision, who is incentivized have their finger on whatever change comes about

(B) might step on a lot of adjacent stakeholders, and the employees stakeholder may be risk adverse and want to play nice.

(D) the creators don’t want to fight an uphill battle for their idea to win.

another_twist 4 days ago|||

I have to deal with C) at $BiGTech where multiple ML teams reported to someone who never worked with ML. For machine learning, it is especially problematic since even being a good engineer requires you to understand the algorithms on some fundamental level. Now thats hard if you have never done anything remotely ML in life.

yesiamyourdad 4 days ago|||

They have copilot-cli, which is something like Claude Code, it's actually pretty effective, at least more effective than Copilot+VSCode.

I think in the end it's branding. They want people to think "Copilot = AI" but the experience is anywhere from fairly effective to absolute trash. And the most visible applications are absolute trash. It really says something when Ethan Mollick is out there demonstrating that OpenAI is more effective at working with Excel than the built in AI.

There was an article posted here yesterday that said "MS has a lot to answer for with Copilot", and that was the point: MS destroyed their AI brand with this strategy.

tomashubelbauer 4 days ago|||

I think big corporations are just structurally unable to create products people actually want to use. They have too much experience with their customers being locked in and switching costs keeping them locked in. Anthropic needed a real product to win mind-share first, they will start enshitifying later (by some accounts they may already have). The best thing a big corporation can do with a nascent technology like that is to make it available to use to everywhere and then acquire the startup that converts it to a winner first. Microsoft even fumbled that.

falloutx 4 days ago|||

Microsoft can just get one of thier devs to build a coding agent but instead all of these companies are just bowing down to Anthropic just because Anthropic is selling execs a dream situation where they can fire most of the devs. None of the other coding agents are any worse than CC, Gemini & Crush are even better, Codex is decent and even something like Opencode is catching up.

prmph 4 days ago||

Nah, Claude Code is really that better. I should know, every few months I try to move away from Claude Code, only to come running back to it.

Gemini CLI (not the model) is trash, I wish it weren't so, but I only have to try to use for a short time before I give up. It regularly retains stale file contents (even after re-prompting), constantly crashes, does not show diffs properly, etc, etc.

I recently tried OpenCode. It's got a bit better, but I still have all kind of API errors with the models. I also have no way to scroll back properly to earlier commands. Its edit acceptance and permissions interface is wonky.

And so on. It's amazing how Claude Code just nails the agentic CLI experience from the little things to the big.

Advice to agentic CLI developers: Just copy Claude Code's UX exactly, that's your starting point. Then, add stuff that make the life of user even easier and more productive. There's a ton of improvements I'd like to see in Claude Code:

- I frequently use multiple sessions. It's kinda hard to remember the context when I come back to a tab. Figure out a way to make it immediately obvious.

- Allow me to burn tokens to keep enough persistent context. Make the agent actually read my AGENTS.md before every response. Ensure thew agents gets closer and closer to matching the way I'd like it work as the sessions progresses (and even across sessions).

- Add a Replace tool, like the Read tool, that is reliable and prevents the agent from having to make changes manually one by one, or worse using sed (I've banned my agents from using sed because of the havoc they cause with it).

falloutx 3 days ago||

> Advice to agentic CLI developers: Just copy Claude Code's UX exactly, that's your starting point.

What? Its not even the best experience. The best UX is done by Crush. and they nail the experience, but its slightly worse because they made it work for all models.

llmslave 4 days ago|||

i think microsoft just doesnt have top talent building these products

firemelt 4 days ago||

because claude code do it fullstack u know, the model and implementation, the interation is seamless,

meanwhile ms and github, is waiting for any breadcrumb that chatgpt leave with

adastra22 4 days ago||

So is GitHub copilot. They run their own models.

phito 4 days ago||

Well yeah, it is just better. At my work we have a copilot license, but we use it to access Claude Sonnet/Opus model in OpenCode.

azaras 4 days ago||

The Copilot-Cli is not so bad,

https://github.com/features/copilot/cli

hpdigidrifter 4 days ago|||

Can't speak for copilot but Gemini cli is unbelievably bad compared to Gemini web.

CC has some magic secret sauce and I'm not sure what it is.

My company pays for both too, I keep coming back to Claude all-round

mcintyre1994 4 days ago|||

Claude Code is one of a very few AI tools where I genuinely think the people at the company who build it use it all the time.

giancarlostoro 4 days ago|||

They absolutely do, the CEO has come out and said a few engineers have told him that they dont even write code by hand anymore. To some people that sounds horrifying, but a good engineer would not just take code blindly, they would read it and refine it using Claude, while still saving hundreds of man hours.

lelanthran 3 days ago||

> They absolutely do, the CEO has come out and said a few engineers have told him that they dont even write code by hand anymore. To some people that sounds horrifying, but a good engineer would not just take code blindly, they would read it and refine it using Claude, while still saving hundreds of man hours.

TBH, that isn't sustainable. Skills atrophy. At some point they are going take the code blindly.

Considering what they have said in the past about agentic code changes, they are already doing just that - blindly approving code from the agent. I say this because when I last read what one of their engineers on CC tweeted/posted/whatever, I thought to myself "No human can review that many lines of code per month"[1].

---------

[1] IIRC, it was something stupid like 30kLoc reviewed in a month by a single engineer.

kamaal 3 days ago||

>>Skills atrophy.

I keep telling my friends while experienced devs feel extremely productive. The newer ones will likely not develop skills needed to work with finer aspects of code.

This might work for a while, but you do a year or two of this, and then as little as a small Python script will feel like yak shaving.

danw1979 4 days ago||||

He does !

https://x.com/bcherny/status/2007179832300581177

MarcelOlsz 4 days ago||||

I know they do because of how painfully awful the Claude web/Claude desktops uxui is, as well as performance.

taude 4 days ago||||

watch the interviews with Boris. He absolutely uses it to build CC.

vdm 4 days ago|||

s/AI//

michaelcampbell 4 days ago|||

I would love to hear/see a definitive answer for this, but I read somewhere that the relationship between MS and \A is such that the copilot version of the \A models has a smaller context window than through CC.

This would explain the "secret sauce", if it's true. But perhaps it's not and a lot is LLM nondeterminism mixing with human confirmation bias.

taude 4 days ago||||

Agreed. I was an early adopter of Claude Code. And at work we only had Copilot. But the Copilit CLI isn't too bad now. you've got slash commands for Agents.MD and skills.md files now for controlling your context, and access to Sonnet & Opus 4.5.

Maybe Microsoft is just using it internally, to finish copying the rest of the features from Claude Code.

Much like the article states, I use Claude Code beyond just it's coding capabilities....

yesiamyourdad 4 days ago||

Same situation, once I discovered the CLI and got it set up, my happiness went up a lot. It's pretty good, for my purposes at work it's probably as good as Claude Code.

tveita 4 days ago||||

The Copilot IntelliJ integration on the other hand is atrocious: https://plugins.jetbrains.com/plugin/17718-github-copilot--y...

I'm amazed that a company that's supposedly one of the big AI stocks seemingly won't spare a single QA position for a major development tool. It really validates Claude's CLI-first approach.

k__ 4 days ago|||

It's sluggish in GitHub Codespaces, as it has so many animations.

lifetimerubyist 4 days ago||

[dead]

andyjohnson0 4 days ago||

https://archive.ph/vc3Cn

fastThinking 4 days ago||

So Copilot is for customers, Claude is for getting actual work done?

monocularvision 4 days ago||

Copilot in the streets, Claude in the sheets.

k__ 4 days ago||

Copilot isn't a model, you can use Claude via Copilot.

taude 4 days ago|||

Both use the same models. But Claude Code has something special that Microsoft doesn't have in Github Copilot CLI.

thesdev 4 days ago||||

Copilot is anything you want it to be inside Microsoft. Heck even Office is Copilot nowadays.

SoftTalker 4 days ago||

Seems to be their "Watson."

theanonymousone 4 days ago||||

Neither is Copilot. The title explicitly mentions Claude "Code".

cush 4 days ago|||

I don’t think that’s what they were insinuating. Claude Code internally, Copilot for customers.

jonathanoliver 4 days ago||

Kinda reminds of the time Microsoft used git internally but was pushing Team Foundation Server.

pluralmonad 4 days ago|

There is an entire generation of devs that TFS ruined for version control. I've had to essentially rehabilitate folks and heal old TFS wounds to get them properly using git (so many copies of repos on their filesystem...).

MarcelOlsz 4 days ago||

Sounds like they would have loved git worktrees.

veryfancy 4 days ago|

GitHub Copilot with Opus 4.5 as the model is great. I have not tried Claude Code, so maybe I don’t know what I’m missing.

smithkl42 4 days ago||

I'm one of those really odd beasts that feels some sort of loyalty to Microsoft, so I started out on Copilot and was very reluctant to try Claude Code. But as soon as I did, I figured out what the hype was about. It's just able to work over larger code bases and over longer time horizons than Copilot. The last time I tried Copilot, just to compare, I noticed that it would make some number of tool calls (not even involving tokens!) and then decide, "Nah, that's too many. We're just not going to do any work for a while." It was bizarre. And sometimes it would decide that a given bog-standard tool call (like read a file or something) needed to get my permission every. single. time. I couldn't do anything to convince it otherwise. I eventually gave up. And since then, we've built all our LLM support infrastructure around Claude Code, so it would be painful to go back to anything else.

torginus 4 days ago||

I don't really like how Claude Code kind of obscures the actual code from you - I guess that's why people keep putting out articles about how certain programmers have absolutely no idea whats going on inside the code.

It's truly more capable but still not capable enough that Im comfortable blindly trusting the output.

MattGrommes 4 days ago|||

That's the big difference for me. I use Github Copilot because I want to see the output and work with it. For people who are fine just shooting a prompt out and getting code back, I'm sure Claude Code is better.

chasd00 4 days ago||||

> Claude Code kind of obscures the actual code from you

not sure what you mean, I have vscode open and make code changes in between claude doing its thing. I have had it revert my changes once which was amusing. Not sure why it did that, I've also seen it make the same mistake twice after being told not to.

tomashubelbauer 4 days ago|||

This is not a problem when you assume the role of an architect and a reviewer and leave the entirety of the coding to Claude Code. You'll pretty much live in the Git Changes view of your favorite IDE leaving feedback for Claude Code and staging what it managed to get right so far. I guess there is a leap of faith to make because if you don't go all the way and you try to code together with Claude Code, it will mess with your stuff and undo a lot of it and it's just frustrating and not optimal. But if you remove yourself from the loop completely, then indeed you'll have no idea what's going on. There still needs to be a human in the loop, and in the right part of it, otherwise you're just vibe coding garbage.

keithnz 4 days ago||

What I don't understand is why so few people talk about AugmentCode, it uses claude (and others) but builds context of your project and tends to understand your repos better.

More comments...