Posted by WXLCKNO 6 hours ago

Claude Code is being dumbed down? (symmetrybreak.ing)
622 points | 424 comments | page 2
chickensong 4 hours ago|
For a general tool that has such a broad user base, the output should be configurable. There's no way a single config, even with verbose mode, will satisfy everyone.

Set minimal defaults to keep output clean, but let users pick and choose items to output across several levels of verbosity, similar to tcpdump, Ansible, etc. (-v to -vvvvv).
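A minimal sketch of that stacked-verbosity idea in Python, using `argparse`'s `action="count"`. The `agent` program name, the output categories, and the levels assigned to them are all hypothetical, chosen only to illustrate the pattern; none of this reflects how any real coding agent is configured.

```python
import argparse

# Hypothetical output categories, each tagged with the minimum verbosity
# level at which it is shown. Names and levels are illustrative only.
OUTPUT_LEVELS = {
    "final_answer": 0,
    "files_read": 1,
    "search_patterns": 2,
    "tool_calls": 3,
    "raw_model_output": 4,
}


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="agent")
    # action="count" turns -v, -vv, -vvv ... into 1, 2, 3 ...
    parser.add_argument("-v", "--verbose", action="count", default=0)
    return parser


def visible_items(verbosity: int) -> list[str]:
    """Return the output categories shown at the given verbosity level."""
    return [name for name, level in OUTPUT_LEVELS.items() if level <= verbosity]


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(visible_items(args.verbose))
```

With no flag only the final answer is shown; each extra `v` unlocks one more category, so the default stays clean while the detail remains reachable.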

I know businesses are obsessed with providing Apple-like "experiences", where the product is so refined there's just "the one way" to magically do things, but that's not going to work for a coding agent. It needs to be a unix-like experience, where the app can be customized to fit your bespoke workflow, and opening the man page does critical damage unless you're a wizard.

LLMs are already a magic box, which upsets many people. It'll be a shame if Anthropic alienates their core fan base of SWEs by making things more magical.

bayindirh 5 hours ago||
It's pretty interesting to watch AI companies start to squeeze their users as the constraints (financial, technical, capacity-wise) start to squeeze the companies.

Ads in ChatGPT. Removing features from Claude Code. I think we're just beginning to face the music. It's also funny how Google "invented" ad injection in replies with real-time auction capabilities, yet OpenAI would be the first implementer of it. It's similar to how transformers played out.

For me, that's another "popcorn time". I don't use any of these in any capacity, except Gemini, which I seldom use to ask stuff when deep diving on the web doesn't give any meaningful results. The last question I asked managed to return only one (but interestingly correct) reference, which I followed and continued my research from there.

slantaclaus 1 hour ago||
$200 a month? I buy compute credits as needed and have used maybe $300 in a year
lionkor 5 hours ago||
Meanwhile GPT-5.3-Codex, which was released recently, is a huge change and much better. It now displays intermediate thinking summaries instead of being silent.
fooker 5 hours ago|
My experience using it from cursor has been fairly disappointing
chairmanwow1 5 hours ago|||
Much better in the codex cli harness
roflcopter69 4 hours ago|||
There's one really confusing thing in Codex CLI from my perspective. How do I make it run unsandboxed but still ask me for approvals? I'm fine with it running bare on my machine, but I like to approve commands before it runs them. But I only see how to configure it to have both or neither. What am I missing?
fooker 5 hours ago|||
Interesting, I can give that a try at some point.
lionkor 5 hours ago|||
In what way(s), if you can elaborate?
fooker 5 hours ago||
Claude 4.5 or 4.6 just one shots what I ask instead of getting stuck in random tangents.
hirako2000 6 hours ago||
Sounds like the compacting issue.

> Compacting fails when the thread is very large

> We fixed it.

> No you did not

> Yes now it auto compacts all messages.

> Ok but we don't want compaction when the thread isn't large, plus, it still fails when the compacted thread is too large

> ...

Joel_Mckay 5 hours ago|
Let me fix that for you:

> Compacting fails when the thread is very large

Flips coin, it is Heads

> We fixed it.

> No you did not

Flips coin, it is Tails

> Yes now it auto compacts all messages.

Flips coin, it is Heads

> Ok but we don't want compaction when the thread isn't large, plus, it still fails when the compacted thread is too large

Flips coin, it is Grapefruit

> ...

Congratulations on a vibe solution, if you are unhappy with the frequency of isomorphic plagiarism... the vendor still has your money and new data =3

testfrequency 2 hours ago||
I agree the quality of Claude Code recently has felt poor and frustrating.

I’ve been persistently dealing with the agent running in circles on itself when trying to fix bugs, not following directions fully and choosing to only accomplish partial requests, failing to compact and halting a session, and ignoring its MCP tooling and doing stupid things like writing cruddy python and osascripts unnecessarily.

I’ve been really curious about Codex recently, but I’m so deep into Claude Code, with multiple skills, agents, MCPs, and a skill router.

Can anyone recommend an easy migration path from Claude Code to Codex for a first-time Codex user?

elzbardico 5 hours ago||
This was really useful; sometimes, at a glance, you'd see Claude looking at the wrong files or searching for the wrong patterns, and would be able to immediately interrupt it. For those of us who like to be deeply involved in what Claude is doing, those updates were terribly disappointing.
g-mork 5 hours ago||
Absolutely worse than dumbed down, 4.6 is a mess. Ask it the simplest of questions, look away, and come back to 700 parallel tool uses. https://old.reddit.com/r/ClaudeAI/comments/1r1cfha/is_anyone...
Retr0id 6 hours ago||
I also found this change annoying.

Often a codebase ends up with non-authoritative references for things (e.g. docs out of sync with implementation, prototype vs "real" version), and the proper solution is to fix and/or document that divergence. But let's face it, that doesn't always happen. When the AI reads from the wrong source it only makes things worse, and when you can't see what it's reading it's harder to even notice that it's going off track.

locusofself 5 hours ago|
Working at Microsoft, I've just now hooked up to Claude Code (my department was not permitted to use it previously) through something called "Agent Maestro", a VS Code extension which I guess pipes Claude Code API requests to our internally hosted Claude models, including Opus 4.6.

I do wonder if there is going to be much of a difference between using Claude Code vs. Copilot CLI when using the same models.

nfg 5 hours ago||
> I do wonder if there is going to be much of a difference between using Claude Code vs. Copilot CLI when using the same models.

I’m also at MS, not (yet?) using Claude Code at work and pondering precisely the same question.

cactusplant7374 4 hours ago|||
Is this an indictment of OpenAI's models -- that Microsoft has access to through their investment?
locusofself 4 hours ago||
We've had both GPT and Claude models available to us in GitHub Copilot for some time. At first, it was only GPT models.
pletnes 5 hours ago|||
I honestly don’t think the models are as important as people tend to believe. More important is how the models are given tools - find, grep, git, test runners, …
Galanwe 4 hours ago||
> I honestly don’t think the models are as important as people tend to believe.

I tend to disagree. While I don't see meaningful differences in _reasoning power_ between frontier models, I do see differences in the way they interact with my prompts.

I use exclusively Anthropic models because my interactions with GPT are annoying:

- Sonnet/Opus behave like a mix of a diligent intern and a peer. They do the work, don't talk too much, give answers, etc.

- GPT is overly chatty, borderline calls me "bro", and tends to brush off issues I raise with "it should be good enough for general use", etc.

- I find that GPT hardly ever steps back when diagnosing issues. It picks a possible cause, and enters a rabbit hole of increasingly hacky / spurious solutions. Opus/Sonnet often steps back when the complexity increases too much, and looks for an alternative.

- I find Opus/Sonnet to be "lazy" recently. Instead of systematically doing an accurate search before answering, it tries to "guess", and I have to spot it and directly tell it to "search for the precise specification and do not guess". Often it would tell me "you should do this and that", and I have to tell it "no, you do it". I wonder if it was done to reduce the number of web searches or compute that it uses unless the user explicitly asks.

0xbadcafebee 4 hours ago||
Compare their system prompts and the agent harness logic. It's 99% of what makes the agent useful, and it can be quite different.