Posted by adocomplete 12 hours ago

Claude Sonnet 4.6 (www.anthropic.com)
https://www.anthropic.com/claude-sonnet-4-6-system-card [pdf]

https://x.com/claudeai/status/2023817132581208353 [video]

955 points | 859 comments | page 5
excerionsforte 10 hours ago|
I'm impressed with Claude Sonnet in general. It's been doing better than Gemini 3 at following instructions. Gemini 2.5 Pro March 2025 was the best model I ever used, and I feel Claude is reaching that level, even surpassing it.

I subscribed to Claude because of that. I hope 4.6 is even better.

baalimago 10 hours ago||
I don't see the point of these models, or the hype around them, anymore. Until the price is reduced significantly, I don't see the gain. They've been able to solve most tasks just fine for the past year or so. The only limiting factor is price.
reed1234 10 hours ago|
Efficiency matters too. If a smarter model solves the same task with fewer tokens, that matters more than $/Mtok.
nubg 11 hours ago||
My takeaway is: it's roughly as good as Opus 4.5.

Now the question is: how much faster or cheaper is it?

Bishonen88 11 hours ago||
40% cheaper: https://platform.claude.com/docs/en/about-claude/pricing
amedviediev 10 hours ago|||
But what about the real price in real agentic use? For example, Opus 4.5 was more expensive per token than Sonnet 4.5, but it used a lot fewer tokens, so the final price per completed task was very close between the two, with Opus sometimes ending up cheaper.
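To make the per-task point concrete, here is a minimal sketch in Python. The list prices are the ones quoted elsewhere in this thread ($3/$15 per million tokens for Sonnet, $5/$25 for Opus). The `Pricing` class, the `task_cost` helper, and every token count are hypothetical, chosen only for illustration and not measured from any real run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """List price in USD per million tokens (input, output)."""
    input_per_mtok: float
    output_per_mtok: float


def task_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one completed task at the given token usage."""
    total = (input_tokens * pricing.input_per_mtok
             + output_tokens * pricing.output_per_mtok)
    return total / 1_000_000


# Prices as quoted in this thread.
SONNET = Pricing(input_per_mtok=3.0, output_per_mtok=15.0)
OPUS = Pricing(input_per_mtok=5.0, output_per_mtok=25.0)

# ASSUMPTION: made-up token counts for the same task. The pricier model
# is assumed to finish with 60% of the tokens the cheaper one needs.
sonnet_cost = task_cost(SONNET, input_tokens=500_000, output_tokens=50_000)
opus_cost = task_cost(OPUS, input_tokens=300_000, output_tokens=30_000)

print(f"Sonnet: ${sonnet_cost:.2f} per task")
print(f"Opus:   ${opus_cost:.2f} per task")
```

Under these assumed numbers the two come out equal. With the prices above, the break-even point is Opus using 60% of the tokens Sonnet uses for the same task; anything below that would make Opus cheaper per task despite the higher $/Mtok.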
worldsavior 11 hours ago|||
How does it work exactly? How is this model cheaper while having the same perf as Opus 4.5?
red2awn 7 hours ago|||
Distilling from a teacher (Opus 4.5) and scaling RL more.
anthonypasq 11 hours ago|||
this is called progress
worldsavior 8 hours ago|||
I'm asking technically how progress works. What is actually being improved here?
metaltyphoon 10 hours ago|||
Or, we can bleed out cash for a very long time.
sxg 11 hours ago|||
How can you determine whether it's as good as Opus 4.5 within minutes of release? The quantitative metrics don't seem to mean much anymore. Noticing qualitative differences seems like it would take dozens of conversations and perhaps days to weeks of use before you can reliably determine the model's quality.
johntarter 10 hours ago||
Just look at the testimonials at the bottom of the introduction page; there are at least a dozen companies, such as Replit, Cursor, and GitHub, that have early access. Perhaps the GP is an employee of one of these companies.
vidarh 11 hours ago|||
Given that the price remains the same as Sonnet 4.5, this is the first time I've been tempted to lower my default model choice.
freeqaz 11 hours ago|||
If it maintains the same price (which Anthropic tends to do, or it undercuts itself) then this would be 1/3rd of the price of Opus.

Edit: Yep, same price. "Pricing remains the same as Sonnet 4.5, starting at $3/$15 per million tokens."

Bishonen88 11 hours ago||
3 is not 1/3 of 5 tho. Opus costs $5/$25
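A quick check of that ratio, using only the two price pairs quoted in this thread (the variable names are just illustrative):

```python
# Prices in USD per million tokens, as quoted above.
sonnet_in, sonnet_out = 3.0, 15.0
opus_in, opus_out = 5.0, 25.0

ratio_in = sonnet_in / opus_in      # Sonnet input price as a fraction of Opus
ratio_out = sonnet_out / opus_out   # Sonnet output price as a fraction of Opus
discount = 1 - ratio_in             # how much cheaper Sonnet is per token

print(f"input ratio:  {ratio_in:.2f}")
print(f"output ratio: {ratio_out:.2f}")
print(f"discount:     {discount:.0%}")
```

Both ratios work out to three fifths, which lines up with the "40% cheaper" figure given earlier in the thread rather than one third.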
eleventyseven 11 hours ago||
> That's a long document.

Probably written by LLMs, for LLMs

adt 11 hours ago||
https://lifearchitect.ai/models-table/
taytus 3 hours ago||
Honest question: why would anyone use Opus instead of this? I’m doing web development, the whole shebang, and I don’t think I need Opus right now. I know it’s supposed to be smarter, but a 2%–5% improvement doesn’t seem meaningful, especially when it costs more than double and has only a portion of the context window.

Am I getting this wrong? I would seriously appreciate any clarification here.

simianwords 11 hours ago||
I wonder what difference people have found between Sonnet 4.5 and Opus 4.5; a similar delta will probably remain.

Was Sonnet 4.5 much worse than Opus?

dpe82 11 hours ago|
Sonnet 4.5 was a pretty significant improvement over Opus 4.
simianwords 11 hours ago||
Yes, but it’s easier to understand the difference between Sonnet 4.5 and Opus 4.5 and apply that difference to Opus 4.6.
dr_dshiv 10 hours ago||
I noticed a big drop in opus 4.6 quality today and then I saw this news. Anyone else?
micw 10 hours ago|
I'd say Opus 4.6 was never better for me than Opus 4.5. It only did more thinking and was slower and more verbose, but it succeeded on the same tasks and failed on the same ones as 4.5.
andrewchilds 10 hours ago||
You're not alone: https://github.com/anthropics/claude-code/issues/23706
doctorpangloss 11 hours ago||
Maybe they should focus on the CLI not having a million bugs.
esafak 9 hours ago|
It actually looked at the skills, for the first time.