Posted by adocomplete 10 hours ago

Claude Sonnet 4.6(www.anthropic.com)
https://www.anthropic.com/claude-sonnet-4-6-system-card [pdf]

https://x.com/claudeai/status/2023817132581208353 [video]

889 points | 801 comments | page 4
quacky_batak 9 hours ago|
With such a huge leap, I’m confused why they didn’t call it Sonnet 5. As someone who uses Sonnet 4.5 for 95% of tasks due to cost, I’m pretty excited to try 4.6 at the same price.
Retr0id 9 hours ago||
It'd be a bit weird to have the Sonnet numbering ahead of the Opus numbering. The Opus 4.5->4.6 change was a little more incremental (from my perspective at least, I haven't been paying attention to benchmark numbers), so I think the Opus numbering makes sense.
Sajarin 9 hours ago||
Sonnet numbering has been weirder in the past.

Opus 3.5 was scrapped even though Sonnet 3.5 and Haiku 3.5 were released.

Not to mention Sonnet 3.7 (while Opus was still on version 3)

Shameless source: https://sajarin.com/blog/modeltree/

cobolexpert 3 hours ago||
I like this tree visualization! The background with little squares is making the text difficult to read, though.
yonatan8070 9 hours ago||
Maybe they're numbering the models based on internal architecture/codebase revisions and Sonnet 4.6 was trained using the 4.6 tooling, which didn't change enough to warrant 5?
mfiguiere 9 hours ago||
In Claude Code 2.1.45:

  1. Default (recommended)   Opus 4.6 · Most capable for complex work
  2. Opus (1M context)       Opus 4.6 with 1M context · Billed as extra usage · $10/$37.50 per Mtok
  3. Sonnet                  Sonnet 4.6 · Best for everyday tasks
  4. Sonnet (1M context)     Sonnet 4.6 with 1M context · Billed as extra usage · $6/$22.50 per Mtok
michaelcampbell 9 hours ago|
Interesting. My CC (2.1.45) doesn't provide the 1M option at all. Huh.
minimaxir 8 hours ago||
Is your CC personal or tied to an Enterprise account? Per the docs:

> The 1M token context window is currently in beta for organizations in usage tier 4 and organizations with custom rate limits.

minimaxir 3 hours ago|||
Update: On my personal Claude Code I have access to the 1M model endpoints, so I'm confused.
michaelcampbell 8 hours ago|||
The one I'm looking at right now is some sort of company-level sub, so they probably have the upcharge options turned off.

Thanks!

belinder 10 hours ago||
It's interesting that the request refusal rate is so much higher in Hindi than in other languages. Are some languages more ambiguous than others?
vessenes 10 hours ago||
Or some cultures are more conservative? And it's embedded in language?
phainopepla2 9 hours ago||
Or maybe some cultures have a higher rate of asking "inappropriate" questions
vessenes 9 hours ago||
According to whom, though, good sir??

I did a little research in the GPT-3 era on whether cultural norms varied by language - in that era, yes, they did

longdivide 10 hours ago|||
Arabic is actually higher, at 1.08% for Opus 4.6
andrewmcwatters 9 hours ago||
[dead]
astlouis44 9 hours ago||
Just used Sonnet 4.6 to vibe code this top-down shooter browser game, and deployed it online quickly using Manus. Would love to hear feedback and suggestions from you all on how to improve it. Also, please post your high scores!

https://apexgame-2g44xn9v.manus.space

nerdralph 4 hours ago||
The mouse is invisible on the splash screen, except for when I manage to move it over the play button.
Flowsion 9 hours ago|||
That was fun, reminded me of some flash games I used to play. Got a bit boring after like level 6. It'd be nice to have different power-ups and upgrades. Maybe you had that at later levels, though!
Dowry9092 8 hours ago||
Power-ups or scaling weapons would be fun! Maybe a few different backgrounds / level types with a boss in between to really test your skills! Minigun OP IMO.
astlouis44 7 hours ago||
Updated version: https://apexgame-2g44xn9v.manus.space/
KGC3D 7 hours ago||
I don't really understand why they would release something "worse" than Opus 4.6. If it's comparable, then what is the reason to even use Opus 4.6? Sure, it's cheaper, but if so, then just make Opus 4.6 cheaper?
acuozzo 7 hours ago|
It's different. Download an English book from Project Gutenberg and have Claude-code change its style. Try both models and you'll see how significant the differences are.

(Sonnet is far, far better at this kind of task than Opus is, in my experience.)

baalimago 8 hours ago||
I don't see the point of these models, or the hype around them, anymore. Until the price is reduced significantly, I don't see the gain. They've been able to solve most tasks just fine for the past year or so. The only limiting factor is price.
reed1234 8 hours ago|
Efficiency matters too. If a smarter model solves the same task with fewer tokens, that matters more than $/Mtok.
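
The efficiency point can be made concrete with a rough per-task cost calculation. This is only a sketch: the per-Mtok prices are the $3/$15 and $5/$25 figures quoted elsewhere in this thread, while the token counts and the `task_cost` helper are hypothetical, chosen purely for illustration.

```python
def task_cost(input_tokens, output_tokens, input_price, output_price):
    """USD cost of one task; prices are given per million tokens (Mtok)."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Hypothetical token usage: the cheaper model needs twice the tokens
# to finish the same task. These counts are assumptions, not measurements.
cheaper_model = task_cost(400_000, 60_000, input_price=3.00, output_price=15.00)
pricier_model = task_cost(200_000, 30_000, input_price=5.00, output_price=25.00)

print(f"cheaper $/Mtok model: ${cheaper_model:.2f} per task")
print(f"pricier $/Mtok model: ${pricier_model:.2f} per task")
```

Under these assumed token counts the model with the higher $/Mtok price ends up cheaper per completed task; with different counts the comparison could easily go the other way.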
excerionsforte 9 hours ago||
I'm impressed with Claude Sonnet in general. It's been doing better than Gemini 3 at following instructions. Gemini 2.5 Pro (March 2025) was the best model I ever used, and I feel Claude is reaching that level, even surpassing it.

I subscribed to Claude because of that. I hope 4.6 is even better.

nubg 10 hours ago||
My takeaway is: it's roughly as good as Opus 4.5.

Now the question is: how much faster or cheaper is it?

Bishonen88 9 hours ago||
40% cheaper: https://platform.claude.com/docs/en/about-claude/pricing
amedviediev 8 hours ago|||
But what about the real price in real agentic use? For example, Opus 4.5 was more expensive per token than Sonnet 4.5, but it used far fewer tokens, so the final price per completed task was very close between the two, with Opus sometimes ending up cheaper.
worldsavior 9 hours ago|||
How does it work exactly? How is this model cheaper while having the same perf as Opus 4.5?
red2awn 5 hours ago|||
Distilling from a teacher (Opus 4.5) and scaling RL more.
anthonypasq 9 hours ago|||
this is called progress
worldsavior 6 hours ago|||
I'm asking technically how progress works. What is actually being improved here?
metaltyphoon 8 hours ago|||
Or, we can bleed out cash for a very long time.
sxg 10 hours ago|||
How can you determine whether it's as good as Opus 4.5 within minutes of release? The quantitative metrics don't seem to mean much anymore. Noticing qualitative differences seems like it would take dozens of conversations and perhaps days to weeks of use before you can reliably determine the model's quality.
johntarter 8 hours ago||
Just look at the testimonials at the bottom of the introduction page. At least a dozen companies, such as Replit, Cursor, and GitHub, have early access. Perhaps the GP is an employee of one of these companies.
vidarh 9 hours ago|||
Given that the price remains the same as Sonnet 4.5, this is the first time I've been tempted to lower my default model choice.
freeqaz 10 hours ago|||
If it maintains the same price (which Anthropic tends to do, or else it undercuts itself), then this would be 1/3rd of the price of Opus.

Edit: Yep, same price. "Pricing remains the same as Sonnet 4.5, starting at $3/$15 per million tokens."

Bishonen88 9 hours ago||
3 is not 1/3 of 5 tho. Opus costs $5/$25
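
The arithmetic behind the "40% cheaper" and "not 1/3" replies can be checked in a few lines. This sketch uses only the list prices quoted in this thread ($3/$15 for Sonnet 4.6, $5/$25 for Opus); the helper names are made up for illustration.

```python
# Per-million-token list prices quoted in the thread, as (input, output) in USD.
SONNET_4_6 = (3.00, 15.00)
OPUS = (5.00, 25.00)

def price_ratio(cheaper, pricier):
    """Ratio of the cheaper model's price to the pricier one, per token type."""
    return tuple(c / p for c, p in zip(cheaper, pricier))

def percent_cheaper(cheaper, pricier):
    """Percentage saved per token type by choosing the cheaper model."""
    return tuple(round((1 - c / p) * 100, 1) for c, p in zip(cheaper, pricier))

ratio = price_ratio(SONNET_4_6, OPUS)
savings = percent_cheaper(SONNET_4_6, OPUS)

print(ratio)    # (0.6, 0.6): three fifths of the Opus price, not one third
print(savings)  # (40.0, 40.0): 40% cheaper on both input and output
```

This compares list prices per token only; it says nothing about how many tokens each model needs for a given task.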
eleventyseven 10 hours ago||
> That's a long document.

Probably written by LLMs, for LLMs

adt 10 hours ago|
https://lifearchitect.ai/models-table/