Posted by speckx 1 day ago
If you really want to see a radical shakeup that would have some very exciting effects, could I interest you in a little Total Atomic Anihilation?
The napkin math resulted that renting is around 27 times cheaper than owning (not including power). I think we're really screwed when it comes to having owned access to AI unless intel comes out swinging with a c series card that has 128gb vram so we can run them in a 4x128gb configuration, but seems unlikely since nvidia has a large share in them.
This was calculated expecting around 30tok/s, of course you can get 2-5tok/s much much cheaper, but it's unusable for my workflow.
Everyone else charges a ridiculous amount but Deepseeks API is $0.003625 / M tok.
I'm surprised no one talks about this because of how significant it is. GPT 5.5 for example costs a ridiculous $0.50 / M tok cached. It's literally almost 140 times cheaper which matters a lot for tool calls.
> The deepseek-v4-pro model is currently offered at a 75% discount, extended until 2026/05/31 15:59 UTC.
However even when the discount ends its still very cheap. It will go back to $0.0145 / M cache hit. That's still 34x cheaper than GPT 5.5.
If you serve a single user you'll never get your electricity price back, nevermind hardware costs.
> replace them with terminals that stream everything to the cloud
they've been trying for a looong time on that one. i still remember those junky "net appliance"s from the early 2000s [0] and oracle and sun making big statements about them...[0] https://www.ecommercetimes.com/story/sonys-evilla-joins-audr...