Top
Best
New

Posted by simonw 1 hour ago

I think Anthropic and OpenAI have found product-market fit(simonwillison.net)
125 points | 133 comments
trjordan 1 hour ago|
They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down.

This means we're going to need $1t+ per year in spending, per year, on tokens. 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

solenoid0937 1 minute ago||
> 20% if you're a developer. That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

Of course it will. The value of an employee is a multiple of what they get paid.

If you pay an employee $500k and they make $2M for your company (like Meta), then of course a 20% increase for the salary is justified if the velocity is increased 20% as well.

regularfry 13 minutes ago|||
The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build. The more of the latter they can take on, the fewer knowledge workers are needed at all. So rather than 5% of every knowledge worker's salary going into tokens, 100% of the knowledge worker's total employment cost goes into tokens and you get a 20x productivity boost as a theoretical minimum across those tasks.

That's the game. There's a view you could take of this that this is just a growing of the pie: with those cost dynamics a lot more "small businesses" get a vast amount of leverage, so the overall economy grows without replacing the knowledge workers. I'm not sure I trust the MBA class to have that view.

seanp2k2 16 seconds ago|||
>The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build

I would argue that that's been the case for quite some time before AI. As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade with their very high numbers of very talented and highly-compensated engineers? The issue with most big tech companies are leadership, strategy, and product direction. I'm not saying that they don't make any profits, just that they probably aren't "building [the right thing]".

AI for product development and management would be far more impactful than automating rote coding tasks / building React UIs that mirror API structures IMO.

layer8 9 minutes ago|||
Who pays for that value, and from what, if all knowledge workers lose their jobs?

It sounds like the economy would largely reduce to the small minority class of independently wealthy people.

onlyrealcuzzo 34 minutes ago|||
> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens.

They are assuming ~10% global GDP growth instead of ~3%. You probably don't need the same %s if the pie grows a ton.

I'm highly skeptical we get that growth, but if you aren't, it makes it easier to digest.

freakynit 14 minutes ago||
I mean this case with AI-productivity fires itself back when we talk about GDP.

The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently.

A third effect also comes into play that once all this starts to happen, common people, who are generally living paycheck to paycheck, will now start to hesitate towards making any long term investment, housing included. And that indirectly will end up impacting financial and banking sector, which will then impact existing savings, bonds yields and retirement funds, and the recession-like cycle starts.

This productivity increase only makes sense if it is capped to a very small number.. like 20% max. Beyond that, who these companies will even be selling to?

Am I overthinking all this?

simonw 10 minutes ago||
> The more AI causes productivity increases, the less and less number of workers will be needed.

That only holds if companies have a fixed need for "productivity" which is met by their current employees, such that their employees becoming more productive means they need less of them.

Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

But generally yes, the biggest open question about all of this is how the impact will play out on the economy, job opportunities etc. I've not seen anyone come close to a confident prediction about how this will play out.

jbreckmckye 3 minutes ago||
> Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

I mean sure. Every company wants an infinite addressable market. But that doesn't mean it exists.

It might not be possible to sell 10x the software we sell today. It might not even be possible to sell 2x

jgbuddy 37 minutes ago|||
You are making the assumption that the models are only used / paid for by 2.5% of the population (your knowledge workers value). There will be new value created by these models which people are happy to pay for which simply did not exist at all before. It is also naive to say that the hyperscalers are going to be expecting a return on this in 5 years, it will be entirely propped up by investments / IPOs as has been the case with any tech company for decades now to reach scale. The hyperscalers are currently spending ~650b combined annually, which they have the cash for and can sell in future compute instantly.
specproc 8 minutes ago||
I'm sorry, what the feck does "value creation" mean here? I live in a place where people are so, insanely squeezed from every angle. Wages are stagnant, prices rocketing. Where is the money to pay for this value going to come from?

No one I know feels richer than they did a decade back. I've not been able to meaningfully put up my prices for a decade. People are tired and stressed and scared, particularly scared of a technology everyone keeps telling them will make them redundant.

There is no rising tide lifting all boats, just most of us drowning whilst a few whizz past in their yachts.

I honestly hope these guys faceplant ASAP. Couldn't happen to a nicer bunch of people.

TimTheTinker 10 minutes ago|||
I thought Anthropic and OpenAI's combined CapEx has been <100B?

source: https://isaiprofitable.com/

browningstreet 26 minutes ago|||
Somehow Uber and WeWork survived the same kind of grand projections that they never met.
121789 21 minutes ago|||
uber sure....but how did wework survive? they are a smoldering husk of a failed company looted by its founder
naravara 10 minutes ago||
The company’s gone but the assets just got sold to other commercial real estate firms.

Uber was basically only ever software to help people use their own cars so a very small part of their valuation was physical stuff to upkeep, it was just deals and obligations they had.

Not sure how it shakes out for Anthropic and OpenAI. There’s a lot of physical capacity that needs to be built out and can depreciate. But there’s also a lot of network effects and dependencies being built in with enterprise users.

I don’t know how swappable the tooling is either. I think over the long term the UI, model training and documentation, and infrastructure are going to end up being run by different parties and I’m not sure which leg of that chain ends up in a position to skim most of the profit off. My guess is that Apple and Google end up raking in all the money since they control the OS and app stores while the rest of the stack gets driven down to being generic commodities. At least where mass market consumer adoption is concerned.

windexh8er 12 minutes ago||||
The difference is that they had room to charge more of their customers and pay less to their workers. The AI industry doesn't have both sides to play at this point. Training and inference are getting more expensive and if you take on the high prices now you're just floating yourself further downstream from profitability long term (which does not look viable for any of them currently).
tapoxi 12 minutes ago||||
I don't think Uber was doing $1 trillion in infrastructure spend.
paxys 14 minutes ago||||
WeWork absolutely did not survive
xoac 25 minutes ago||||
somehow the invisible hand of the market is also blind af
ArcHound 6 minutes ago||
Makes sense if you think about it: if all photons pass through you (invisible) then you can't capture them to get info (blind).
seniorThrowaway 11 minutes ago|||
[dead]
sowbug 22 minutes ago|||
There is also the EV (expected value) of developing AGI. Even if you personally believe the probability is low within the lifetime of either of these companies, the value would still be extraordinarily high, enough to forgive a $5T or so miscalculation here or there.
jbreckmckye 17 minutes ago||
I don't think AGI was ever a serious endeavour, just something the labs talked up to grab attention.

I am willing to bet a Twix we'll look back on that stuff in 2 years with a lot of embarrassment

jmyeet 12 minutes ago|||
YEPPP... and I'm kind of shocked at how many people can't do simple math.

Let's put it context. Google's annual revenue seems to be north of $400B. So if OpenAI suddenly had Google's revenue, it would still be insufficient to recover their investment.

and it's a ticking time bomb because $1T in servers, CPUs, GPUs and memory is going to be worth $200B in 5 years. You can say they can keep using what they've got. Sure. But they're also not going to stop spending on new hardware. And the competitor that comes along in 5 years and spends $1T doing the exact same thing is going to have a huge advantage.

OpenAI at this point reminds me very much of the Russ Henneman pre-money hype cycle.

hansmayer 4 minutes ago||
This should be the top comment. Also, I think its not that many people, including our Simon here, are not good at math. Its more like, some of them seem to be incentivised to not be caugh, caugh, "good at math". How else will the hype sell?
EGreg 58 minutes ago|||
Here is a serious question.. Can we sell into the hype cycle and on the way down with this: https://safebots.ai/costs.html
adithyassekhar 42 minutes ago||
I asked claude to generate a frontend and it made the same template. Same san serif and serif fonts together. Same colors. Same typography. Same layout and animations even. It’s wild how similar it is. No not similar it’s the same damn thing.
dd8601fn 28 minutes ago||
I’ve seen the same dashboard for a dozen custom web applications now, including a couple I had it make for me.

It really does have a particular lane for each chore, and it’s reproducible.

properbrew 3 minutes ago||
Yep and when you see it in the wild it stands out like a sore thumb, absolutely no thought into a bit of a unique design or branding.

I have a few live websites built using LLMs and they will just go for default generic templates and colours if there's no vision.

HDThoreaun 35 minutes ago|||
Source on 200 million knowledge workers worldwide? My understanding is that it's just above 1 billion. I dont think a billion subscriptions at $1000/yr is out of the question but it might take a decade to get roiling
swatcoder 8 minutes ago|||
You're suggesting that 1 in 8 people worldwide, including every one from infants and the elderly, are knowledge workers. Are you sure that's what you mean?

I'm not even sure that 1 in 8 people I know would qualify as a knowledge worker, let alone a knowledge worker that might profoundly benefit from on-the-horizon AI. And I'm in a highly skewed population.

rootusrootus 29 minutes ago|||
A billion? Really? At 200M you’re already including a lot of people that stretch the definition of knowledge worker.
naravara 6 minutes ago|||
A lot of those ‘edge cases’ in the definition of “knowledge worker” are probably the stuff that’s most likely to have significant parts of the work augmented or replaced by AI agents. Like, call-centers are almost certainly going to get turned over in a big way. It’s not like the median tier-1 support operator just reading off a script is much better than an LLM anyway.
esseph 19 minutes ago||||
Yeah, just looked into this. Knowledge workers is a big group and probably much larger than you think it is.

Basically if you're not doing manual labor, it's probably knowledge work.

Roughly 1/3rd of the working population.

Some data tucked in here: https://gist.github.com/danielmiessler/2dc039762a202b083753b...

HDThoreaun 23 minutes ago|||
> At 200M you’re already including a lot of people that stretch the definition of knowledge worker.

How do you know this? Im certainly open to recalibrating my numbers which is why I asked for the source

windexh8er 16 minutes ago||
What's your source, because it looks wildly out of proportion compared to numbers we have now.
elliotec 5 minutes ago|||
Here's a source from 2019 that says: "By 2023, the number of knowledge workers in the world will increase to 1.14 billion, with more than four-fifths of that growth coming from the emerging world."

https://www.gartner.com/en/newsroom/press-releases/09-24-201...

HDThoreaun 12 minutes ago|||
I googled "number of knowledge workers worldwide" and read the top results. If you read it as I was confident in a billion I apologize, Im just trying to get an accurate count. What numbers do you have now and where did you find them?
YetAnotherNick 39 minutes ago|||
> $5t to $10t to make back in the next 5 years

Wait what? They spent 2 order of magnitude less on hardware.

trjordan 38 minutes ago||
From the verge: https://archive.is/kU4Zg

> Gartner forecasts that large AI companies would need to earn cumulatively close to $7 trillion in AI-driven revenue through 2029, which is close to $2 trillion per year by the end of the period. In order to achieve “historic returns,” the providers would need to earn nearly $8.2 trillion in the same period.

YetAnotherNick 33 minutes ago|||
Those numbers don't even track even in the same sentence. If it is $2T/year by the end of 2029, it would be something < $6T cumulative in 3 years.
b0r3dthisD4y 31 minutes ago|||
The numbers are made up political correctness anyway.

Everyone's agency is 100% captured by belief in Wall Street. Too few <50 have any meaningful labor skills to blink.

We'll continue to have consent manufactured via media platforms and in 3 years no one will bat an eye at these companies being worth $12 trillion as Altman and Musk climb two ladders holding a "mission accomplished" banner.

ar_lan 30 minutes ago||
> unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

Simple - you make them work 2x, 5x, or 10x more hours.

OtomotO 26 minutes ago||
There are not enough hours to do that
hansmayer 1 minute ago||
> I currently subscribe to the $100/month Max plan from Anthropic and the $100/month Pro plan from OpenAI. If you are a heavy user of coding agents these plans are a fantastic deal.

Agreed. But its only a great deal because it is heavily subsidized, as you said yourself. Enjoy while it lasts, but in my book, product-market fit means something along the lines of "product which enjoys a loyal customer base, sold at a price perceived fair by the customers, and generating profit. How many of these does your definition of product-market fit hit here?

prepend 1 hour ago||
> $2,180.16 worth of tokens for $200

“Tokens” don’t have an intrisic cost or value. Saying that I used $2,180.16 worth of tokens is like relying on the salesperson to convince me I’m getting a billion dollars worth of pots and pans for $19.99.

I think it’s funny how we are throwing critical thinking out the window when it comes to evaluating biased sources of info.

simonw 1 hour ago||
I'm not sure what you're pushing back against here.

I spent $200. If I had been paying API pricing it would have been $2,180.16. The article is about how enterprise customers get charged API pricing, which means if I had been employed by one of those companies I would have cost them $2,180.16.

What am I missing?

eqvinox 24 minutes ago|||
Just because API pricing would've been $2180.16 doesn't mean that's the value of those tokens. For starters, you personally probably wouldn't have paid that. But also, sales price isn't value. This is like saying, oh, I saw this bar of gold somewhere for $10000 but got it here for $1000! So I got $10000 worth of gold for $1000! - no, the value of that gold is determined by its weight, which wasn't even mentioned.

We have no market convergence on tokens yet (and it'll differ between LLMs), so it's impossible to say what value you got for your $200.

OrangeDelonge 52 minutes ago||||
Large enterprises make deals and won’t be paying 2,180.16$ either. Just like with AWS
simonw 46 minutes ago|||
That doesn't seem to be the case. From what I've seen enterprise deals get API pricing now. Have you seen evidence that's not true?
roomey 12 minutes ago|||
Hi Simon, nice article. The parent there may be making the same assumption I am, that large enterprise _never_ pays sticker price.

Also, to just color in the picture here, as I haven't seen it mentioned elsewhere, there is a very large Saas company at the moment who has given everyone unlimited tokens on Claude. And they have a dashboard showing who spends the most. So the "budget" went from about USD500 per per person (split between Claude and cursor) in Jan to... Well a soft limit of USD100k... Per month... Per person.

People can still see the top line sticker price on their spend, but honestly I can't believe that the Saas is paying that full price when the invoice comes in.

That said, there are some finance reports which are probably dropping soon where we will find out!

simonw 7 minutes ago||
> The parent there may be making the same assumption I am, that large enterprise _never_ pays sticker price.

I shared that assumption until yesterday, when I found out that it wasn't holding for LLM pricing from OpenAI and Anthropic. That's what inspired me to write this piece.

I think those token leaderboards are an obviously terrible idea and will go extinct very quickly now that people are paying attention to costs.

themgt 29 minutes ago|||
I do know of moderate-size companies deploying OSS LLMs on their own GPU clusters, for ownership/security/maybe cost reasons. I'm somewhat surprised F500 companies are apparently just handing over all their data to the model providers.

Could be fantastic for small shops while it lasts. The big guys have to pay 10x for precious tokens.

Anon1096 20 minutes ago||||
Claude is so in demand at the moment that there aren't really volume discounts. Anthropic sets the terms and you either accept them or get lost they have that much of a lead (mindshare/desirability wise).
waisbrot 46 minutes ago||||
And "large" just means that AWS will assign an account manager to talk with you. I was at a start-up who spent $300k/year on AWS and that was enough to get special attention and discounts. Enterprise pricing is confusing.
apsurd 45 minutes ago|||
The point is that those a real prices real people are paying for real API usage. it's not made up.

your point is large players won't pay those prices at massive volume. ok

altruios 45 minutes ago||||
> If I had been paying API pricing it would have been $2,180.16

The point being made above is that API pricing is calculated... somehow... seemingly arbitrarily. Possibly untethered to the infrastructure costs entirely: which would be the basis of any 'value', however that holds the labor theory of value, which isn't accurate either. So how do you accurately price these tokens at all (other than through price-discovery: which is slow, messy and fuzzy)?

NitpickLawyer 36 minutes ago|||
> So how do you accurately price these tokens at all

Like anything else in the economy: at the point where enough customers can pay you, and not enough will go to the cheaper competition.

827a 13 minutes ago|||
I love HackerNews. God its fantastic. Only on HackerNews can you find these deranged personalities who think the pricing model of a near-trillion dollar company is determined "seemingly arbitrarily".
simonw 5 minutes ago||
Fun fact: the $20/month subscription fee for ChatGPT Pro - which set the standard for at least a couple of years - really was an arbitrary decision made based on a Google form: https://simonwillison.net/2025/Aug/12/nick-turley/
pembrook 39 minutes ago|||
API pricing drops DRAMATICALLY in enterprise agreements.

As with pretty much anything priced on volume/usage.

Enterprise deals are negotiated ad-hoc, the listed pricing is simply a jumping off point for the final negotiated discount.

If you’re going to give 20,000 employees Claude code you are not going to be spending $1B per year on Anthropic tokens as if you gave everyone an individual API key. Just as Anthropic isn’t paying AWS SES $10,000,000 to send 1 email update to their massive user base when the next Claude version drops.

taude 23 minutes ago|||
This isn't true at the moment, though. So far there hasn't been the negotiating power. What happens is you end up capping usage for employees at a fixed amount. I think eventually, prices will come down and there will be discounts, but for enterprise accounts at least of our size (<5000), we're paying almost 100% retail, which kind of sucks, because it's expensive, and pretty easy to burn $50 to $100+ in a day, if you're not careful. In fact we got pushed off the former plan to the token-utility one at the last contract negotiation.

Going to be interesting to determing the metrics we give to engineers for determining whether the spend on this is worth it. Measuring PRs, lines of code committed, commits fully generated by agentic workflows, etc.....

simonw 35 minutes ago|||
> API pricing drops DRAMATICALLY in enterprise agreements

Do you have any numbers or reports to back that up?

troyastorino 58 minutes ago|||
Tokens do have a clearly calculable intrinsic cost. There's the marginal cost of production (i.e. the inference cost) and the amortized R&D cost that goes into the model producing them.

Yes, value is hard to calculate, but luckily market pricing mechanisms exist exactly for this purpose. There isn't a better number to use than what people are willing to pay for them.

So he's saying that on an enterprise plan, he'd be spending $2,180.16. He's not paying that much, but enterprises are.

john_strinlai 53 minutes ago|||
a little critical thinking led me to read that sentence as $2180 worth of tokens [at current api pricing]
jfrbfbreudh 28 minutes ago|||
Lol. They obviously have intrinsic cost, the floor being the cost of electricity. It’s hilarious how we are throwing critical thinking out the window when it comes to evaluating biased sources of info.
FergusArgyll 48 minutes ago|||
I think it's funnier that you can believe some things have an intrinsic cost and others don't
dylan604 59 minutes ago||
[dead]
realo 57 minutes ago||
200$ per month per seat is nothing .

A single 3D CAD license pack for the guys in our R&D group costs multiple thousands of dollars per seat, per month.

It's about time software seats get some love too.

smokel 42 minutes ago||
AutoCAD is $175 per user per month [1].

[1] https://www.autodesk.com/products/autocad/buy

bigbuppo 2 minutes ago||
AutoCAD is still the budget-friendly CAD program it has always been. You don't build big boats in AutoCAD.
chatmasta 45 minutes ago|||
Yeah, it’s nothing, and it’s also not the cost that enterprises are paying. As the article states, the price is $20 per seat per month, PLUS per-token API usage. Enterprises are paying consumption billing, not fixed rate oversubscribed “all you can eat per seat.”
avree 28 minutes ago|||
CATIA licenses which are the most expensive I've seen are roughly $600/month per user. Where are you seeing "thousands of dollars per seat"?
AlotOfReading 5 minutes ago||
CFD might reasonably be considered part of CAD and something like ansys costs about as much as catia. Still only doubles it though.
esafak 38 minutes ago||
How many guys is that? Every single white color worker is in the AI ICP.
smt88 30 minutes ago||
white collar*, not color

What does ICP mean?

simonw 25 minutes ago|||
Insane Clown Posse, though given the context here probably Ideal Customer Profile.
aerhardt 4 minutes ago||
I find this analysis confusing. PMF for coding was likely reached some time last year. Profitability, which is different, we don’t know. The article kind of confuses both without making a strong economic case or using numbers in a compelling way. I don’t understand what the Uber case has to do with this either. The Uber COO clearly said that at least in terms of ROI he’s not seeing the results either.

My take is the product has been very useful for coding (PMF) for months. But it’s certainly not useful at any cost…

hansmayer 6 minutes ago||
> Anthropic are strongly rumored to be about to have their first profitable quarter

No, its more like their own leak to WSJ and according to Ed Zitron -> seems to be heavily engineered via non-GAAP practices such as counting potential, but not realised revenue as actual revenue - the stuff for which I would be arrested if I did it at my company.

Also it appears according to Ed's analysis - strangely they seem to be projecting only that one quarter as profitable - potentially to calm the investors ahead of the IPO. Investor fraud anyone?

binary0010 1 hour ago||
So how do openai and anthropic plan to keep customers when GLM-5.1 is just as good and open source and a lot cheaper?

I don't see the business model working. My closest friend actually does automation software for large companies.

He does not use Claude or openai at all. He primarily uses gpt 120b on cerebras and glm-5.1 for heavy thinking work. And some other small models for various tasks. All open source.

And these systems are extremely useful for the businesses and are able to run fully automated pipelines that are very stable and fast.

We discuss this a lot, and we both think any business doing heavy agentic work on Claude and openai just aren't aware of exactly how good and cheap open source has gotten on the last year.

So... once the legacy businesses and developers catch up, won't Claude and openai be unable to recoup their costs?

peder 16 minutes ago||
> I don't see the business model working.

Same. It's a nightmare from a Porter's Five Forces perspective.

There will be a ton of businesses competing in this space, and there will be something of a moat due to how capital intensive the business can be, but there will still basically be infinite competitors.

Great for consumers.

mesmertech 1 hour ago|||
For coding you always want to go with the best model in the category, not something that would be the best model if we went 1 year back which GLM 5.1 is, and I'm saying that as a big fan of GLM cause I run a translation site where GLM is good enough for the price.

Most of the money right now is in coding. Openai and Anthropic just have to be 6 months ahead of SOTA open source models and they'll capture most of the enterprise and dev market

binary0010 54 minutes ago|||
Yes I'm an engineer (20 years most in games/graphics industry) and only use it for code. I've been using glm 5.1 this week a lot. I went in expecting another "decent" but not really "up to standard" open source model.

I highly doubt I'll ever use Claude again.

I think you are wrong about Claude being any significant level better

cassianoleal 28 minutes ago||
I've been mostly coding with GLM-5.1 as well and I agree with you. DeepSeek V4 Flash is another very good surprise. Incredibly cheap, fast and effective.
kgwgk 57 minutes ago||||
For coding like for everything else in life cost is a factor.
mesmertech 17 minutes ago||
Cost for the value delivered. Like if you offered the current SOTA open source models at $0.1/M, I still think I'd be using Opus or 5.5 at $30/M. Or say GPT 5 which was released Aug 25, I don't think I'd use it for coding for even $0.1. I'd def find other uses for it(translations, agentic workflows, prompt guards etc), but for coding I don't think I'd ever completely switch to a SOTA open model

Unless ofc there was an actual speed difference, only reason I'd be willing to go with a worse model couple of percent worse than current best model is if the speed was at least 5x higher. Looking forward to kimi k2.6 offered publicly by Cerebras

EGreg 59 minutes ago|||
Most work is not coding.

And also, people have it wrong… their models are not the main problem anymore. It’s the RAG

obsidianbases1 56 minutes ago||
Depending on RAG is a workflow problem, not an AI problem
smokel 53 minutes ago||
For coding assistance, I have tried OpenCode with several large open models through OpenRouter. All were fairly bad compared to Claude Opus. Could you provide some hints on how I should be holding these open models so that I might get more value out of them?

I agree with the common trope that open models lag behind by about a year, but something magical happened just around a year ago when the state of the art models became extremely useful. By this reasoning we're about to see open models perform well, but I'm afraid there is more to it than just waiting for another revolution around the sun.

Note, my application is coding assistance. Open models can be great for other purposes.

antman 51 minutes ago||
The costs are exorbitant and most software is not produced by companies with such a huge moat. Anthropic made a profit through their recent bait amd switch pricing. There is zero useful insights online to indicate whether this might die due to commoditisation with good enough open models or fail the race to get more people subsidising unsustainable growth with other people’s money. Who knows? In any case they dont seem to be able to drop usage costs so the business model seems based on wishes
j_w 2 minutes ago||
Continuing with your skepticism:

> Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff

> Enterprise customers are now paying API prices

How long before enterprise customers start to question the bill? Anthropic goes from not making money to doing pricing shakeup, and now they are making money and the biggest spenders are shocked at prices.

Seems like things are still very uncertain.

brokencode 45 minutes ago||
Usage costs will come down with better hardware. Hardware is improving rapidly each generation.
simonw 38 minutes ago||
That trend held true for the past three years, but it doesn't feel as safe to me now.

But memory costs are going way up. And both OpenAI and Anthropic bumped up the price of their frontier models in April.

StrauXX 32 minutes ago||
Algorithms are also improving. I believe it's very unlikely for these two improvements together to not result in one to two orders of magnitude cheaper cost per "intelligence". Of course, that might just make use cases that are too expensive today viable and thereby increase usage further.
sourcecodeplz 1 hour ago||
With deepseek and xiaomi mimo models slashing their prices 99%, I don't see a great future for openai / antrhopic with regards to their 1T valuations. Maybe 1T valuation will be the whole market, West + East.
skeledrew 43 minutes ago|
They'll still have their dedicated enterprise customers. I think the Chinese providers will pull more of the single users who're paying their own way, than those backed by company budget. And it's a pretty good split as the demand becomes better distributed, resulting in better service (I'll never forgot must how bad access to Claude became until they got access to Colossus) and less potential for lock-in (we really don't want there to be a duopoly, etc on good AI).
smokel 47 minutes ago|
Does this analysis factor in potential caching of tokens on the server side? It seems that if they organize things well (as a model provider), they can save quite a lot on that. Looking at my Cursor statistics makes it clear that the token calculations are not at all trivial.
simonw 42 minutes ago|
I believe the ccusage tool I used takes cached token pricing into account.
More comments...