Top
Best
New

Posted by ilreb 9 hours ago

AI's Affordability Crisis(blog.dshr.org)
203 points | 257 commentspage 2
jschveibinz 8 hours ago|
I don't have a crystal ball, but based on similar historical scenarios, I think that one or two of these companies will win--probably because of some unique application, delivery or trade secret that will drive 80% of their revenue.

Consider Google, Apple, Amazon, etc.

It's still early days...

CuriouslyC 8 hours ago||
The US govt is going to ban foreign models and foreign providers, and frontier labs are still cooked, because US companies will RLwash Chinese models to try and get in on the captive market. The frontier labs have already lost the war for coding, their next play is custom models for specific domains... Anthropic Galen for biomedical research, Anthropic Locke for legal analysis, etc, and you won't see _ANY_ intermediate work on the model, you will put in query, maybe get some questions fired back during work, and get a "final report."

Eventually the frontier labs will try to cut out the middle man once these models prove themselves and start doing partnerships with big firms in the domains, so they can take a % of the profits in perpetuity rather than just taking a one time payment. For example, after Anthropic Galen, they'll do a partnership with Pfizer to generate Ozempic-Superjacked and take 20% royalties on global sales.

hackingonempty 8 hours ago|||
> The US govt is going to ban foreign models

The people have a right to make and use whatever models they want, protected by the constitution. At a minimum, the models are described in research papers that are unquestionably protected speech. Skilled devs turn those into programs, also protected speech.

BigTTYGothGF 6 hours ago|||
> protected by the constitution

I don't see how.

8n4vidtmkvmk 7 hours ago||||
How could Trump ban tiktok then? And Fable for that matter.

Maybe you're somehow legally allowed to distribute and download the weights, but most of us can't run GLM 5.2 at home.

wqaatwt 1 hour ago|||
They “banned” the company, not the product. US government could ban US citizens from buying tokens from certain Chinese provider but there is no precedent for banning the usage of specific software if you run it yourself.
hdgvhicv 59 minutes ago||
Decss
verdverm 7 hours ago|||
You won't need a frontier size model for most tasks before long. Qwen 3.6 (small) punches way above its weight. I run it at home @8bit on an OEM Spark
kappar 6 hours ago|||
Second this, I am also running qwen 3.6 35b Q8 on a 5090 liquid getting around 250 tokens / second and it is plenty capable. I actually haven't even looked at models recently because I am happy with what I have.

And.. now I feel the need to look again. Darn, there goes my afternoon

dualvariable 7 hours ago|||
And corporations could run DeepSeek models on cloud hardware.
verdverm 6 hours ago||
You can run most open models on cloud hardware. Google Cloud gives you a click to deploy, but then you have saturation / ROI considerations, versus Google serving them up multi-tenant, per-token.
dualvariable 5 hours ago||
For ROI though you can run 24/7 agentic-style workloads, constantly churning through all your source code looking for security bugs (or whatever) and you DONT pay per-token costs.

A DeepSeek instance running 24/7 in a cloud provider will beat doing that with Claude which could bankrupt you with 100x more costs, even though it might find more.

And DeepSeek may find enough to keep your engineering team saturated and busy fixing things.

verdverm 2 hours ago||
This process works better with multiple models and not simply slinging Ai at it 24/7. It's not an ROI just because you keep the GPU busy. The signal to noise ratio is not there yet
skywhopper 7 hours ago||||
The US government isn’t supposed to be allowed to constrain speech, but they do have the power to constrain commerce, and they can ban the sale of AI services and AI-capable hardware if they choose.
andrekandre 6 hours ago||

  > but they do have the power to constrain commerce
its an interesting idea; i'd like to see someone claim buying/selling as a form of speech...
Supermancho 2 hours ago||
Citizens United got pretty close in the Supreme Court's 5–4 decision on January 21, 2010, ruling that corporations and unions cannot be prohibited from making independent political expenditures, citing First Amendment free speech protections.
andrekandre 22 minutes ago||
yea, that was the hint in my comment... just waiting for the other shoe to drop so to speak
giantrobot 8 hours ago||||
The current administration has repeatedly demonstrated they do not feel constrained by laws or the Constitution.
tmpz22 8 hours ago|||
Yes our passionate defense of Academia will surely survive Techno Oligarchs desire for a 20th vacation home
woeirua 6 hours ago||||
>The frontier labs have already lost the war for coding

This is a delusional take. Sorry, but anyone claiming this hasn't used Fable and compared it to the current best open source models. I see a lot of hype posting about GLM5.2. I see absolutely ZERO people using it in production compared to GPT 5.5 or Opus 4.8.

CuriouslyC 4 hours ago||
Coding agents are edging into diminishing returns for common tasks. The whole Opus 4.5+ arc shows this for a large swathe of people. Chinese Fable is likely <=6 months away. US Frontier labs are structurally disadvantaged in the long term so advantage is only going to skew China.
Analemma_ 8 hours ago|||
> The frontier labs have already lost the war for coding

You are way too deep in the HN bubble.

CuriouslyC 8 hours ago||
I'm looking at how market/human forces are going to make the game play out when extended to its logical conclusion, not the score on the scoreboard RIGHT now.
com2kid 8 hours ago|||
So long as Chinese labs keep writing white papers, trade secrets aren't going to win the day.

Having growth up in the 90s, it is weird seeing companies share their technology secrets publicly.

dofm 8 hours ago||
Wandering around pretending to be researchers who are only just figuring out how to make money is, for the short term, an incredibly good way to attract a load of naïve money; not all sharks are smart.

And it does, nowadays, give you a bit of a veneer of mere curiosity when you're being accused of massive theft.

sowbug 7 hours ago|||
We're seeing the first 20 years of the dot-com cycle, but compressed into two years, and trying hard not to fall into the tar pit of ad-supported services.
dualvariable 7 hours ago|||
I'd guess Anthropic will probably win, and LLMs will probably still be with us and be much better in 10 years time.

But next year we could be in the middle of a massive $600B/yr capital-spending bubble deflating hard with unemployment accelerating towards 10% (or higher).

The internet never failed, but the telcom/dotcom collapse still happened in 2001.

kajman 6 hours ago||
I'm sure investors thought one or two of the ISPs laying all that fiber would be collecting fat rents on them until the sun burnt out. I'm glad they got so much in the ground before there was a reckoning. I hope this industry ships more very expensive models, ASAP.
a34729t 7 hours ago||
Deepseek is 90% cheaper, and nearly as good for coding tasks as claude/codex, and as good given the right plan.

The only moat OpenAI and Anthropic have is regulation. If the Chinese really eant to hammer us, they could realse the full training data and pipeline.

thewebguyd 6 hours ago|
Even without doing that the Chinese are already going to impact our labs presence everywhere else in the world. With Fable getting pulled, any model coming out of the US is now unreliable and untrusted. No one in any other country would in their right mind choose OpenAI or Anthropic for anything.

The big push for regulation and export controls is only going to ensure OpenAI & Anthropic are more like the automakers. Only in business because of protectionism, left to screw over US consumers meanwhile the rest of the world gets to enjoy cheap EVs

a34729t 6 hours ago||
I have to push back on this: China's cheap EVs and power prices are due to industrial policy on an epic scale which goes directly against the whole free market thing. I personally think industrial policy is a good thing, but you cannot have it both ways and not expect workers to get unhappy and vote against your interests when they have no more jobs.
xboxnolifes 1 hour ago|||
Even within the definition of free market there exists degrees of freedom. On one side, that id expect most Americans to be familiar with, is laissez faire capitalism. Im not sure how far it goes into the other direction before it stops being considered a free market, but if the existence of government incentives on the market stops it from being a free market, the US is also not a free market.

That is to say, I believe free markets can exist along side government policy.

thewebguyd 5 hours ago|||
True, and its what they are going to do with LLMs as well. We know their playbook by now as they've repeated it over and over again across different industries.

But we can still protect domestic workers without screwing over consumers. Pure protectionism doesn't work, it'll only set us back and keep us behind. Just slapping on 100% tariffs or a complete import ban just lets domestic companies get lazy. The protectionism needs an expiry date so they can't hide behind it forever. We could also work to move supply chains out of adversarial nations and into friendly ones, but you know...that requires us to continue to have friends and allies.

A fully free market has been an illusion in the US for a very long time. We'd do well to do some of our own state-industrial planning.

recursivedoubts 7 hours ago||
Once locals get to Opus levels I think it we may see a phase change because that + a reasonably competent programmer is going to be a very powerful combination for most practical programming problems.

Frontier models may eventually achieve super-intelligence (no opinion beyond mild skepticism) but super-intelligence isn't necessary for most practical day-to-day programming. The problems, as always, become communication, understanding what users really need, etc. that is, softer skills.

wqaatwt 1 hour ago||
> Once locals get to Opus levels

I find it hard to imagine it would ever be cost efficient vs hosted/cloud i.e. you should always be able to run faster and/or better models remotely at a comparable price since its just way more efficient due to batching

cheonic52749 7 hours ago||
> Frontier models may eventually achieve super-intelligence but super-intelligence isn't necessary for most practical day-to-day programming

I think you forgot what super-intelligence means…

recursivedoubts 7 hours ago||
tbh, not sure i ever understood it
sdenton4 4 hours ago|||
In discussions of super intelligence and ai takeoff and such, I find it helpful to ask why the smartest humans usually aren't heads of state...
LoganDark 7 hours ago|||
A superintelligence is one that exceeds human intelligence in all areas. Which roughly translates to learning, adapting, and performing more quickly and efficiently than even the best humans. This is closely related to "the singularity", which is when technological growth becomes uncontrollable by humanity.
bryanlarsen 7 hours ago||
That sounds like a definition designed for goal shifting. This AI is better than human at 99.9% of things, but humans are better at pooping. Therefore we don't yet have a superintelligence.
wqaatwt 1 hour ago|||
A basic calculator is 100% better than humans at all the things its designed to do.
LoganDark 7 hours ago|||
If that AI were given an identical human body (and interface to that body) to someone who had not yet learned how to do that, and it outperformed them in figuring it out, then that would settle it.

Otherwise I don't see the comparison.

If I'm intelligent enough to use a tool, but I don't have the tool, that doesn't mean anyone who does have the tool is automatically more intelligent than me.

Likewise, comparing my performance without the tool against someone's performance with the tool wouldn't be benchmarking their performance, only benchmarking them with the tool's performance. The fairer comparison would be against me also with the tool.

titzer 7 hours ago||
The coming AI enshittification is going to be epic. For those of us who have been on the web for more than five minutes, we can see this a mile away.

If you think search ads are annoying, pre-roll YouTube ads are annoying, streaming ads are annoying, or basically ads-on-any-screen-anywhere-at-any-time are annoying, just wait until every stupid thing is powered by AI and is subtly trying to manipulate you to buy/watch/believe some crap all the time.

VorpalWay 48 minutes ago||
I'm not sure how that would legally work in EU: several countries have strict rules about ads having to be clearly distinguished from non-ads material. I know that UK has pretty strict rules about product placement in broadcasts too for example.

Yes they can do ads, but if they try to be subtle they will likely (eventually) be hit with fines.

Though, do the current rules apply to AI? Likely unclear. But if this becomes a problem I would expect new consumer protection regulation to be introduced aimed at this specific issue.

asdff 3 hours ago|||
It is already ruining forums including this one. People posting AI slop articles. Probably a good deal of commenters using AI slop to write their comments. Can't run away from it. Eventually you will have to step away entirely because you will just be interfacing mainly with AI content. I'm about fed up with forums at this point and I doubt I will be around them in the coming months at this rate of ever increasing slop peddling. I suspect more and more human contributors will feel the same and start slowly walking away, snowballing the slopification effect.

There is going to be a point soon where HN is just ai models posting ai articles to be filled with ai comments and for what reason exactly? I guess to try and train new ai slop company products into the datasets of various ai models to capture the budget spend of some ai middle manager model.

mannanj 40 minutes ago||
Wasn't this the end goal in the a totalitarian state in the novels by Orwell and sort of described in the movie Ready Player One?

The gist of it as I understand it is in a society where things are fake and incredibly extractive (where a select few, bourgeoisie or rich prioritize their interests over others like we see accelerating today) they limit the forums available for people to question them and peddle their interests on the select few that remain. If you sufficiently isolate the people, it's hard for them to tell whats real and eventually they come to accept the fake narratives as truth.

In an odd way, this sort of fits in with the theories about winners writing history, and all those weird, sort of conspiracy-laden accounts of human history having these odd unexplainable gaps or stories around it. I don't know about you, I think we are simply seeing those forces of the past working at preserving their interests and using the latest technology to do it.

bryanlarsen 7 hours ago||
Jeopardizing a $200/month subscription in return for $1/month in ad revenue seems insane. Using ads on a $20/month subscription to entice you into a $200/month one, OTOH...
titzer 7 hours ago||
They will almost certainly get more than $1/month in ad revenue for someone interacting with the AI for hours a day.
wqaatwt 7 hours ago||
This article seems to be struggling with telling apart the difference between R&D and operating expenses? The fact that AI companies are extremely unprofitable doesn’t mean they are subsidizing token costs, they still can have very decent gross margins on them
Gigachad 1 hour ago|
The accounting is all out of whack with these companies. If a GPU had a useful lifespan of around 3 years than it’s more a constant expense than R&D. They want you to believe they can cut the R&D at any time and be profitable but it’s likely they would completely collapse without that spend.
zhivota 13 minutes ago||
I don't see why 3 years is the right number there. I'm using a 10 year old model GPU to generate tokens locally, and given the bottlenecks, a commercial model focused on RAM and transfer speeds should have a longer depreciation curve than 3 years.
yalogin 7 hours ago||
The issue is the cost is not going to be a hindrance for companies that have gone all in on the AI development. They may still find it cheaper than hiring engineers and if needed they will layoff a few more.

The companies that did not yet jump on this bandwagon and are still evaluating will have a decision to make.

No matter what the AI companies are going to change their pricing strategy and it’s going to become a lot lot more expensive to use. I am just hoping the price stays like this until I am done with my big chunk of work

travisb 7 hours ago||
I think a lot of the cost comparisons to employees are off by a factor of 2 or more. AI is the ultimate contractor. Available instantly. Doesn't charge during idle periods. Pre-vetted and pre-trained. No contract negotiations or complex accounting.

That is worth a small multiple of the fully-loaded employee cost. So AI might be easily worth more than $200 per human-equivalent hour. With high utilization, that might be $8000-10000 a month.

With that kind of spend, AI provider financials looks less frightening.

lbreakjai 23 minutes ago||
AI is more like a union that controls the entire labour pool. I'm one out of a few million developers. I've got very limited bargaining power. I can't withdraw my labour because I need it to make rent and buy food. My cost has a very predictable ceiling.

On the other hand, there's two AI labs, that could afford to eat your profit, because what are you gonna do? They're your entire labour force.

ilovecake1984 1 hour ago||
Agree. But if that’s the thinking then you need to compare vs off shore contract rates, not on shore contract rates.
jdw64 7 hours ago||
I can't go back to a life without AI, and I don't want to. But if AI were billed by token instead of subscription, my monthly cost would probably be ten times what it is now. I could switch to a Chinese model, but I'm not sure how things will look by then.

What makes AI so convenient is how good it is at doing red-team code reviews on my work. I used to need all this unnecessary communication just to get a review, but now I only have to reach out to the people I actually want to talk to.

KolibriFly 5 hours ago|
I feel like the author is jumping way too fast from "OpenAI is losing money" to "the whole AI economy is broken." A company being in the red during aggressive scaling doesn't automatically mean the unit economics don't work.
More comments...