Top
Best
New

Posted by ripe 9/1/2025

Amazon has mostly sat out the AI talent war(www.businessinsider.com)
https://archive.ph/ed8WJ
363 points | 660 comments
Traster 9/2/2025|
Zuckerberg rushing into every new fad with billions of dollars has somehow tricked people into thinking that's what big tech is about and all of them should be shovelling money into this.

But actually every other company has been much more strategic, Microsoft is bullish because they partnered up with OpenAI and it pumps their share price to be bullish, Google is the natural home of a lot of this research.

But actually, Amazon, Apple etc aren't natural homes for this, they don't need to burn money to chase it.

So there we have it, the companies that have a good strategy for this are investing heavily, the others will pick up merges and key technological partners as the market matures, and presumably Zuck will go off and burn $XB on the next fad once AI has cooled down.

JCM9 9/2/2025||
I generally agree with you, although Amazon is really paranoid about being behind here.

On the last earnings call the CEO gave a long rambling defensive response to an analyst question on why they’re behind. Reports from the inside also say that leaders are in full blown panic mode, pressing teams to come up with AI offerings even though Amazon really doesn’t have any recognized AI leaders in leadership roles and the best talent in tech is increasingly leaving or steering clear of Amazon.

I agree they should just focus on what they’re good at, which is logistics and fundamental “boring” compute infrastructure things. However leadership there though is just all over the map trying to convince folks their not behind vs just focusing on strengths.

ericmcer 9/2/2025|||
Doesn't Amazon have a huge lead just because of AWS? Every other player is scrambling for hardware/electricity while Amazon has been building out data centers for the last 20 years.
dragonwriter 9/2/2025|||
> Doesn't Amazon have a huge lead just because of AWS?

They have huge exposure because of AWS; if the way people use computing shifts, and AWS isn't well-configured for AI workloads, then AWS has a lot to lose.

> Every other player is scrambling for hardware/electricity while Amazon has been building out data centers for the last 20 years.

Microsoft and Google have also been building out data centers for quite a while, but also haven't sat out the AI talent wars the way Amazon has.

deanCommie 9/2/2025||
> AWS isn't well-configured for AI workloads

What does that mean? Not enough GPUs?

JCM9 9/2/2025||
A few things, including:

1. Price-performance has struggled to stay competitive. There’s some supply-demand forces at play, but the top companies consistently seem to strike better deals elsewhere.

2. The way AWS is architected, especially on networking, isn’t ideal for AI. They’ve dug their heels on in their own networking protocols despite struggling to compete on performance. I personally know of several workloads that left AWS because they couldn’t compete on networking performance.

3. Struggling on the managed side. On paper a service like Bedrock should be great but in practice it’s been a hot mess. I’d love to use Anthropic via Bedrock, but it’s just much more reliable when going direct. AWS has never been great at these sort of managed services at scale and they’re again struggling here.

JCM9 9/2/2025||||
In theory they should, but it’s increasingly looking like they’re struggling to attract/retain the right talent to take advantage of that position. On paper they should be wiping the floor with others in this space. In practice they’re getting their *ss kicked and in a panic on what to do.
dataking 9/2/2025|||
My understanding is that they fell behind on offering the latest gen Nvidia hardware (Blackwell/Blackwell Ultra) due to their focus on internally developed ASICs (Trainium/Inferentia gen 2).
butlike 9/2/2025|||
Which bar raiser is going to raise the bar first??
alexc05 9/2/2025|||
I'd argue that Meta's income derives in no small part from their best in class ad targeting.

Being on the forefront of

(1) creating a personalized, per user data profile for ad-targeting is very much their core business. An LLM can do a very good job of synthesizing all the data they have on someone to try predicting things people will be interested in.

(2) by offering a free "ask me anything" service from meta.ai which is tied directly to their real-world human user account. They gather an even more robust user profile.

This isn't in-my-opinion simply throwing billions at a problem willy nilly. Figuring out how to apply this to their vast reams of existing customer data economically is going to directly impact their bottom line.

WtfRuSerious 9/2/2025|||
5 minutes on facebook being force-fed mesopotamian alien conspiracies is all you'll need to experience to fully understand just how BADLY they need some kind of intelligence for their content/advertising targeting, artificial or not...
graemep 9/2/2025||||
Obviously one is a very bad sample, but why are the ads I see on FB so badly targetted?
Scaevolus 9/2/2025|||
You probably don't spend enough time on their sites to have a good ad targeting model of you developed. The closer you are to normal users, with hundreds of hours of usage and many ad clicks, the more accurate the ads will be for you.
butlike 9/2/2025||
You mean the closer I am to the top of the bell curve, the more your ads "shooting from the hip" will land? Who would've thunk it?!
agent327 9/2/2025||||
Did you block their tracking across the whole damn internet, by any chance?
potro 9/2/2025||||
Same terrible experience for me while I was on FB. I was spending a lot of time there and I do shop a lot online. They couldn’t come with relevant ad targeting for me. For my wife they started to show relevant ads AFTER she went to settings and manually selected areas she is interested in. This is not an advanced technology everyone claim FB has.
sharadov 9/2/2025|||
Instagram has killer ad targeting; no wonder all these direct-to-consumer brands flock there. FB not so much I agree.
dylan604 9/2/2025||||
>An LLM can do a very good job of synthesizing all the data they have on someone to try predicting things people will be interested in.

Is synthesizing the right word here?

veidr 9/2/2025||
I think is absolutely is, LOL. Though a "very good job of synthesizing" might not actually good for much...
idopmstuff 9/2/2025|||
People look at all the chaos in their AI lab but ignore the fact that they yet again beat on earnings substantially and directly cited better ad targeting as the reason for that. Building an LLM is nice for them, but applying AI to their core business is what really matters financially, and that seems like it's going just fine.
HarHarVeryFunny 9/2/2025|||
The largest LLMs are mostly going to be running in the cloud, so the general purpose cloud providers (Amazon, Microsoft, Google) are presumably going to be in the business of serving models, but that doesn't necessarily mean they need to build the models themselves.

LLMs look to be shaping up as an interchangeable commodity as training datasets, at least for general purpose use, converge to the limits of the available data, so access to customers seems just as important, if not more, than the models themselves. It seems it just takes money to build a SOTA LLM, but the cloud providers have more of a moat, so customer access is perhaps the harder part.

Amazon do of course have a close relationship with Anthropic both for training and serving models, which seems like a natural fit given the whole picture of who's in bed with who, especially as Anthropic and Amazon are both focused on business customers.

GloriousMEEPT 9/2/2025||
Microsoft is building it's own in-house LLM's based on OpenAI's IP. Google builds it's own models.
HarHarVeryFunny 9/2/2025||
Sure, but you can also sell something without having built it yourself, just as Microsoft Copilot supports OpenAI and Anthropic models.

It doesn't have to be either/or of course - a cloud provider may well support a range of models, some developed in house and some not.

Vertical integration - a cloud provider building everything they sell - isn't necessarily the most logical business model. Sometimes it makes more sense to buy from a supplier, giving up a bit of margin, than build yourself.

GloriousMEEPT 9/2/2025||
I'm just an observer. Microsoft has invested billions in OpenAI and can access their IP as a result. It might even be possible MS hopes that OpenAI fails and doesn't allow them to restructure to continue to acquire outside funding. You can go directly to the announcement of their in-house model offerings and they are clearly using this as a recruiting tool for talent. Whether it makes sense for the cloud providers to build their own models is not for me to say, but they may not have a choice given how quickly OpenAI/Anthropic are burning cash. If those two fail then they're essentially ceding the market to Google.
jayd16 9/2/2025|||
I think this analysis is a bit shallow with regard to Metas product portfolio and how AI fits in.

Much more than the others, metter runs a content business. Gen AI aides in content generation so it behooves them to research it. Even before the current explosion of chatbots, meta was putting this stuff into their VR framework. It's used for their headset tracking and speech to text is helpful for controlling a headset without a physical keyboard.

You're making it sound like they'll follow anything that walks by but I do think it's more strategic than that.

gus_massa 9/2/2025|||
Zuckerberg bought Whatsapp and Instagram. For normal people, those replaced 90% of the internet here in Argentina

(The other 10% is mostly Google Maps and MercadoLibre.)

danieldk 9/2/2025||
But that didn't require deep insight. Both were already really popular and clearly a threat to Facebook. WhatsApp was huge in Europe before they bought (possibly other places as well).

Buying competition is par for the course for near-monopolies in their niches. As long as the scale differences in value are still very large, you can avoid competition relatively cheaply, while the acquired still walk away with a lot of money.

YetAnotherNick 9/2/2025|||
Why does investing in AI require deep insight? ChatGPT is already huge, significantly bigger than Whatsapp when the deal was done. And while OpenAI is not for sale, he figured that their employees are. Also not to mention, investors are very positive for AI.
PhunkyPhil 9/2/2025||
So far there hasn't been a transformative use case for LLMs besides the straightforward chat interface (Or some adjacent derivative). Cursor and IDE extensions are nice, but not something that generates billions in revenue.

This means there's two avenues:

1. Get a team of researchers to improve the quality of the models themselves to provide a _better_ chat interface

2. Get a lot of engineers to work LLMs into a useful product besides a chat interface.

I don't think that either of these options are going to pan out. For (1), the consumer market has been saturated. Laymen are already impressed enough by inference quality, there's little ground to be gained here besides a super AGI terminator Jarvis.

I think there's something to be had with agentic interfaces now and in the future, but they would need to have the same punching power to the public that GPT3 did when it came out to justify the billions in expenditure, which I don't think it will.

I think these companies might be able to break even if they can automate enough jobs, but... I'm not so sure.

YetAnotherNick 9/2/2025|||
Whatsapp had $10M revenue when it was acquired[1]. Lots of so called "chatgpt wrappers" has more revenue than that. While in hindsight Whatsapp acquisition at $19B seems no brainer, no concrete metric pointed to that compared to him investing $19B in AI now.

[1]: https://www.sec.gov/Archives/edgar/data/1326801/000132680114...

utyop22 9/3/2025||
Dude Zuckerberg bought whatsapp because FB Messenger was losing market share... nothing to do with Whatsapps revenue! Rather Zuckerbergs fear of FB products being displaced.
bonsai_bar 9/2/2025|||
> Cursor and IDE extensions are nice, but not something that generates billions in revenue.

I mean Cursor is already at $500 million ARR...

PhunkyPhil 9/2/2025||
How many software engineers are there in the world? How many are going to stop using it when model providers start increasing token cost on their APIs?

I could see the increased productivity of using Cursor indirectly generating a lot more value per engineer, but... I wouldn't put my money on it being worth it overall, and neither should investors chasing the Nvidia returns bag.

therealdrag0 9/2/2025|||
Pretty sure everyone was balking at the purchase prices at the time
utyop22 9/3/2025||
In the UK it was an obvious purchase - whatsapp was the main mode of communicaton on a phone. Nobody used Messenger for instance.
h1fra 9/2/2025|||
Amazon strategy is to invest in the infrastructure, money is where the machines live. I think they just realized none of those companies have a moat, so why would they? But all of them will buy compute
JCM9 9/2/2025|||
Except they’re struggling here. The performance of their offerings is consistently behind competitors, particularly given their ongoing networking challenges, and they’re consistently undercut on pricing.

For Amazon “renting servers” at very high margin is their cash cow. For many competitors it’s more of a side business or something they’re willing to just take far lower margin on. Amazon needs to keep the markup high. Take away the AWS cash stream and the whole of Amazon’s financials start to look ugly. That’s likely driving the current panic with its leadership.

Culturally Amazon does really well when it’s an early mover leader in a space. It really struggles, and its leadership can’t navigate, when it’s behind in a sector as is playing out here.

adventured 9/2/2025||
Under what scenario does Amazon lose the beast that is its high margin cloud service renting? It appears to be under approximately zero threat.

Companies are not going to stop needing databases and the 307 other things AWS provides, no matter how good LLMs get.

Cheaper competitors have been trying to undercut AWS since the early days of its public availability, it has not worked to stop them at all. It's their very comprehensive offering, proven track record and the momentum that has shielded AWS and will continue to indefinitely.

geodel 9/2/2025|||
AWS is losing marketshare to Azure and GCP. This is big deal, it was unexpected after years of Google/Microsoft trying and failing.

Further AWS is losing share at a time when GCP and Azure are becoming profitable businesses, so no longer losing money to gain market share.

JCM9 9/2/2025||||
It’s already playing out. Just look at recent results. While once light years ahead competitors are now closing ranks and margins are under pressure. AWS clearly isn’t going away, but on the current trajectory its future as the leading cloud is very much not a certainty.
tguedes 9/2/2025||||
Because if LLM inference is going to be a bigger priority for the majority of companies, they're going to go where they can get the best performance to cost ratio. AWS is falling behind on this. So companies (especially new ones) are going to start using GCP or Azure, and if they're already there for their LLM workloads, why not run the rest of the infrastructure there?

It's similar to how AWS became the de-facto cloud provider for newer companies. They struggled to convince existing Microsoft shops to migrate to AWS, instead most of the companies just migrated to Azure. If LLMs/AI become a major factor in new companies deciding which will be their default cloud provider, they're going to pick GCP or Azure.

breppp 9/2/2025|||
Except for spending cloud budgets on LLMs elsewhere like other mentioned, LLM coding will make it easier to convert codebases from being AWS dependent, easing their lock-in
zaphirplane 9/2/2025||||
I would be surprised if a cloud market leader thinks winning on commodity vm rental is a strategy
mhb 9/2/2025|||
And electricity.
giancarlostoro 9/2/2025|||
Microsoft has the pleasure of letting you pay for your own hosted GPT models, Mixtral, etc

Microsoft's in a sweet spot. Apple's another interesting one, you can run local LLM models on your Mac really nicely. Are they going to outcompete an Nvidia GPU? Maybe not yet, but they're fast enough as-is.

malfist 9/2/2025|||
> But actually, Amazon, Apple etc aren't natural homes for this, they don't need to burn money to chase it.

Amazon is the biggest investor of AI of any company. They've already spent over $100b YTD on capex for AI infrastructure.

ZiiS 9/2/2025|||
This is "shovels" they rent out; very different then research investment.
dylan604 9/2/2025|||
To do what for that money? Write summaries of product reviews? If they wanted to do something useful, they'd use the LLM to figure out which reviews are for a different product than what is currently being displayed.
veidr 9/2/2025||
"useful" means different things
kcplate 9/2/2025|||
> But actually, Amazon, Apple etc aren't natural homes for this, they don't need to burn money to chase it.

I really liked the concept of Apple Intelligence with everything happening all on device, both process and data with minimal reliance off device to deliver the intelligence. It’s been disappointing that it hasn’t come to fruition yet. I am still hopeful the vapor materializes soon. Personally I wouldn’t mind seeing them burning a bit more to make it happen.

mleo 9/2/2025||
It will likely occur, just maybe not this year or next. If we look over the last eighty years of computing, the trend has been smaller and more powerful computers. No reason to think this won’t occur with running inference on larger models.
burnte 9/2/2025|||
Exactly. Being a tech company doesn't mean you need to do everything any more than just because you're a family doctor you also should do trauma surgery, dentistry, and botox injections. Pick a lane, be an expert in it.
buyucu 9/2/2025|||
Except that Amazon's AWS business is severely threatened by the rise of alternative cloud providers who offer much more AI-friendly environments. It's not an existential topic yet, but could easily turn into one.
chaosbolt 9/2/2025|||
Zuckerberg knows what he's doing, his net worth is >250 billion dollars now.

Go all in the new fad, investors pile up on your stock, dump, repeat...

aleph_minus_one 9/2/2025||
> Zuckerberg knows what he's doing, his net worth is >250 billion dollars now.

Does he have this net worth because what he is doing or despite what he is doing? :-)

Correlation does not imply causation. Attribution is hard.

throwawayq3423 9/2/2025|||
By that logic, a social media company shouldn't rush into it either, but they did anyway.
DonsDiscountGas 9/2/2025|||
Zuckerbergs AI "strategy" seems to be to make it easy for people to generate AI slop and share it on FB thus keeping them active on the platform. Or to give people AI "friends" to interact with on FB, thus keeping them on the platform and looking at ads. It's horrifying but it does make business sense (IMHO) at least at first glance.
idiomat9000 9/2/2025|||
They have out sensors though for any AGI, because AGI could subvert buisness fields and expertise moats. Thats what most AI teams are- vanity projects and a few experts calming the higher ups every now and then with a "its still just autocompletion on steroids, it can not yet do work and research alone."
ath3nd 9/2/2025|||
> Zuckerberg rushing into every new fad with billions of dollars has somehow tricked people into thinking that's what big tech is about and all of them should be shovelling money into this.

Zuckerberg failed every single fad he tried.

He's becoming more irrelevant every year and only the company's spoils from the past (earned not less by enabling, for example, a genocide to be committed in Myanmar https://www.pbs.org/newshour/world/amnesty-report-finds-face...) help carry them through to the series of disastrous idiotic decision Zuck is inflicting on them.

- VR with Oculus. It never caught on, for most people who own one, it's just gathering dust.

- Metaverse. They actually spend billions on that? https://www.youtube.com/watch?v=SAL2JZxpoGY

- LLAMA is absolute trash, a dumpster fire in the world of LLMs

Zuck is now trying to jump again on the LLM bandwagon and he's trying to...buy his way in with ridiculous pay packages: https://www.nytimes.com/2025/07/31/technology/ai-researchers.... Why is he so wrong to do that, you might ask?

He is doing it at the worst possible moment: LLMs are stagnating and even far better players than Meta like Anthropic and OpenAI can't produce anything worth writing about.

ChatGPT5 was a flop, Anthropic are struggling financially and are lowering token limits and preparing users for cranking up prices, going 180 on their promises not to use chat data for training, and Zuck, in his infinite wisdom, decides to hire top AI talent for premium price at a rapidly cooling market? You can't make up stuff like that.

It would appear that apart from being an ass kisser to Trump, Zuck shares another thing with the orange man-child running the US: a total inability to make good, or even sane deals. Fingers crossed that Meta goes bankrupt just like Trump's 6 banrkruptcies and then Zuck can focus on his MMA career.

code_for_monkey 9/2/2025|||
I've been taking heat for years for making fun of the metaverse. I had hopeful digital landlords explain to me that theyll be charging rent in there! Who looked at that project and thought it was worth anything?
williamdclt 9/2/2025|||
> I've been taking heat for years for making fun of the metaverse

I don't know in what circles you're hanging out, I don't know a single person who believed in the metaverse

ath3nd 9/2/2025||
> I don't know in what circles you're hanging out, I don't know a single person who believed in the metaverse

Oh please, the world was full of hype journalists wanting to sound like they get it and they are in it, whatever next trash Facebook throws their way.

The same way folks nowadays pretend like the LLMs are the next coming of Jesus, it's the same hype as the scrum crowd, the same as crypto, nfts, web3. Always ass kissers who cant think for themselves and have to jump on some bandwagon to feign competence.

Look at what the idiots at Forbes wrote: https://www.forbes.com/councils/forbestechcouncil/2023/02/27...

They are still very influential, despite having shit takes loke that.

Accenture still think the Meta is groundbreaking: https://www.accenture.com/us-en/insights/metaverse

What a bunch of losers!

71% of executives seemed to be very excited about it: https://www.weforum.org/stories/2022/04/metaverse-will-be-go...

Executives (like Zuck) are famous for being rather stupid so if they are claiming something, you bet its not gonna happen.

Apparently, "The metaverse is slowly becoming the new generation’s digital engagement platform, but it’s making changes across enterprises, too."

https://www.softserveinc.com/en-us/blog/the-promise-of-the-m...

mring33621 9/2/2025|||
i don't care about virtual real estate, but VR mini golf sure is fun!
HDThoreaun 9/2/2025|||
meta made $62 billion dollars last year. Mark burns all this money because his one and only priority is making sure his company doesnt become an also ran. The money means nothing to him
ath3nd 9/3/2025||
Yet his company and him are becoming rapidly irrelevant.
physhster 9/2/2025|||
So does Pichai... Every time there is something new, he forces Google to pivot, upending everything without much to show for it.
rorads 9/2/2025|||
Google basically invented modern AI (the 'T' in ChatGPT stands for Transformer), then took a very broad view of how to apply broadly neural AI - AlphaGo, AlphaGenome being the kind of non-LLM stuff they've done).

A better way to look at it is that the absolute number 1 priority for google since they first created a money spiggot throguh monetising high-intent search and got the monopoly on it (outside of Amazon) has been to hold on to that. Even YT (the second biggest search engine on the internet other than google itself) is high intent search leading to advertising sales conversion.

So yes, google has adopted and killed lots of products, but for its big bets (web 2.0 / android / chrome) it's basically done everything it can to ensure it keeps it's insanely high revenue and margin search business going.

What it has to show for it is basically being the only company to have transitioned as dominent across technological eras (desktop -> web2.0 -> mobile -> (maybe llm).

As good as OpenAI is as a standalone, and as good as Claude / Claude Code is for developers, google has over 70% mobile market share with android, nearly 70% browser market share with chrome - this is a huge moat when it comes to integration.

You can also be very bullish about other possible trends. For AI - they are the only big provider which has a persistent hold on user data for training. Yes, OpenAI and Grok have a lot of their own data, but google has ALL gmail, high intent search queries, youtube videos and captions, etc.

And for AR/VR, android is a massive sleeping giant - no one will want to move wholesale into a Meta OS experience, and Apple are increasingly looking like they'll need to rely on google for high performance AI stuff.

All of this protects google's search business a lot.

Don't get me wrong, on the small stuff google is happy to let their people use 10% time to come up with a cool app which they'll kill after a couple of years, but for their big bets, every single time they've gone after something they have a lot to show for it where it counts to them.

msabalau 9/2/2025||
Yeah, and Google has cared deeply about AI as a long term play since before they were public. And have been continuously invested there over the long haul.

The small stuff that they kill is just that--small stuff that was never important to them strategically.

I mean, sure, don't heavily invest (your attention, time, business focus, whatever) in something that is likely to be small to Google, unless you want to learn from their prototypes, while they do.

But to pretend that Google isn't capable of sustained intense strategic focus is to ignore what's clearly visible.

BoredPositron 9/2/2025|||
When did Google ever pivot?
chubot 9/2/2025|||
I haven't followed that closely, but Gemini seems like a pivot based on ChatGPT's market success

Google is leading in terms of fundamental technology, but not in terms of products

They had the LLambda chatbot before that, but I guess it was being de-emphasized, until ChatGPT came along

Social was a big pivot, though that wasn't really due to Pichai. That was while Larry Page was CEO and he argued for it hard. I can't say anyone could have known beforehand, but in retrospect, Google+ was poorly conceived and executed

---

I also believe the Nth Google chat app was based on WhatsApp success, but I can't remember the name now

Google Compute Engine was also following AWS success, after initially developling Google App Engine

itsoktocry 9/2/2025|||
>I haven't followed that closely, but Gemini seems like a pivot based on ChatGPT's market success

"AI" in it's current form is already a massive threat to Google's main business (I personally use Google only a fraction of what I used to), so this pivot is justified.

sabas123 9/2/2025|||
Is it really such a pivot when they invested a lot in AI already?

They bought DeepMind in 2014 and always showed of a ton of AI research.

swiftcoder 9/2/2025||||
https://killedbygoogle.com
devin 9/2/2025||
None of these are pivots. The core business has always been the core business.
swiftcoder 9/2/2025||
If you are defining "pivot" as "abandon all other lines of business", then no, none of the BigTechs have ever pivoted.

By more reasonable standards of "pivot", the big investment into Google Plus/Wave in the social media era seems to qualify. As does the billions spent building out Stadia's cloud gaming. Not to mention the billions invested in their abandoned VR efforts, and the ongoing investment into XR...

msabalau 9/2/2025|||
G+ was a significant effort that was abandoned.

I'd personally define that as Google hedging their bet's and being prepared in case they needed to truly pivot, and then giving up when it became clear that they wouldn't need to. Sort of like "Apple Intelligence" but committing to the bit, and actually building something that was novel, and useful to some people, who were disappointed when it went away.

Stadia was always clearly unimportant to Google, and I say that as a Stadia owner (who got to play some games, and then got refunds.) As was well reported at the time, closing it was immaterial to their financials. Just because spending hundreds of millions of dollars or even a few billion dollars is significant to you or I doesn't mean that this was ever part of their core business.

Regardless, the overall sentimentality on HN about Google Reader and endless other indisputably small projects says more about the lack of strategic focus from people here, than it says anything about Alphabet.

veidr 9/2/2025|||
Well, "pivot" implies the core business has failed and you're like "oh shit, let's do X instead".

Stadia was just Google's New Coke, Apple's Mac Cube, or Microsoft's MSNBC (or maybe Zune.

When they can't sell ads anymore, they'll have to pivot.

swiftcoder 9/2/2025||
> Well, "pivot" implies the core business has failed and you're like "oh shit, let's do X instead".

I mean, Facebook's core business hasn't actually failed yet either, but their massive investments in short-form video, VR/XR/Metaverse, blockchain, and AI are all because they see their moat crumbling and are desperately casting around for a new field to dominate.

Google feels pretty similar. They made a very successful gambit into streaming video, another into mobile, and a moderately successful one into cloud compute. Now the last half a dozen gambits have failed, and the end of the road is in sight for search revenue... so one of the next few investments better pay off (or else)

devin 9/3/2025||
The link you posted has a great many very insignificant investments included in it, and nothing I've seen Google doing has felt quite like the desperation of Facebook in recent years.

I didn't really see it at first, but I think you are correct to point out that they kind of rhyme. However to me, I think the clear desperation of Facebook makes it feel rather different from what I've seen Google doing over the years. I'm not sure I agree that Google's core business is in jeopardy in the way that Facebook's aging social media platform is.

ethbr1 9/2/2025|||
Social. User-facing AI.
earth2mars 9/2/2025||
Social: YouTube User facing AI: Gemini, Google photos, NotebookLM and plenty of products.
spjt 9/2/2025||
I suppose you could argue that Amazon does have one special thing going for it here, idle compute resources in AWS. However that is not the sort of thing that requires "AI talent" to make use of.
swiftcoder 9/2/2025||
They also have made pretty big investments in cloud VMs with GPUs attached, so they are making money off the AI craze regardless
whatever1 9/2/2025||
The evidence shows that there is no methodological moat for LLMS. The moat of the frontier folks is just compute. xAI went in months from nothing to competing with the top dogs. DeepSeek too. So why bother with splurging billions in talent when you can buy GPUs and energy instead and serve the compute needs of everyone?

Also Amazon is in another capital intensive business. Retail. Spending billions on dubious AWS moonshots vs just buying more widgets and placing them across the houses of US customers for even faster deliveries does not make sense.

cedws 9/2/2025||
A lot of C-suite people seem to have an idea that if they just throw enough compute at LLMs that AGI will eventually emerge, even though it's pretty clear at this point that LLMs are never going to lead to general intelligence. In their view it makes sense to invest massive amounts of capital because it's like a lottery ticket to being the future AGI company that dominates the world.

I recall Zuckerberg saying something about how there were early signs of AI "improving itself." I don't know what he was talking about but if he really believes that's true and that we're at the bottom of an exponential curve then Meta's rabid hiring and datacenter buildout makes sense.

hliyan 9/2/2025|||
In early 2023, I remember someone breathlessly explaining that there are signs that LLMs that are seemingly good at chess/checkers moves may have a rudimentary model of the board within them, somehow magically encoded into the model weights through the training. I was stupid enough to briefly entertain the possibility until I actually bothered to develop a high level understanding of the transformer architecture. It's surprising how much mysticism this field seems to attract. Perhaps it being a non-deterministic, linguistically invoked black box, triggers the same internal impulses that draw some people to magic and spellcasting.
pegasus 9/2/2025|||
Just because it's not that hard to reach a high-level understanding of the transformer pipeline doesn't mean we understand how these systems function, or that there can be no form of world model that they are developing. Recently there has been more evidence for that particular idea [1]. The feats of apparent intelligence LLMs sometimes display have taken even their creators by surprise. Sure, there's a lot of hype too, that's part and parcel of any new technology today, but we are far from understanding what makes them perform so well. In that sense, yeah you could say they are a bit "magical".

[1] https://the-decoder.com/new-othello-experiment-supports-the-...

ath3nd 9/2/2025||
> Just because it's not that hard to reach a high-level understanding of the transformer pipeline doesn't mean we understand how these systems function

Mumbo jumbo magical thinking.

They perform so well because they are trained on probabilistic token matching.

Where they perform terribly, e.g math, reasoning, they are delegating to other approaches, and that's how you get the illusion that there is actually something there. But it's not. Faking intelligence is not intelligence. It's just text generation.

> In that sense, yeah you could say they are a bit "magical"

Nobody but the most unhinged hype pushers are calling it "magical". The LLM can never ever be AGI. Guessing the next word is not intelligence.

> there can be no form of world model that they are developing

Kind of impossible to form a world model if your foundation is probabilistic token guessing which is what LLMs are. LLMs are a dead end in achieving "intelligence", something novel as an approach needs to be discovered (or not) to go into the intelligence direction. But hey, at least we can generate text fast now!

whalee 9/2/2025||
> LLMs are a dead end in achieving "intelligence"

There is no evidence to indicate this is the case. To the contrary, all evidence we have points to these models, over time, being able to perform a wider range of tasks at a higher rate of success. Whether it's GPQA, ARC-AGI or tool usage.

> they are delegating to other approaches > Faking intelligence is not intelligence. It's just text generation.

It seems like you know something about what intelligence actually is that you're not sharing. If it walks, talks and quacks like a duck, I have to assume it's a duck[1]. Though, maybe it quacks a bit weird.

[1] https://en.wikipedia.org/wiki/Solipsism

ath3nd 9/3/2025||
> There is no evidence to indicate this is the case

Burden of proof is on those trying to convince us to buy into the idea of LLMs as being "intelligence".

There is no evidence of the Flying Spaghetti monster or Zeus or God not existing either, but we don't take seriously the people who claim they do exist (and there isn't proof because these concepts are made up).

Why should we take seriously the tolks claiming LLMs are intelligence without proof (there can't be proof, of course, because LLMs are not intelligence)?

ericmcer 9/2/2025||||
Is there something we are all missing? Using Claude feels like magic sometimes, but can't everyone see the limitation now that we are 4 years and 100s of billions down the road?

Are they still really hoping that they are gonna tweak a model and feed it an even bigger dataset and it will be AGI?

momojo 9/2/2025|||
I'm not a fan of mysticism. I'm also with you that these are simply statistical machines. But I don't understand what happened when understood transformers at a high-level.

If you're saying the magic disappeared after looking at a single transformer, did the magic of human intelligence disappear after you understood human neurons at a high level?

stuaxo 9/2/2025||||
Its insane really, anyone who has worked with LLMs for a bit and has an idea of how they work shouldn't think its going to lead to "AGI".

Hopefully some big players, like FB bankrupt themselves.

IanCal 9/2/2025|||
Tbh I find this view odd, and I wonder what people view as agi now. It used to be that we had extremely narrow pieces of AI and I remember being on a research project about architectures and just very basic “what’s going on?” was advanced. Understanding that someone asked a question, that would be solved by getting a book and being able to then go and navigate to the place the book was likely to be was fancy. Most systems could solve literally one type of problem. They weren’t just bad at other things they were fundamentally incapable of anything but an extremely narrow use case.

I can throw wide ranging problems at things like gpt5 and get what seem like dramatically better answers than if I asked a random person. The amount of common sense is so far beyond what we had it’s hard to express. It used to be always pointed out that the things we had were below basic insect level. Now I have something that can research a charity, find grants and make coherent arguments for them, read matrix specs and debug error messages, and understand sarcasm.

To me, it’s clear that agi is here. But then what I always pictured from it may be very different to you. What’s your image of it?

whizzter 9/2/2025|||
It's more that "random" people are dumb as bricks (but we've in the name of equality and historic measurement errors decided to forgo that), add to it that AI's have a phenomenal (internet sized) memory makes them far more capable than many people.

However, even "dumb" people can often make judgements structures in a way that AI's cannot, it's just that many have such a bad knowledge-base that they cannot build the structures coherently whereas AI's succeed thanks to their knowledge.

I wouldn't be surprised if the top AI firms today spend an inordinate amount of time to build "manual" appendages into the LLM systems to cater to tasks such as debugging to uphold the facade that the system is really smart, while in reality it's mostly papering up a leaky model to avoid losing the enormous investments they need to stay alive with a hope that someone on their staff comes up a real solution to self-learning.

https://magazine.sebastianraschka.com/p/understanding-reason...

adwn 9/2/2025||||
I think the discrepancy between different views on the matter mainly stems from the fact that state-of-the-art LLMs are better (sometimes extremely better) at some tasks, and worse (sometimes extremely worse) at other tasks, compared to average humans. For example, they're better at retrieving information from huge amounts of unstructured data. But they're also terrible at learning: any "experience" which falls out of the context window is lost forever, and the model can't learn from its mistakes. To actually make it learn something requires very many examples and a lot of compute, whereas a human can permanently learn from a single example.
andsoitis 9/2/2025||
> human can permanently learn from a single example

This, to me at least, seems like an important ingredient to satisfying a practical definition / implementation of AGI.

Another might be curiosity, and I think perhaps also agency.

Yoric 9/2/2025||||
I think it's clear that nobody agrees what AGI is. OpenAI describes it in terms of revenue. Other people/orgs in terms of, essentially, magic.

If I had to pick a name, I'd probably describe ChatGPT & co as advanced proof of concepts for general purpose agents, rather than AGI.

delecti 9/2/2025||
> I think it's clear that nobody agrees what AGI is

People selling AI products are incentivized to push misleading definitions of AGI.

boppo1 9/2/2025||||
Human-level intelligence. Being able to know what it doesn't know. Having a practical grasp on the idea of truth. Doing math correctly, every time.

I give it a high-res photo of a kitchen and ask it to calculate the volume of a pot in the image.

tomaskafka 9/2/2025|||
You discover truth by doing stuff in real world and observing the results. Current LLM have enough intelligence, but all the inputs they have are the “he said she said” by us monkeys, including all omissions and biases.
snapcaster 9/2/2025||||
But many humans can't do a lot of those things and we still consider them "generally intelligent"
293984j29384 9/2/2025|||
None of what you describe would I label within the realm of 'average'
swiftcoder 9/2/2025||
It's not about what the average human can do - it's about what humans as a category are capable of. There will always be outliers (in both directions), but you can, in general, teach a human a variety of tasks, such as performing arithmetic deterministically, that you cannot teach to, for example, a parrot.
audunw 9/3/2025||||
I don’t have a very high expectation of AGI at all. Just an algorithm or system you can put onto a robot dog, and get a dog level general intelligence. You should be able to live with that robot dog for 10 years and it should be just as capable as a dog throughout that timespan.

Hell, I’d even say we have AGI if you could emulate something like a hamster.

LLMs are way more impressive in certain ways than such a hypothetical AGI. But that has been true of computers for a long time. Computers have been much better at Chess than humans for decades. Dogs can’t do that. But that doesn’t mean that a chess engine is an AGI.

I would also say we have a special form of AGI if the AI can pass an extended Turing test. We’ve had chat bots that can fool a human for a minute for a long time. Doesn’t mean we had AGI. So time and knowledge was always a factor in a realistic Turing test. If an AGI can fool someone who knows how to properly probe an LLM, for a month or so, while solving a bunch of different real world tasks that require stable long term memory and planning, then I’d day we’re in AGI territory for language specifically. I think we have to distinguish between language AGI and multi-modal AGI. So this test wouldn’t prove what we could call “full” AGI.

These are some of the missing components for full AGI: - Being able to act as a stable agent with a stable personality over long timespans - Capable of dealing with uncertainties. Having a understanding of what it doesn’t know - One-shot learning, with long term retention, for a large number of things - Fully integrated multi-modality across sound, vision, and other inputs/outputs we may throw at it.

The last one is where we may be able to get at the root of the algorithm we’re missing. A blind person can learn to “see” by making clicks and using their ears to see. Animals can do similar “tricks”. I think this is where we truly see the full extent of the generality and adaptability of the biological brain. Imagine trying to make a robot that can exhibit this kind of adaptability. It doesn’t fit into the model we have for AI right now.

homarp 9/2/2025||||
my picture of AGI is 1) autonomous improvement 2) ability to say 'i don't know/can't be done'
dmboyd 9/2/2025||
I wonder if 2) is a result of published bias for positive results in the training set. An “I don’t know” response is probably ranked unsatisfactory by human feedback and most published scientific literature are biased towards positive results and factual explanations.
InitialLastName 9/2/2025||
In my experience, the willingness to say "I don't know" instead of confabulate is also down-rated as a human attribute, so it's not surprising that even an AGI trained on the "best" of humanity would avoid it.
AlienRobot 9/2/2025|||
Nobody is saying that LLM's don't work like magic. I know how neural networks work and they still feel like voodoo to me.

What we are saying is that LLM's can't become AGI. I don't know what AGI will look like, but it won't look like an LLM.

There is a difference between being able to melt iron and being able to melt tungsten.

thaawyy33432434 9/2/2025||||
Recently I realized that US are very close to a centrally planned economy. Meta wasted 50B on metaverse, which like how much Texas spends on healthcare. Now the "AI" investments seems dubious.

You could fund 1000+ projects with this kinds of money. This is not an effective capital allocation.

amelius 9/2/2025||||
The only way we'll have AGI is if people get dumber. Using modern tech like LLMs makes people dumber. Ergo, we might see AGI sooner than expected.
menaerus 9/2/2025||||
> ... and has an idea of how they work shouldn't think its going to lead to "AGI"

Not sure what level of understanding are you referring to but having learned and researched about the pretty much all LLM internals I think this has led me exactly to the opposite line of thinking. To me it's unbelievable what we have today.

janalsncm 9/2/2025||||
I think AI research is like anything else really. The smartest people are heads down working on their problems. The people going on podcasts are less connected to day to day work.

It’s also pretty useless to talk about whether something is AGI without defining intelligence in the first place.

foobarian 9/2/2025||||
I think something like we saw in the show "Devs" is much more likely, although what the developers did with it in the show was bonkers unrealistic. But some kind of big enough quantum device basically.
guardian5x 9/2/2025||||
Just scaling them up might not leat to "AGI", but they can still lead to AGI as a bridge.
meowface 9/2/2025||||
This is not and has not been the consensus opinion. If you're not an AI researcher you shouldn't write as if you've set your confidence parameter to 0.95.

Of course it might be the case, but it's not a thing that should be expressed with such confidence.

blackhaz 9/2/2025|||
Is it widely accepted that LLMs won't lead to AGI? I've asked Gemini, so it came up with four primary arguments for this claim, commenting on them briefly:

1) LLMs as simple "next token predictors" so they just mimicry thinking: But can it be argued that current models operate on layers of multiple depth and are able to actually understand by building concepts and making connections on abstract levels? Also, don't we all mimicry?

2) Grounding problem: Yes, models build their world models on text data, but we have models operating on non-textual data already, so this appears to be a technical obstacle rather than fundamental.

3) Lack of World Model. But can anyone really claim they have a coherent model of reality? There are flat-earthers, yet I still wouldn't deny them having AGI. People hallucinate and make mistakes all the time. I'd argue hallucinations is in fact the sign of an emerging intelligence.

4) Fixed learning data sets. Looks like this is now being actively solved with self-improving models?

I just couldn't find a strong argument supporting this claim. What am I missing?

globnomulous 9/2/2025|||
Why on earth would you copy and paste an LLM's output into a comment? What does that accomplish or provide that just a simply stated argument doesn't accomplish more succinctly? If you don't know something, simply don't comment on it -- or ask a question.
blackhaz 9/2/2025||
None of the above is AI.
globnomulous 9/4/2025||
> I've asked Gemini, so it came up with four primary arguments for this claim, commenting on them briefly:

This line means, and literally says, that everything that follows is a summary or direct quotation from an LLM's output.

There's a more charitable but unintuitive interpretation, in which "commenting on them briefly" is intended to mean "I will comment on them briefly:". But this isn't a natural interpretation. It's one I could be expected to reach only after seeing your statement that 'none of the above is AI.' But even this more charitable interpretation actually contradicts your claim that it's not AI.

So now I'm even less sure I know what you meant to communicate. Either I'm missing something really obvious or the writing doesn't communicate what you intended.

welferkj 9/2/2025|||
Fur future reference, pasting llm slop feels exactly as patronizing as back when people pasted links to google searches in response to questions they considered beneath their dignity to answer. Except in this case, no-one asked to begin with.
qcnguy 9/2/2025||||
> I don't know what he was talking about

There's a bunch of ways AI is improving itself, depending on how you want to interpret that. But it's been true since the start.

1. AI is used to train AI. RLHF uses this, curriculum learning is full of it, video model training pipelines are overflowing with it. AI gets used in pipelines to clean and upgrade training data a lot.

2. There are experimental AI agents that can patch their own code and explore a tree of possibilities to boost their own performance. However, at the moment they tap out after getting about as good as open source agents, but before they're as good as proprietary agents. There isn't exponential growth. There might be if you throw enough compute at it, but this tactic is very compute hungry. At current prices it's cheaper to pay an AI expert to implement your agent than use this.

Eggpants 9/2/2025|||
So have an AI with a 40% error rate judge an AI with an 40% error rate…

AGI is a complete no go until a model can adjust its own weights on the fly, which requires some kind of negative feedback loop, which requires a means to determine a failure.

Humans have pain receptors to provide negative feedback and we can imagine events that would be painful such as driving into a parked car would be painful without having to experience it.

If current models could adjust its own weights to fix the famous “how many r’s in strawberry” then I would say we are on the right path.

However, the current solution is to detect the question and forward it to a function to determine the right answer. Or attempt to add more training data the next time the model is generated ($$$). Aka cheat the test.

mitjam 9/2/2025||||
I think LLM as a toolsmith like demonstrated in the Voyager paper (1) is another interesting approach to creating a system that can learn to do a task better over time. (1) https://arxiv.org/abs/2305.16291
Yoric 9/2/2025||||
> There are experimental AI agents that can patch their own code and explore a tree of possibilities to boost their own performance. However, at the moment they tap out after getting about as good as open source agents, but before they're as good as proprietary agents.

Interesting. Do you have links?

torbab 9/2/2025||
Not OP, but the Darwin Godel machine comes to mind: https://arxiv.org/abs/2505.22954
qcnguy 9/2/2025|||
That's the one!
Yoric 9/3/2025|||
Thanks!
franktankbank 9/2/2025|||
I'm skeptical that RLHF really works. Doesn't it just patch the obvious holes so it looks better on paper? If it can't reason then it will continue to get 2nd and 3rd order difficulty problems wrong.
abraxas 9/2/2025||||
> it's pretty clear at this point that LLMs are never going to lead to general intelligence.

It is far from clear. There may well be emergent hierarchies of more abstract thought at much higher numbers of weights. We just don't know how a transformer will behave if one is built with 100T connections - something that would finally approach the connectome level of a human brain. Perhaps nothing interesting but we just do not know this and the current limitation in building such a beast is likely not software but hardware. At these scales the use of silicon transistors to approximate analog curve switching models just doesn't make sense. True neuromorphic chips may be needed to approach the numbers of weights necessary for general intelligence to emerge. I don't think there is anything in production at the moment that could rival the efficiency of biological neurons. Most likely we do not need that level of efficiency. But it's almost certain that stringing together a bunch of H100s isn't a path to the scale we should be aiming for.

epolanski 9/2/2025||||
I don't get it, I really don't.

Even assuming a company gets to AGI first this doesn't mean another one will follow.

Suppose that FooAI gets to it first: - competitors may get there too in a different or more efficient way - Some FooAI staff can leave and found their own company - Some FooAI staff can join a competitor - FooAI "secret sauce" can be figured out, or simply stolen, by a competitor

At the end of the day, it really doesn't matter, the equation AI === commodity just does not change.

There is no way to make money by going into this never ending frontier model war, price of training keeps getting higher and higher, but your competitors few months later can achieve your own results for a fraction of your $.

cedws 9/2/2025||
Some would say that the race to AGI is like the race to nuclear weapons and that the first to get there will hold all the cards (and be potentially able to stop others getting there.) It's a bit too sci-fi for me.
Yossarrian22 9/2/2025||
If AGI is reached it would be trivial for the competing superpowers to completely quarantine themselves network wise by cutting undersea cables long enough to develop competing AGI
CrossVR 9/2/2025|||
I don't know if AGI will emerge from LLM, but I'm always reminded of the Chinese room thought experiment. With billions thrown at the idea it will certainly be the ultimate answer as to whether true understanding can emerge from a large enough dictionary.
torginus 9/2/2025||
Please stop refering to the Chinese Room - it's just magical/deist thinking in disguise. It postulates that humans have way of 'understanding' things that is impossible to replicate mechanically.

The fact that philosophy hasn't recognized and rejected this argument based on this speaks volumes of the quality of arguments accepted there.

(That doesn't mean LLMs are or will be AGI, its just this argument is tautological and meaningless)

armada651 9/2/2025|||
That some people use the Chinese Room to ascribe some magical properties to human consciousness says more about the person drawing that conclusion than the thought experiment itself.

I think it's entirely valid to question whether a computer can form an understanding through deterministically processing instructions, whether that be through programming language or language training data.

If the answer is no, that shouldn't lead to a deist conclusion. It can just as easily lead to the conclusion that a non-deterministic Turing machine is required.

torginus 9/2/2025||
I'd appreciate if you tried to explain why instead of resorting to ad hominem.

> I think it's entirely valid to question whether a computer can form an understanding through deterministically processing instructions, whether that be through programming language or language training data.

Since the real world (including probabilistic and quantum phenomena) can be modeled with deterministic computation (a pseudorandom sequence is deterministic, yet simulates randomness), if we have a powerful enough computer we can simulate the brain to a sufficient degree to have it behave identically as the real thing.

The original 'Chinese Room' experiment describes a book of static rules of Chinese - which is probably not the way to go, and AI does not work like that. It's probabilistic in its training and evaluation.

What you are arguing is that constructing an artificial consciousness lies beyond our current computational ability(probably), and understanding of physics (possibly), but that does not rule out that we might solve these issues at some point, and there's no fundamental roadblock to artificial consciousness.

I've re-read the argument (https://en.wikipedia.org/wiki/Chinese_room) and I cannot help but conclude that Searle argues that 'understanding' is only something that humans can do, which means that real humans are special in some way a simulation of human-shaped atoms are not.

Which is an argument for the existence of the supernatural and deist thinking.

CrossVR 9/2/2025|||
> I'd appreciate if you tried to explain why instead of resorting to ad hominem.

It is not meant as an ad hominem. If someone thinks our current computers can't emulate human thinking and draws the conclusion that therefore humans have special powers given to them by a deity, then that probably means that person is quite religious.

I'm not saying you personally believe that and therefore your arguments are invalid.

> Since the real world (including probabilistic and quantum phenomena) can be modeled with deterministic computation (a pseudorandom sequence is deterministic, yet simulates randomness), if we have a powerful enough computer we can simulate the brain to a sufficient degree to have it behave identically as the real thing.

The idea that a sufficiently complex pseudo-random number generator can emulate real-world non-determinism enough to fully simulate the human brain is quite an assumption. It could be true, but it's not something I would accept as a matter of fact.

> I've re-read the argument (https://en.wikipedia.org/wiki/Chinese_room) and I cannot help but conclude that Searle argues that 'understanding' is only something that humans can do, which means that real humans are special in some way a simulation of human-shaped atoms are not.

In that same Wikipedia article Searle denies he's arguing for that. And even if he did secretly believe that, it doesn't really matter, because we can draw our own conclusions.

Disregarding his arguments because you feel he holds a hidden agenda, isn't that itself an ad hominem?

(Also, I apologize for using two accounts, I'm not attempting to sock puppet)

torginus 9/2/2025||
What are his arguments then?

>Searle argues that, without "understanding" (or "intentionality"), we cannot describe what the machine is doing as "thinking" and, since it does not think, it does not have a "mind" in the normal sense of the word.

This is the only sentence that seems to be pointing to what constitutes the specialness of humans, and the terms of 'understanding' and 'intentionality' are in air quotes so who knows? This sounds like the archetypical no true scotsman fallacy.

In mathematical analysis, if we conclude that the difference between 2 numbers is smaller than any arbitrary number we can pick, those 2 numbers must be the same. In engineering, we can reduce the claim to 'any difference large about to care about'

Likewise if the difference between a real human brain and an arbitrarily sophisticated Chinese Room brain is arbitrarily small, they are the same.

If our limited understanding of physics and engineering makes the practical difference not zero, this essentially becomes a bit of a somewhat magical 'superscience' argument claiming we can't simulate the real world to a good enough resolution that the meaningful differences between our 'consciousness simulator' and the thing itself disappear - which is an extraordinary claim.

CrossVR 9/2/2025||
> What are his arguments then?

They're in the "Complete Argument" section of the article.

> This sounds like the archetypical no true scotsman fallacy.

I get what you're trying to say, but he is not arguing only a true Scotsman is capable of thought. He is arguing that our current machines lack the required "causal powers" for thought. Powers that he doesn't prescribe to only a true Scotsman, though maybe we should try adding bagpipes to our AI just to be sure...

torginus 9/2/2025||
Thanks, but that makes his arguments even less valid.

He argues that computer programs only manipulate symbols and thus have no semantic understanding.

But that's not true - many programs, like compilers that existed back when the argument was made, had semantic understanding of the code (in a limited way, but they did have some understanding about what the program did).

LLMs in contrast have a very rich semantic understanding of the text they parse - their tensor representations encode a lot about each token, or you can just ask them about anything - they might not be human level at reading subtext, but they're not horrible either.

CrossVR 9/2/2025||
Now you're getting to the heart of the thought experiment. Because does it really understand the code or subtext, or is it just really good at fooling us that it does?

When it makes a mistake, did it just have a too limited understanding or did it simply not get lucky with its prediction of the next word? Is there even a difference between the two?

I would like to agree with you that there's no special "causal power" that Turing machines can't emulate. But I remain skeptical, not out of chauvinism, but out of caution. Because I think it's dangerous to assume an AI understands a problem simply because it said the right words.

dahart 9/2/2025|||
> I cannot help but conclude that Searle argues that ‘understanding’ is only something that humans can do, which means…

Regardless of whether Searle is right or wrong, you’ve jumped to conclusions and are misunderstanding his argument and making further assumptions based on your misunderstanding. Your argument is also ad-hominem by accusing people of believing things they don’t believe. Maybe it would be prudent to read some of the good critiques of Searle before trying to litigate it rapidly and sloppily on HN.

The randomness stuff is very straw man, definitely not a good argument, best to drop it. Today’s LLMs are deterministic, not random. Pseudorandom sequences come in different varieties, but they model some properties of randomness, not all of them. The functioning of today’s neural networks, both training and inference, is exactly a book of static rules, despite their use of pseudorandom sequences.

In case you missed it in the WP article, most of the field of cognitive science thinks Searle is wrong. However, they’re largely not critiquing him for using metaphysics, because that’s not his argument. He’s arguing that biology has mechanisms that binary electronic circuitry doesn’t; not human brains, simply physical chemical and biological processes. That much is certainly true. Whether there’s a difference in theory is unproven. But today currently there absolutely is a difference in practice, nobody has ever simulated the real world or a human brain using deterministic computation.

torginus 9/2/2025||
If scientific consensus is that he's wrong why is he being constantly brought up and defended - am I not right to call them out then?

Nobody brings up that light travels through the aether, that diseases are caused by bad humors etc. - is it not right to call out people for stating theory that's believed to be false?

>The randomness stuff is very straw man,

And a direct response to what armada651 wrote:

>I think it's entirely valid to question whether a computer can form an understanding through deterministically processing instructions, whether that be through programming language or language training data.

> He’s arguing that biology has mechanisms that binary electronic circuitry doesn’t; not human brains, simply physical chemical and biological processes.

Once again the argument here changed from 'computers which only manipulate symbols cannot create consciousness' to 'we don't have the algorithm for consiousness yet'.

He might have successfully argued against the expert systems of his time - and true, mechanistic attempts at language translation have largely failed - but that doesn't extend to modern LLMs (and pre LLM AI) or even statistical methods.

dahart 9/2/2025||
You’re making more assumptions. There’s no “scientific consensus” that he’s wrong, there are just opinions. Unlike the straw man examples you bring up, nobody has proven the claims you’re making. If they had, then the argument would go away like the others you mentioned.

Where did the argument change? Searle’s argument that you quoted is not arguing that we don’t have the algorithm yet. He’s arguing that the algorithm doesn’t run on electrical computers.

I’m not defending his argument, just pointing out that yours isn’t compelling because you don't seem to fully understand his, at least your restatement of it isn’t a good faith interpretation. Make his argument the strongest possible argument, and then show why it doesn’t work.

IMO modern LLMs don’t prove anything here. They don’t understand anything. LLMs aren’t evidence that computers can successfully think, they only prove that humans are prone to either anthropomorphic hyperbole, or to gullibility. That doesn’t mean computers can’t think, but I don’t think we’ve seen it yet, and I’m certainly not alone there.

torginus 9/3/2025||
>most of the field of cognitive science thinks Searle is wrong.

>There’s no “scientific consensus” that he’s wrong, there are just opinions.

dahart 9/3/2025||
And? Are you imagining that these aren’t both true at the same time? If so, I’m happy to explain. Since nothing has been proven, there’s nothing “scientific”. And since there’s some disagreement, “consensus” has not been achieved yet. This is why your presumptive use of “scientific consensus” was not correct, and why the term “scientific consensus” is not the same thing as “most people think”. A split of 60/40 or 75/25 or even 90/10 counts as “most” but does not count as “consensus”. So I guess maybe be careful about assuming what something means, it seems like this thread was limited by several incorrect assumptions.
globnomulous 9/2/2025||||
> The fact that philosophy hasn't recognized and rejected this argument based on this speaks volumes of the quality of arguments accepted there.

That's one possibility. The other is that your pomposity and dismissiveness towards the entire field of philosophy speaks volumes on how little you know about either philosophical arguments in general or this philosophical argument in particular.

torginus 9/2/2025||
Another ad hominem, I'd like you to refute my claim that Searle's argument is essentially 100% magical thinking.

And yes, if for example, medicine would be no worse at curing cancer than it is today, yet doctors asserted that crystal healing is a serious study, that would reflect badly on the field at large, despite most of it being sound.

dahart 9/2/2025|||
Searle refutes your claim that there’s magical thinking.

“Searle does not disagree with the notion that machines can have consciousness and understanding, because, as he writes, "we are precisely such machines". Searle holds that the brain is, in fact, a machine, but that the brain gives rise to consciousness and understanding using specific machinery.”

torginus 9/2/2025||
But the core of the original argument is that programs only manipulate symbols and consciousness can never arise just through symbol manipulation - which here then becomes 'we have not discovered the algorithms' for consciousness yet.

It's just a contradiction.

dahart 9/2/2025||
When you say something that contradicts his statements, it doesn’t mean he’s wrong, it most likely means you haven’t understood or interpreted his argument correctly. The Wikipedia page you linked to doesn’t use the word “algorithm”, so the source of the contradiction you imagine might be you. Searle says he thinks humans are biological machines, so your argument should withstand that hypothesis rather than dismiss it.
globnomulous 9/2/2025||||
Why on earth do you take it as an ad hominem attack? Do you really think your comment isn't dismissive or pompous?
malfist 9/2/2025|||
Another ad hominem, just like you calling anyone who talks about the chinese room thought experiment a deist?
simianparrot 9/2/2025||||
It is still relevant because it hasn’t been disproven yet. So far all computer programs are Chinese Rooms, LLM’s included.
IanCal 9/2/2025||
If you’re talking about it being proven or disproven you’re misunderstanding the point of the thought experiment.
herculity275 9/2/2025||||
"Please stop referring to this thought experiment because it has possible interpretations I don't personally agree with"
torginus 9/2/2025||
Please give me an interpretation that is both correct an meaningful (as in possible to disprove)
welferkj 9/2/2025|||
The human way of understanding things can be replicated mechanically, because it is mechanical in nature. The contents of your skull are an existence proof of AGI.
hiatus 9/2/2025|||
The A stands for artificial.
armada651 9/2/2025|||
The contents of my skull are only a proof for AGI if your mechanical machine replicates all its processes. It's not a question about whether a machine can reproduce that, it's a question about whether we have given our current machines all the tools it needs to do that.
torginus 9/2/2025||
The theory of special relativity does not say 'you can't exceed the speed of light(unless you have a really big rocket)'. It presents a theoretical limit. Likewise the Chinese room doesn't state that consciousness is an intractable engineering problem, but an impossibility.

But the way Searle formulates his argument, by not defining what consciousness is, he essentially gives himself enough wiggle room to be always right - he's essentially making the 'No True Scotsman' fallacy.

bhl 9/2/2025|||
The moat is people, data, and compute in that order.

It’s not just compute. That has mostly plateaued. What matters now is quality of data and what type of experiments to run, which environments to build.

sigmoid10 9/2/2025|||
This "moat" is actually constantly shifting (which is why it isn't really a moat to begin with). Originally, it was all about quality data sources. But that saturated quite some time ago (at least for text). Before RLHF/RLAIF it was primarily a race who could throw more compute at a model and train longer on the same data. Then it was who could come up with the best RL approach. Now we're back to who can throw more compute at it since everyone is once again doing pretty much the same thing. With reasoning we now also opened a second avenue where it's all about who can throw more compute at it during runtime and not just while training. So in the end, it's mostly about compute. The last years have taught us that any significant algorithmic improvement will soon permeate across the entire field, no matter who originally invented it. So people are important for finding this stuff, but not for making the most of it. On top of that, I think we are very close to the point where LLMs can compete with humans on their own algorithmic development. Then it will be even more about who can spend more compute, because there will be tons of ideas to evaluate.
DrScientist 9/2/2025|||
To put that into a scientific context - compute is capacity to do experiments and generate data ( about how best to build models ).

However I do think you are missing an important aspect - and that's people who properly understand important solvable problems.

ie I see quite a bit "we will solve this x, with AI' from startup's that don't fundamentally understand x.

sigmoid10 9/2/2025||
>we will solve this x, with AI

You usually see this from startup techbro CEOs understand neither x nor AI. Those people are already replacable by AI today. The kind of people who think they can query ChatGPT once with "How to create a cutting edge model" and make millions. But when you go in on the deep end, there are very few people who still have enough tech knowledge to compete with your average modern LLM. And even the Math Olympiad gold medalists high-flyers at DeepSeek are about to have a run for their money with the next generation. Current AI engineers will shift more and more towards senior architecture and PM roles, because those will be the only ones that matter. But PM and architecture is already something that you could replace today.

bhl 9/3/2025|||
> Originally, it was all about quality data sources.

It still is! Lots of vertical productivity data that would be expensive to acquire manually via humans will be captured by building vertical AI products. Think lawyers, doctors, engineers.

sigmoid10 9/5/2025||
That's literally what RLAIF has been doing for a while now.
ActionHank 9/2/2025|||
People matter less and less as well.

As more opens up in OSS and academic space, their knowledge and experience will either be shared, rediscovered, or become obsolete.

Also many of the people are coasting on one or two key discoveries by a handful of people years ago. When Zuck figures this out he gonna be so mad.

bhl 9/3/2025||
Not all will become OSS. Some will become products, and that requires the best people.
ml-anon 9/2/2025|||
Lets not pretend this is strategy. Amazon has been trying and failing to hire top AI people. No-one in their right minds would join. Even Meta has to shell out 8-9 figures for top people, who with any modicum of talent or self respect would go to Amazon rather than Anthropic, OAI, GDM? They bought Adept, everyone left.

AWS is also falling far behind Azure wrt serving AI needs at the frontier. GCP is also growing at a faster rate and has a way more promising future than AWS in this space.

mikert89 9/2/2025||
AWS is very far behind, its already impacting the stock. Without a winning AI offering, all new cloud money is going to GCP and Azure. They have a huge problem
Lyapunov_Lover 9/2/2025|||
> The evidence shows that there is no methodological moat for LLMS.

Does it? Then how come Meta hasn't been able to release a SOTA model? It's not for a lack of trying. Or compute. And it's not like DeepSeek had access to vastly more compute than other Chinese AI companies. Alibaba and Baidu have been working on AI for a long time and have way more money and compute, but they haven't been able to do what DeepSeek did.

postexitus 9/2/2025||
They may not have been leading (as in, releasing a SOTA model), but they definitely can match others - easily, as shown by llama 3/4, which proves the point - there is no moat. With enough money and resources, you can match others. Whether without SOTA models you can make a business out of it is a different question.
Lyapunov_Lover 9/2/2025|||
Meta never matched the competition with their Llama models. They've never even come close. And Llama 4 was an actual disaster.
postexitus 9/2/2025||
I am not a daily user, so only rely on reviews and benchmarks - actual experience may be different.
YetAnotherNick 9/2/2025||
Even in reviews and benchmark, Llama wasn't close to frontier models. Also Llama 2/3 lead in open weight models wasn't more than few months.
ath3nd 9/2/2025|||
> but they definitely can match others - easily, as shown by llama 3/4

Are we living in the same universe? LLAMA is universally recognized as one of the worst and least successful model releases. I am almost certain you haven't ever tried a LLAMA chat, because, by the beard of Thor, it's the worst experience anyone could ever had, with any LLM.

LLAMA 4 (behemoth, whatever, whatever) is an absolute steaming pile of trash, not even close to ChatGPT 4o/4/5/, Gemini(any) and even not even close to cheaper ones like DeepSeek. And to think Meta pirated torrents to train it...

What a bunch of criminal losers and what a bunch of waste of money, time and compute. Oh, at least the Metaverse is a success...

https://www.pcgamer.com/gaming-industry/court-documents-show...

https://www.cnbc.com/2025/06/27/the-metaverse-as-we-knew-it-...

karterk 9/2/2025|||
> The moat of the frontier folks is just compute.

This is not really true. Google has all the compute but in many dimensions they lag behind GPT-5 class (catching up, but it has not been a given).

Amazon itself did try to train a model (so did Meta) and had limited success.

empiko 9/2/2025|||
I switched to Gemini with my new phone and I literally couldn't tell a difference. It is actually crazy how small the cost of switching is for LLMs. It feels like AI is more like a commodity than a service.
lelanthran 9/2/2025|||
> I switched to Gemini with my new phone and I literally couldn't tell a difference. It is actually crazy how small the cost of switching is for LLMs. It feels like AI is more like a commodity than a service.

It is. It's wild to me that all these VCs pouring money into AI companies don't know what a value-chain is.

Tokens are the bottom of the value-chain; it's where the lowest margins exist because the product at that level is a widely available commodity.

I wrote about this already (shameless plug: https://www.rundata.co.za/blog/index.html?the-ai-value-chain )

physicsguy 9/2/2025|||
On top of that, the on-device models have got stronger and stronger as the base models + RL has got better. You can do on your laptop now what 2 years ago was state of the art.
gnfargbl 9/2/2025||||
Which dimensions do you see Google lagging on? They seem broadly comparable on the usual leaderboard (https://lmarena.ai/leaderboard) and anecdotally I can't tell the difference in quality.

I tend personally to stick with ChatGPT most of the time, but only because I prefer the "tone" of the thing somehow. If you forced me to move to Gemini tomorrow I wouldn't be particularly upset.

motorest 9/2/2025||
> Which dimensions do you see Google lagging on? They seem broadly comparable on the usual leaderboard (https://lmarena.ai/leaderboard) and anecdotally I can't tell the difference in quality.

Gemini holds indeed the top spot, but I feel you framed your response quite well: they are all broadly comparable. The difference in the synthetic benchmark from the top spot and the 20th spot was something like 57 points on a scale of 0-1500

Keyframe 9/2/2025||||
" in many dimensions they lag behind GPT-5 class " - such as?

Outside of computer, "the moat" is also data to train on. That's an even wider moat. Now, google has all the data. Data no one else has or ever will have. If anything, I'd expect them to outclass everyone by a fat margin. I think we're seeing that on video however.

ivape 9/2/2025|||
You think Chinese companies are short on data and people? Google doesn’t have an advantage there until the CCP takes on a more hands on approach.

Tin foil hat time:

- If you were a God and you wanted to create an ideal situation for the arrival of AI

- It would make sense to precede it with a social media phenomena that introduces mass scale normalization of sharing of personal information

Yes, that would be ideal …

People can’t stop sharing and creating data on anything, for awhile now. It’s a perfect situation for AI as an independent, uncontrollable force.

rusk 9/2/2025||
> People can’t stop sharing and creating data on anything

Garbage in. Garbage out.

There has never been a better time to produce an AI that mimics a racist uneducated teenager.

Loudergood 9/2/2025|||
We already had Tay. https://en.wikipedia.org/wiki/Tay_(chatbot)
ivape 9/2/2025|||
Do you want to model the world accurately or not? That person is part of our authentic reality. The most sophisticated AI in the world will always include that person(s).
rusk 9/2/2025||
Not in the slightest. I want useful information services that behave in a mature and respectable fashion.
willvarfar 9/2/2025||||
not according to google: “We have no moat, and neither does OpenAI”: the big memo and the big HN thread on same https://news.ycombinator.com/item?id=35813322
Keyframe 9/2/2025|||
a bit weird to think about it since google has literally internet.zip in multiple versions over the years, all of email, all of usenet, all of the videos, all of the music, all of the user's search interest, ads, everything..
lelanthran 9/2/2025|||
> a bit weird to think about it since google has literally internet.zip in multiple versions over the years, all of email, all of usenet, all of the videos, all of the music, all of the user's search interest, ads, everything..

Yeah, Google totally has a moat. Them saying that they have no moat doesn't magically make that moat go away.

They also own the entire vertical which none of the competitors do - all their competitors have to buy compute from someone who makes a profit just on compute (Nvidia, for example). Google owns the entire vertical, from silicon to end-user.

It would be crazy if they can't make this work.

rvba 9/2/2025||||
That's why robots make so much traffic now. Those other companies are trying to get data.

Google theoretically has reddit access. I wonder if they have sort of an internet archive - data unpolutted by LLMs

On a side note, funny how all the companies seem to train on book archivr which they just downloaded from the internet

lrem 9/2/2025|||
> all of the videos, [...], all of the user's search interest, ads, everything..

And privacy policies that are actually limiting what information gets used in what.

Keyframe 9/2/2025||
and even then!
IncreasePosts 9/2/2025|||
That's one person's opinion that works for Google.
seunosewa 9/2/2025|||
counterpoint: with their aggressive crawlers, most AI companies can have as much data as google...
motorest 9/2/2025||||
> This is not really true. Google has all the compute but in many dimensions they lag behind GPT-5 class (catching up, but it has not been a given).

I don't know what you are talking about. I use Gemini on a daily basis and I honestly can't tell a difference.

We are at a point where training corpus and hallucinations makes more of a difference than "model class".

jorisboris 9/2/2025||||
Yes, or Apple who with all the talent don’t manage to pull off anything useful in AI

xAI seems to be the exception, not the rule

rusk 9/2/2025||
Given Apple’s moat is their devices, their particular spin on AI is very much edge focussed, which isn’t as spectacular as the current wave of cloud based LLM. Apple’s cloud stuff is laughably poor.
jeanloolz 9/2/2025||||
Depending on how you look at it I suppose but I believe Gemini surpasses OpenAI on many levels now. Better photo and video models. The leaderboard for text and embeddings are also putting Google on top of Openai.
ebonnafoux 9/2/2025||||
gemini-2.5-pro is ranked number 1 in llmarena (https://lmarena.ai/leaderboard) before gpt-5-high. In the Text-to-Video and Image-to-video, google also have the highest places, OpenAI is nowhere.
IX-103 9/2/2025||
Yes, but they're also slower. As LLMs start to be used for more general purpose things, they are becoming a productivity bottle-neck. If I get a mostly right answer in a few seconds that's much better than a perfect answer in 5 minutes.

Right now the delay for Google's AI coding assistant is high enough for humans to context switch and do something else while waiting. Particularly since one of the main features of AI code assistants is rapid iteration.

janalsncm 9/2/2025||
Anecdotally, Gemini pro is way faster than GPT 5 thinking. Flash is even faster. I have no numbers though.
paulddraper 9/2/2025|||
It doesn’t guarantee success, but the point stands about X and Deepseek
DrScientist 9/2/2025|||
I think I'm right in saying that AWS, rather than deliveries, is by far the most profitable part of Amazon.

Also a smart move is to be selling shovels in a gold rush - and that's exactly what Amazon is doing with AWS.

jojobas 9/2/2025|||
Amazon retail runs on ridiculously low margins compared to AWS. Revenue-wise retail dwarfs AWS, profit-wise it's vice-versa.
StopDisinfo910 9/2/2025|||
The barriers to entry for LLM are obvious: as you pointed, the field is extremely capital intensive. The only reason there are seemingly multiple players is because the amount of capital thrown at it at the moment is tremendous but that's unlikely to last forever.

From my admittely poorly informed point of view, strategy-wise, it's hard to tell how wise it is investing in foundational work at the moment. As long as some players release competitive open weight models, the competitive advantage of being a leader in R&D will be limited.

Amazon already has the compute power to place itself as a reseller without investing or having to share the revenue generated. Sure, they won't be at the forefront but they can still get their slice of the pie without exposing themselves too much to an eventual downturn.

abtinf 9/2/2025|||
The idea that models are copyrightable is also extremely dubious.

So there probably isn’t even a legal moat.

energy123 9/2/2025|||
There's not much of an architectural moat, but there is a methodological moat, such as with RL synthetic data.
VirusNewbie 9/2/2025||
Are you arguing anthropic has more compute than Amazon?

Are you saying the only reason Meta is behind everyone else is compute????

benterix 9/2/2025|||
Think well: why should a platform provider get into a terribly expensive and unprofitable business when they can just provide hardware for those with money to spend? This was AWS strategy for years and it's been working well for them.
motorest 9/2/2025|||
> Are you arguing anthropic has more compute than Amazon?

I wouldn't be surprised if the likes of Anthropic wasn't paying AWS for its compute.

As the saying goes, the ones who got rich from the gold rush were the ones selling shovels.

ospray 9/2/2025||
I wouldn't be surprised if Amazon just buys Anthropic or another lab rather than competing for individuals.
lizknope 9/1/2025||
Does Amazon want to be an AI innovator or an AI enabler?

AWS enables thousands of other companies to run their business. Amazon has designed their own Graviton ARM CPUS and their own Trainium AI chips. You can access these through AWS for your business.

I think Amazon sees AI being used in AWS as a bigger money generator than designing new AI algorithms.

DoesntMatter22 9/1/2025||
Also I think that they realize this is just a money losing proposition right now for the most part. And they're not going to have a problem getting in later when there's a clear solution. Why fight it out? I don't think they're going to miss much because they can use any models they need and as you said some of that stuff may be run on their servers
coredog64 9/2/2025||
I can make a case: Building their own models like Nova and Titan allow them to build up expertise in how to solve hyperscaler problems. Think of it like Aurora, where they have a generally solved problem (RDBMS) but it needs to be modified to work with the existing low-level primitives. Yes, it can be done in the open, but if I'm AWS, I probably want to jealously guard anything that could be a key differentiator.
PartiallyTyped 9/2/2025|||
Reading comments from the appropriate VPs will illuminate the situation.. Swami is looking to democratise AI, and the company is geared towards that more than anything else.

Disclaimer; I work for amzn, opinions my own.

https://aws.amazon.com/blogs/machine-learning/aws-and-mistra...

mips_avatar 9/2/2025|||
I don't know what democratizing AI means, AWS doesn't have the GPU infrastructure to host inference or training on a large scale.
lizknope 9/2/2025|||
I started this part of the thread and mentioned Trainium but the person you replied to gave a link. Follow that and you can see Amazon's chips that they designed.

Amazon wants people to move away from Nvidia GPUs and to their own custom chips.

https://aws.amazon.com/ai/machine-learning/inferentia/

https://aws.amazon.com/ai/machine-learning/trainium/

mips_avatar 9/2/2025||
TBH I was just going off of that I've heard AWS is a terrible place to get h100 clusters at scale. And for the training I was looking at we didn't really want to consider going off CUDA.
PartiallyTyped 9/2/2025|||
Huh? That’s quite the assertion. They provide the infrastructure for Anthropic, so if that’s not large scale idk what is.
ZeroCool2u 9/2/2025||
They have to use GCP as well, which is arguably a strong indictment of their experience with AWS. Coincidentally, this aligns with my experience trying to train on AWS.
JCM9 9/2/2025|||
It’s unclear why Swami is put in charge of this stuff. He’s not a recognized leader in the space and hasn’t delivered a coherent strategy. However, per the article Amazon is struggling to hire and retain the best talent and thus it may just be the best they have.
code4tee 9/2/2025||
Who is “Swami?” Although I suppose that’s just making the point that Amazon’s folks aren’t recognized leaders in this space.
justinator 9/2/2025|||
Selling pick axes vs. mining for gold yet again!
dangus 9/2/2025||
I'm glad this analogy is at the top. I think that some large companies like AWS really should not try to blow money on AI in ways that only make a lot more sense for companies like Meta, Google, and Apple. AWS can't trap you in their AI systems with network effects that the other competitors can.

Companies like OpenAI and Anthropic are still incredibly risky investments especially because of the wild capital investments and complete lack of moat.

At least when Facebook was making OpenAI's revenue numbers off of 2 billion active users it was trapping people in a social network where there were real negative consequences to leaving. In the world of open source chatbots and VSClone forks there's zero friction to moving on to some other solution.

OpenAI is making $12 billion a year off of 700 million users [1], or around $17 per user annually. What other products that have no ad support perform that badly? And that's a company that is signing enterprise contracts with companies like Apple, not just some Spotify-like consumer service.

[1] This is almost the exact same user count that Facebook had when it turned its first profit.

jsnell 9/2/2025||
> OpenAI is making $12 billion a year off of 700 million users [1], or around $17 per user annually. What other products that have no ad support perform that badly?

That's a bit of a strange spin. Their ARPU is low because they are choosing not to monetize 95% of their users at all, and for now are just providing practically limitless free service.

But monetising those free users via ads will pretty obviously be both practical and lucrative.

And even if there is no technical moat, they seem to have a very solid mind share moat for consumer apps. It isn't enough for competitors to just catch up. They need to be significantly better to shift consumer habits.

(For APIs, I agree there is no moat. Switching is just so easy.)

chii 9/2/2025|||
> They need to be significantly better to shift consumer habits.

i am hoping that a device local model would eventually be possible (may be a beefy home setup, and then an app that connects to your home on mobile devices for use on the go).

currently, hardware restrictions prevent this type of home setup (not to mention the open source/free models aren't quite there and difficulty for non-tech users to actually setup). However, i choose to believe the hardware issues will get solved, and it will merely be just time.

The software/model issue, on the other hand is harder to see solved. I pin my hopes onto deepseek, but may be meta or some other company will surprise me.

dangus 9/3/2025||
I think you're super wrong about the local model issue and that's a huge risk for companies like OpenAI.

Apple products as an example have an excellent architecture for local AI. Extremely high-bandwidth RAM.

If you run an OSS model like gpt-oss on a Mac with 32GB of RAM it's already very similar to a cloud experience.

chii 9/4/2025||
i dont have the hardware to run or try them, but from the huggingfaces discussion forums, gpt-oss seems to be pretty hard censored. I would not consider it as being a viable self-hosted LLM except for the very narrowest of domains (like coding for example).
dangus 9/4/2025||
I'm not sure where censorship comes in with this discussion, it seems like cloud models are censored as well? And local models are frequently created that are abliterated? Correct me if I'm wrong or misunderstanding you.

Either way, it's just an example model, plenty of others to choose from. The fact of the matter is that the base model MacBook Air currently comes with about half as much RAM as you need for a really really decent LLM model. The integrated graphics are fast/efficient and the RAM is fast. The AMD Ryzen platform is similarly well-suited.

(Apple actually tells you how much storage their local model takes up in the settings > general > storage if you're curious)

We can imagine that by 2030 your base model Grandma computer on sale in stores will have at least 32GB of high-bandwidth RAM to handle local AI workflows.

chii 9/4/2025||
which is why i made the claim that hardware "problem" will be solved in the near future (i don't consider it solved right now, because even the apple hardware is too expensive and insufficient imho), but the more difficult problem of model availability is much, much harder to solve.
8n4vidtmkvmk 9/2/2025||||
There does seem to be a mind share mote, but all you have to do is piss off users a little bit when there's a good competitor. See Digg to Reddit exodus.
hiatus 9/2/2025|||
Which advertisers would risk having their product advertised by models that have encouraged kids to commit suicide?
Mars008 9/2/2025|||
This is not mutually exclusive. They have home made robots and let others sell robots on their website. The same way they want to use AI and have resources to make their own. One way to use is to drive those robots. Another to enhance their web site. Current version sucks. I recently return the item because their bot told it has functionality while in fact it didn't.
rswail 9/2/2025||
AWS is very much not the same as Amazon the product selling website.

The two are effectively separate businesses with a completely separate customer base.

GuB-42 9/1/2025||
> Of course, the AI talent war may end up being an expensive and misguided strategy, stoked by hype and investor over-exuberance.

To me, that's a pretty good explanation.

The world is crazy with AI right now, but when we see how DeepSeek became a major player at a fraction of the cost, and, according to Google researchers, without making theoretical breakthroughs. It looks foolish to be in this race, especially now that we are seeing diminishing returns. Waiting until things settle, learning from others attempts and designing your system not for top performance but for efficiency and profit seems like a sane strategy.

And it is not like Amazon is out of the AI game, they have what really matters: GPUs. This is a gold rush, and as the saying goes, they are more interested in selling pickaxes that finding gold.

bee_rider 9/2/2025||
I guess Amazon can also probably afford to wait until somebody comes up with an application for AI that is, like, something Amazon can actually sell or use…

Customer service bots? Maybe. Coding bots? I bet they use some internally. Their customers don’t really need them, or if the customer does, the customer can run it on their side.

janalsncm 9/2/2025|||
As I’ve said before, the kind of AI that makes money is called machine learning. Pricing ads, recommending products, improving search, optimizing routing.

In general these fall into the category of things humans cannot do at the scale and speed necessary to run SaaS companies.

Many of the things LLMs attempt to do are things people already do, slowly and relatively accurately. But until hallucinations are rare, slow expensive humans will typically need to be around. The AI booster’s strategy of ignoring/minimizing hallucinations or equivocating with human fallibility doesn’t work for businesses where reliability is important.

Note that ML algorithms are highly imperfect as well. Uber’s prices aren’t optimal. Google search surfaces tons of spam. But they are better than the baseline of no service exists.

wiether 9/2/2025|||
You mean something like Kiro?

https://kiro.dev/

janalsncm 9/2/2025|||
AI is huge, it’s just not the only thing happening in tech right now. I say this as an MLE but it seems really unbalanced that LLMs have gotten trillions in investment when other groundbreaking innovations like battery improvements or fusion power or gene therapy have gotten substantially less attention.

Disagree re: DeepSeek theoretical breakthroughs, MLA and GRPO are pretty good and paved the way for others e.g. Kimi K2 uses MLA for a 1T MoE.

bigbuppo 9/2/2025||
Big money investors know that real tangible products that have real tangible benefits aren't usually decimal-point-shifting-your-net-worth jackpots. They make money, sure, but factories can't be built in a day. Also, if they can make AI work as it says on the box, they'll be able to get rid of all those pesky employees and turn their companies into pure money-printing enterprises.

Pay no attention to the cracks that are showing. Nevermind the chill. Everything is fine.

energy123 9/2/2025||
I don't agree, based on my experience trying all Deepseek models on real world software tasks.
HuwFulcher 9/1/2025||
AWS specifically have really dropped the ball on this.

I interact regularly with AWS to support our needs in MLOps and to some extent GenAI. 3 of the experts we talked to have all left for competitors in the last year.

re:Invent London this year presented nothing new of note on the GenAI front. The year before was full of promise on Bedrock.

Outside of AWS, I still can’t fathom how they haven’t integrated an AI assistant into Alexa yet either

jackwilsdon 9/1/2025||
There's Alexa+ [0] which uses generative AI but it's planned to be a paid option at $20/mo.

[0]: https://www.aboutamazon.com/news/devices/new-alexa-generativ...

spanishgum 9/1/2025|||
> Alexa+ costs $19.99 per month, but all Amazon Prime members will get it for free.

I'm curious if non prime members make up a big market for Alexa. I rarely use my smart devices for anything beyond lights, music, and occasional Q&A, and certainly can't see myself paying 20$/month for it.

vitus 9/1/2025|||
I'm curious why anyone would pay $19.99/month for Alexa+ rather than just buy a Prime membership (which is $14.99/month).

Unless of course this is going to be met with a price hike for Prime...

wccrawford 9/1/2025||
That's what happened with Prime TV, and I absolutely expect it for the AI, too. And it might finally mean I cancel my Prime membership.
x2tyfi 9/1/2025||
Amazon Prime’s price hikes have a predictable cadence: * 2014: $79 to $99

* 2018: $99 to $119

* 2022: $119 to $139

We should expect a price hike from $139 to $159 in 2026, assuming the trend continues.

echelon 9/1/2025||
Meanwhile, Google Fiber has been the same price for 15 years. At least according to the billboard outside my window.
ls612 9/1/2025||
It works out for them because bandwidth gets cheaper over time but inflation eats away at that. $70 today is like $50 back in 2010 when GFiber first launched.
wenc 9/1/2025|||
If ChatGPT's Advanced Voice Mode could be served through an always-on device like Alexa, I'd pay for it.

Hmmm... maybe I can install do this through a cheap tablet....

janalsncm 9/2/2025|||
Model sizes have come down enough that it will be possible to run smart home control and simple Q&A entirely locally.
shaklee3 9/2/2025|||
you can do this with home assistant already
serial_dev 9/1/2025||||
Alexa consistently fails with the simplest of questions.

Only thing it can do is set a timer, turn off a light and play music.

It is still nice, but it’s so frustrating when a question pops into my mind, and I accidentally ask Alexa just to get reminded yet again how useless it is for everything but the most basic tasks.

And no, I won’t pay 240 dollars a year so that I can get a proper response to my random questions that I realistically have only about once a week.

WaltPurvis 9/1/2025|||
> Only thing it can do is set a timer, turn off a light

And it can't even do that without an Internet connection. As someone who experiences annoyingly frequent outages, it never ceases to boggle my mind that I have a $200 computer, with an 8" monitor and everything, that can't even understand "set a timer for 10 minutes" on its own.

prmoustache 9/2/2025||
Why did you guys bought this in the first place?
serial_dev 9/2/2025|||
Fair question, playing music and timers are nice, but their AI is abysmal so you can imo not ask it to do anything else. I previously worked for a smart home company so I wanted to test out product’s integration with Alexa, so we have some at home. I’m planning to get rid of them, though, and only leave one in the kitchen.
WaltPurvis 9/3/2025|||
Originally bought it for an elderly parent in assisted living; wasn't as useful as we'd hoped; repurposed it as a kitchen-timer/music-box, for which it works adequately as long as the Internet connection is up. I would not recommend anyone else buy one.
seviu 9/1/2025||||
and despite all this, I would pay 240$ a year so that Siri can reliably do what Alexa does today

oh the irony

ghaff 9/1/2025||||
Alexa has pretty much zero value for me.

Being able to just order something with zero shipping has a ton of value. I could drive down the street but it would still be an hour at the end of the day.

Video streaming has some value but there are a lot of options.

bboygravity 9/1/2025|||
Just pay 0 USD and use Grok app for free?

By far the best thing currently available.

echelon 9/1/2025|||
I'm predicting that Grok fails simply due to half (?) the software engineering populating not wanting to use anything Musk has developed.

Grok has to be more than n-times (2x?) as good as anything else on the market to attain any sort of lead. Falling short of that, people will simply choose alternatives out of brand preference.

This might be the first case of a company having difficulty selling its product, even if it's a superior product, due to its leader being disliked. I'm not aware of any other instances of this.

Maybe if Musk switches to selling B2B and to the US government...

If you piss off half of your possible user base, adoption becomes incredibly difficult. This is why tech and business leaders should stay out of politics.

lelanthran 9/2/2025|||
> I'm predicting that Grok fails simply due to half (?) the software engineering populating not wanting to use anything Musk has developed.

I think that's a wildly optimistic figure on your part.

Lets assume that developers are split almost 50/50 on politics.

Of that 50% that follows the politics you approve off, lets err on the side of your argument and assume that 50% of those actually care enough to change their purchases because of it.

Of the 25% we have left, lets once again err on the side of your argument and assume 50% care enough about the politics to disregard any technology superiority in favour of sticking to their political leanings.

Of the 12.5% left, how many do you think are going to say "well, let me get beaten by my competitors because I am taking a stand!", especially when the "beaten" means a comparative drop in income?

After all, after nazi-salute, mecha-hitler, etc blew up, by just how much did the demand for Teslas fall?

The fraction of the population that cares enough about these (on both sides) things are, thankfully, single-digit percentages. Maybe even less.

manishsharan 9/2/2025||
>>After all, after nazi-salute, mecha-hitler, etc blew up, by just how much did the demand for Teslas fall?

I had been saving up for a Tesla but now I am looking elsewhere. I think a lot of people are doing the same here in Canada. You can grok the actual numbers if you want.

rs186 9/2/2025||||
Yeah, a simple example is to just look at how many companies/universities have ChatGPT vs Grok subscriptions internally. I can imagine that many people would have a problem with subscribing to Grok, even if its performance is comparable.
redditor98654 9/2/2025||||
Hmmm, thinking aloud, Oracle?
qcnguy 9/2/2025||||
> This is why tech and business leaders should stay out of politics.

Yeah but they don't stay out of politics, do they? Gemini painting black Nazis was a deliberate choice to troll the vast majority of the population who isn't woke extremists.

My family subscribes to Grok and it's because of politics, not in spite of it. The answer gap isn't large today but I support Musk's goal of building a truth seeking AI, and he is right about a lot of things in politics too. Grok might well fail financially, the current AI market is too competitive and the world probably doesn't need so many LLM companies. But it's good someone wants AI to say what's true and not merely what's popular in its training set.

transcriptase 9/1/2025|||
[flagged]
bee_rider 9/1/2025|||
I think their point was that becoming very involved in politics in a way that alienates half of the population has tarnished Musk’s brand (although, I’d personally adjust that down to more like 1/3). If the point of your whataboutism is that previously it alienated the other 1/3… that doesn’t seem to improve their odds, right?

If anything they’ve now pissed off 2/3 of the population at some point or another.

felixgallo 9/1/2025|||
None of that actually happened.
inquirerGeneral 9/2/2025||
Literally documented proof.
felixgallo 9/2/2025||
Literally not. Elon Musk even published the infamous algorithm which he had claimed silenced and censored right wing voices. The only thing the algorithm did that was odd was that it had a special case written into it to boost Elon Musk. You can go look it up.
kentm 9/1/2025|||
Mechahitler, the South African genocide debacle, explicitly checking Elons Twitter feed, “You get your news from infowars” system prompts, etc have basically made Grok not a real option for me. I do not want to use a product that is specifically being engineered to be a right wing disinformation machine.

And no, generic brand safety mishaps are not the same; everyone is not doing this.

israrkhan 9/1/2025||||
I enabled Alexa+ few days ago on my devices. Everyone in our home immediately disliked the new Alexa. There were some fairly basic things that Alexa+ cannot do, and Alexa was able to do. Some fairly simple question/answering tasks, and questoins about status of an order.
iLoveOncall 9/1/2025||||
It will be free if you have a Prime subscription (which means nobody will ever pay for it given Prime is cheaper and you get much more included).

But the project is pretty much dead, it was supposed to launch in February or March and is still not anywhere close to being out.

HuwFulcher 9/1/2025||||
Yes have seen about that. It’s crazy to me that they still haven’t released it. Really think it could save a dying product
Jordan-117 9/1/2025|||
They basically have with Alexa+. It's slightly more limited than ChatGPT, but it sounds much more realistic than stock Alexa and blows it out of the water in terms of smarts. The old model was basically a Siri-like "set timers and check the weather with specific commands," plus some hit-or-miss skills you had to install separately. But the new one gives much more of a sense of understanding your question and can carry on conversations with contextual responses. I've been pretty impressed with it, and the nature of the Echo device makes it much easier to query at will than having to open the ChatGPT app and switch to voice mode.
chihuahua 9/2/2025||
I agree. I think the Echo devices are good for certain kinds of voice-driven LLM experience. Although it's not that useful for detailed responses and serious questions, since you can't go back and read its response again.
el_benhameen 9/1/2025|||
Having briefly interacted with AWS Q out of curiosity, I can see why they haven’t pushed much out publicly. Aside from giving someone a chuckle when they decided to call its suggestions “Q Tips”, it’s functionally useless.
kotaKat 9/1/2025|||
They all but abandoned Astro, their home robot. My suspicion (and information I've heard internally) all but points at them only using Astro as a testbed for self-navigating warehouse robotics, and now that they got what they wanted out of it, the Vesta team basically got thrown to the wolves.
newsclues 9/1/2025||
They bought a company that did warehouse robots called kiva before that

https://en.m.wikipedia.org/wiki/Amazon_Robotics

bee_rider 9/1/2025|||
Lingo question: is MLOps like devops for ML, or like flops for ML? I wonder because… actually, either case seems like somewhere Amazon might be losing experts to hot startups.
snoman 9/2/2025|||
The former. Basically: build, train, test, deploy, monitor, repeat for ML algos.
HuwFulcher 9/2/2025|||
As the other response said. It’s DevOps for ML. They have Amazon SageMaker which is the managed ML/MLOps offering that we use extensively because we’re a small team. The documentation is awful
coredog64 9/2/2025||
All AWS documentation is awful.
mv4 9/1/2025|||
Isn't "Alexa+" doing this? (I have not signed up)
liquidpele 9/1/2025|||
Didnt they basically can most of Alexa a few years ago? I think they realized asking a device questions doesn’t generate profit.
redditor98654 9/2/2025||
They thought Alexa will enable users to buy more from Amazon just by voice. But most users turned out like me. I would not spend a single dollar on Amazon without actually seeing the item on my mobile or desktop. I wouldn’t even add to cart via Alexa. That’s not an ideal user for device and service that requires hundreds of millions to run.
ghaff 9/2/2025||
You saw this with Amazon Dash buttons too. This idea that users would just go "Hey order me some more Tide" and Amazon would just do the right thing at the right price like some sort of intelligent personal assistant. Which it by no means is.
BoredPositron 9/1/2025||
They still have no serverless inference.
OliverGuy 9/2/2025|||
SageMaker have serverless inference endpoints
BoredPositron 9/2/2025||
Only if the pipe is defined if you bring your own pipe sage maker offers nothing.
zmmmmm 9/2/2025|||
Bedrock is not usable for that?
BoredPositron 9/2/2025||
Not for custom models.
jacquesm 9/2/2025||
That's because there is no lock-in in the current ecosystems for AI. Yet. But once AIs become your lifetime companion that know everything there is to know about you and the lock-in is maximized (imagine leaving your AI provider will be something like a divorce with you losing half your memory) these parties will flock to it.

The blessing right now is the limit to contextual memory. Once those limits fall away and all of your previous conversations are made part of the context I suspect the game will change considerably, as will the players.

IceHegel 9/2/2025||
There's a chance this memory problem is not going to be that easy to solve. It's true context lengths have gotten much longer, but all context is not created equal.

There's like a significant loss of model sharpness as context goes over 100K. Sometimes earlier, sometimes later. Even using context windows to their maximum extent today, the models are not always especially nuanced over the long ctx. I compact after 100K tokens.

Ozzie_osman 9/2/2025|||
But you don't have to hold the entire memory in context. You just need to perfect techniques to pull in parts of the context that you need. This can be done via RAG, multi-agent architectures, etc. It's not perfect but it will get better over time.
elorant 9/2/2025||||
From my experience context window by itself tells half the story. You load a big document that’s 200k tokens and ask it a question, it will answer just fine. You start a conversation that soon enough balloons past 100k then it starts losing coherence pretty quickly. So I guess batch size plays a more significant role.
IceHegel 9/4/2025||
By batch size, do you mean the number of tokens in the context window that were generated by the model vs. external tokens?

Because my understandings is that, however you get to 100K, the 100,001st token is generated the same way as far as the model is concerned.

luckydata 9/2/2025||||
I'm over simplifying here but graph database and knowledge graphs exist. An LLM doesn't need to preserve everything in context, just what it needs for that conversation.
IceHegel 9/4/2025||
Unless there is a trick that I am missing, I don't think this will work by itself. The fundamental thing is what can the model attend to as it generates the next token.

If you give a summary+graph to the model, it can still only attend to the summary for token 1. If it's going to call a tool for a deeper memory, it still only gets the summary when it makes the decision on what to call.

You get the same problem when asking the model to make changes in even medium-sized code bases. It starts from scratch each time, takes forever to read a bunch of files, and sometimes it reads the right stuff, other times it doesn't.

spiderfarmer 9/2/2025|||
Context will need to go in layers. Like when you tell someone what you do for a living, your first version will be very broad. But when they ask the right questions, you can dive into details pretty quick.
visarga 9/2/2025|||
Export your old chats and put them in a RAG system accessible on the new LLM provider. I did it. I made my chat history into a MCP tool I can use with Claude Desktop or Cursor.

Ever since I started taking care of my LLM logs and memory, I had no issue switching model providers.

luckydata 9/2/2025||
Do you have some kind of tooling to automate the process? Would like to try it.
sebastianz 9/2/2025|||
> But once AIs become your lifetime companion that know everything there is to know about you and the lock-in is maximized

Why? It's just a bunch of text. They are forced by law to allow you to export your data - so you just take your life's "novel" and copy paste it into their competition's robot.

Steve16384 9/2/2025||
It's never quite that straightforward, or perceived as that straightforward. That's why most people just renew their insurance as it's easier than messing about changing and worrying if it will be any better. And how easy is it to transfer emails to another provider?
zwnow 9/2/2025|||
Who even wants all your previous conversations taken into account for everything you do? How do you grow from never forgetting anything, making mistakes, etc? This is highly dystopian and I sure hope this will forever just be a fantasy.
visarga 9/2/2025||
I have made 100MB of my own chat logs into a RAG memory and was surprised I didn't like using it much. Why? it floods the LLM with so much prior thinking that it loses the creative spark. I now realize the sweet spot is in the middle - don't recall everything, strategic disclosure to get the max out of AI. LLM memory should be like a sexy dress - not too long, not too short. You get the most creative outputs when you hide part of your prior thinking and let the mode infer it back.
zwnow 9/2/2025||
I am not an AI enthusiast but I get what you're saying. I occasionally use ChatGPT due to Google being enshittified pretty much. I often do not like the things it tells me and I for sure do not like it complimenting everything I do, but thats something other people seem to like... In my experience starting a fresh chat after a while of back and forth can really help, so I agree with you. Having little to zero prior context is actually the point of view one needs sometimes.
paool 9/2/2025|||
in order for that lifetime companion, we'll need to make a leap in agentic memory.

how do you know memory won't be modular and avoid lock-in?

I can easily see a decentralized solution where the user owns the memory, and AIs need permission to access your data, which can be revoked.

randomNumber7 9/2/2025|||
I can easily see a world where users own the devices they buy and install the software they want, but the trend goes in the other direction.
ivape 9/2/2025|||
in order for that lifetime companion, we'll need to make a leap in agentic memory.

Well, let’s take your life. Your life is about 3 billion seconds (100 year life). That’s just 3 billion next-tokens. The thing you do on second N is just, as a whole, a next token. If next-token prediction can be scaled up such that we redefine a token from a part of language to an entire discrete event or action, then it won’t be hard for the model to just know what you will think and do … next. Memory in that case is just the next possible recall of a specific memory, or next possible action, and so on. It doesn’t actually need all the memory information, it just needs to know that that you will seek a specific memory next.

Why would it need your entire database of memories if it already knows you will be looking for one exact memory next? The only thing that could explode the computational cost of this is if dynamic inputs fuck with your next token prediction. For example, you must now absolutely think about a Pink Elephant. But even that is constrained in our material world (still bounded physically, as the world can’t transfer that much information through your senses physically).

A human life up to this exact moment is just a series of tokens, believe it or not. We know it for a fact because we’re bounded by time. The thing you just thought was an entire world snapshot that’s no longer here, just like an LLM output. We have not yet trained a model on human lives yet, just knowledge.

We’re not done with the bitter lesson.

diffeomorphism 9/2/2025|||
Basic questions: what does a GDPR request get you? Wouldn't providers like you to switch to them?

Just look at the smartphone market.

lelanthran 9/2/2025||
> Once those limits fall away and all of your previous conversations are made part of the context I suspect the game will change considerably, as will the players.

I dunno if this is possible; sounds like an informally specified ad-hoc statement of the halting problem.

HuwFulcher 9/1/2025||
Web archive: https://web.archive.org/web/20250901163205/https://www.busin...
iLoveOncall 9/1/2025||
Yeah Amazon is massively struggling to hire due to the extremely bad reputation of Andy Jasshole and the RTO 5 policy, and this is not exclusive to AI talents, but is the case for every single role. We have had reqs open for a year in my team and nobody wants to join.

Truthfully, I don't think anyone would recommend their acquaintances to join Amazon right now.

That said, Amazon is actually winning the AI war. They're selling shovels (Bedrock) in the gold rush.

__turbobrew__ 9/1/2025||
I have had multiple recruiter reachouts from AWS who obviously read my resume and are interested in short cutting me into a senior role at AWS doing interesting things, but at this point AWS reputation is so bad I don’t even entertain such offers.

For senior in-demand talent you are not desperate, and really only desperate people go to work for AWS as they don’t have any better options at a company which respects their employees.

alkonaut 9/1/2025|||
It's not like it had a good reputation earlier either (as a company, perhaps less problematic as an employer). But if I was offered multiple FAANG positions because I had some really attractive skill set, then I'd want a _lot_ more to work at Meta or Amazon than Netflix or Google, just based on my view of the corporate evilness. It's probably completely unfounded, but the fact I have that feeling just shows they haven't taken care of their brand.
Aeolun 9/2/2025||
I think I’d work at Amazon purely to save the world from the abomination that is cloudformation.
coliveira 9/1/2025|||
Amazon having trouble to hire I think it is a well deserved result. I hope they never hire great talent again. Lately I heard they're looking for contract hires, which seems to fit their cheapness and lack of ability to attract talent.
untrust 9/2/2025||
At some point, the turnover has to lead to "the blind leading the blind" with nobody having a clearer big picture view on the software they own. This can't be a productive way to run a company, but they seem to persist nonetheless. It may take many years, but I imagine their software will rot from within due to their hiring practices.
coliveira 9/2/2025||
Exactly, Amazon practices the equivalent of a decimation of their workforce. This may even work in the initial years, but over time they'll quickly lose their best minds and the software will be unmaintainable.
chihuahua 9/2/2025|||
It's almost funny how they just don't give a shit about being an attractive employer. They never have. Going back to 2002, it's always been "if you don't like it, there's the door."

It seems that they just don't care about the high turnover.

mikert89 9/1/2025|||
Bedrock is terrible and usage is not high, they cant even serve the anthropic models at scale.
cyberax 9/1/2025|||
Bedrock? It's like a vibe-coded "router" app. It really doesn't provide anything that is not provided by countless other companies.

AWS is falling behind even in their most traditional area: renting compute capacity.

For example, I can't easily run models that need GPUs without launching classic EC2 instances. Fargate or Lambda _still_ don't support GPUs. Sagemaker Serverless exists but has some weird limits (like 10GB limit on Docker images).

iLoveOncall 9/1/2025|||
Bedrock is not at all a router. They do provide a routing capability now, but at its core it's a wrapper around models so you can interact with any model with the same unique API.

> For example, I can't easily run models that need GPUs without launching classic EC2 instances.

Yeah okay, but you can run most entreprise-level models via Bedrock.

Aeolun 9/2/2025||
Only if you want them to go to random inference regions. God forbid you would want inference in a single region. Then you need to be satisfied with 12 month old models that have been superseded 2 times already.
internetter 9/1/2025||||
AWS doesn't need to do anything innovative and the enterprises still come. Every product AWS sells has a similar offering from a competitor. But businesses stick with amazon because its all in one. They get bills from one company, trust their security with one company, ect. The only thing that matters to AWS is its reputation.
cyberax 9/2/2025||
This works up to a point. I'm extremely familiar with AWS, but we simply _can't_ use it to train our models because it costs 2-3 times more than their competitors. All while requiring us to basically bring up all the infrastructure around maintaining the training cluster ourselves.
nickysielicki 9/2/2025|||
Frankly, this is strictly a positive signal to me.

Fargate and lambda are fundamentally very different from EC2/nitro under the hood, with a very different risk profile in terms of security. The reason you can't run GPU workloads on top of fargate and lambda is because exposing physical 3rd-party hardware to untrusted customer code dramatically increases the startup and shutdown costs (ie: validating that the hardware is still functional, healthy, and hasn't been tampered with in any way). That means scrubbing takes a long time and you can't handle capacity surges as easily as you can with paravirtualized traditional compute workloads.

There are a lot of business-minded non-technical people running AWS, some of which are sure to be loudly complaining about this horrible loss of revenue... which simply lets you know that when push comes to shove, the right voices are still winning inside AWS (eg: the voices that put security above everything else, where it belongs).

cyberax 9/2/2025||
> Frankly, this is strictly a positive signal to me.

How?

> The reason you can't run GPU workloads on top of fargate and lambda is because exposing physical 3rd-party hardware to untrusted customer code dramatically increases the startup and shutdown costs

This is BS. Both NVidia and AMD offer virtualization extensions. And even without that, they can simply power-cycle the GPUs after switching tenants.

Moreover, Fargate is used for long-running tasks, and it definitely can run on a regular Nitro stack. They absolutely can provide GPUs for them, but it likely requires a lot of internal work across teams to make it happen. So it doesn't happen.

I worked at AWS, in a team responsible for EC2 instance launching. So I know how it all works internally :)

nickysielicki 9/2/2025||
You'd have to build totally separate datacenters with totally different hardware than what they have today. You're not thinking about the complexity introduced by the use of pcie switches. For starters, you don't have enough bandwidth to saturate all gpus concurrently, they're sharing pcie root complex bandwidth, which is a non-starter if you want to define any kind of reasonable SLA. You can't really enforce limits, either. Even if you're able to tolerate that and sell customers on it, the security side is worse. All customer GPU transactions would be traversing a shared switch fabric, which means noisy bursty neighbors, timing side-channels, etc., etc., etc.
cyberax 9/2/2025||
> You'd have to build totally separate datacenters with totally different hardware than what they have today.

No? You can reset GPUs with regular PCI-e commands.

> You can't really enforce limits, either. Even if you're able to tolerate that and sell customers on it, the security side is worse

Welp. AWS is already a totally insecure trash, it seems: https://aws.amazon.com/ec2/instance-types/g6e/ Good to know.

Not having GPUs on Fargate/Lambda is, at this point, just a sign of corporate impotence. They can't marshal internal teams to work together, so all they can do is a wrapper/router for AI models that a student can vibe-code in a month.

We're doing AI models for aerial imagery analysis, so we need to train and host very custom code. Right now, we have to use third-parties for that because AWS is way more expensive than the competition (e.g. https://lambda.ai/pricing ), _and_ it's harder to use. And yes, we spoke with the sales reps about private pricing offers.

nickysielicki 9/2/2025||
none of this applies to g6e because it doesn’t have/need a pcie switch, because it doesn’t have rdma support (nor nvlink), which means sriov just works.
cyberax 9/2/2025||
And what is your point? What is stopping AWS from using g6e or g6dn on Fargate to keep up with the competitors?
nickysielicki 9/2/2025||
Nothing, but IMO it’s a bad idea. 1. customers who build a compute workload on top of fargate have no future, newer hardware probably won’t ever support it. 2. It’s already ancient hardware from 3 years ago. 3. AWS now has to take responsibility for building an AMI with the latest driver, because the driver must always be newer than whatever toolkit is used inside the container. 4. AWS needs to monitor those instances and write wrappers for things like dgcm.
cyberax 9/2/2025||
Fargate is simply a userspace application to manage containers with some ties-in to the AWS control plane for orchestration. It allows users to simply request compute capability from EKS/ECS without caring about autoscaling groups, launch templates, and all the other overhead.

"AWS Lambda for model running" would be another nice service.

The things that competitors already provide.

And this is not a weird nonsense requirement. It's something that a lot of serious AI companies now need. And the AWS is totally dropping the ball.

> AWS now has to take responsibility for building an AMI with the latest driver, because the driver must always be newer than whatever toolkit is used inside the container.

They already do that for Bedrock, Sagemaker, and other AI apps.

philipallstar 9/1/2025||
> the RTO 5 policy

I'm no expert, but I'm pretty sure this[0] is what RTO 5 is.

[0] https://www.phoenixcontact.com/en-pc/products/bolt-connectio...

spanishgum 9/1/2025|||
RTO 5 is "return to office, 5 days a week"
mensetmanusman 9/1/2025||
RTO 996 where it at
almostgotcaught 9/1/2025|||
[flagged]
riknos314 9/1/2025||
Amazon seems to be taking the "When Everybody Is Digging for Gold, It’s Good To Be in the Pick and Shovel Business" approach here.

Don't need to train the models to make money hosting them.

frollogaston 9/1/2025|
There's also dogfooding though
shagie 9/2/2025|
I am reminded of the Uncomfortable Amazon Truths ( https://news.ycombinator.com/item?id=20980025 ) by Corey Quinn.

While they're protected now, https://news.ycombinator.com/item?id=20980557 quotes the one I recall...

      - Nobody has figured out how to make money from AI/ML other than by selling you a pile of compute and storage for your AI/ML misadventures.
https://threadreaderapp.com/thread/1173367909369802752.html maintains the entire chain of tweets.
janalsncm 9/2/2025||
> Nobody has figured out how to make money from AI/ML

This is clearly not true. Google Ads? Every recommender system? Waymo self-driving? Uber routing algorithms?

If you swapped out ML for LLMs I would largely agree.

QuinnyPig 9/2/2025||
I do want to point out that I wrote this in 2019. It was a vastly different landscape, and AWS did a terrible job of promoting the ML value proposition.
shagie 9/2/2025||
I would love to see a retrospective on the AWS uncomfortable truths as a blog post and how they've held out over the past couple years.

2019 was a different time - though I suspect that your statement about making money (as in profit) rather than just revenue (reselling compute for less than you bought it) would hold true for most companies.

QuinnyPig 9/2/2025||
That’s an awesome idea. Let me get after it.
shagie 9/2/2025||
One of the debates I have with devops types is if OCI (Oracle Cloud Infrastructure) is a worse name than ICP (IBM Cloud Private).

And would this be admitting defeat to the powers of Terrible Orange Website to get you to write more?

As a side, in 2019 about a week after your tweets I was at a training session for Rancher which worked a reference to one of them into a joke.

QuinnyPig 9/4/2025||
I’m always amazed by how far my humor seemed to spread in those days.
chillee 9/2/2025||
Clearly not true anymore given OpenAI and Anthropic's revenue growth.
shagie 9/2/2025||
Revenue... yes. Profit is still an open question.

https://www.cnbc.com/2025/08/08/chatgpt-gpt-5-openai-altman-...

> Last year, OpenAI expected about $5 billion in losses on $3.7 billion in revenue. OpenAI’s annual recurring revenue is now on track to pass $20 billion this year, but the company is still losing money.

> “As long as we’re on this very distinct curve of the model getting better and better, I think the rational thing to do is to just be willing to run the loss for quite a while,” Altman told CNBC’s “Squawk Box” in an interview Friday following the release of GPT-5.

Selling compute for less than it cost you will have as much revenue as you want to pay for.

soared 9/2/2025|||
Anthropic founder described it as: if each model were a company, they be hugely profitable. It looks bad since when the model you trained in 2024 is generating net positive revenue, you’re also training a more expensive model for 2025 that won’t generate revenue until then. So currently, they’re always burning more cash than they’re bringing in, under the expectation that every model will increase revenue even more. Who knows how long that lasts, but it’s working so far.

Paraphrase is from the podcast he was in with the stripe founder, cheeky pints I think

janalsncm 9/2/2025||
Which is not a good comparison because the LLMs are products not companies. If they are companies, they are competing against each other for revenue.

If I switch from Gemini Pro to Opus, that is good for Anthropic. If I switch from Opus 4 to 4.1, that’s not as good for Anthropic.

Sad that these CEOs can get away with this level of sophistry.

jquery 9/2/2025||||
>Revenue... yes. Profit is still an open question.

could have said the same thing about most FAANG companies at one point or another.

janalsncm 9/2/2025|||
The problem for OpenAI and the difference with other FAANGs is that they don’t own the internet. Other companies are able to replicate their product, which prevents them from fully realizing profits.

Google doesn’t have this problem. They only run Google ads in their search results. Same thing for Facebook.

jcranmer 9/2/2025|||
If I have the numbers right, OpenAI will burn more money this year alone than all of those prior companies did in their entire profitless phase of existence.
chillee 9/2/2025|||
Their gross profits are very high even though they're not making operating profit.
More comments...