Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing

Posted by _____k 5 hours ago

Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing (www.businessinsider.com)

179 points | 237 comments

Rohunyyy 5 hours ago|

Now we are going to get a new profession. Token Engineer! They will be experts on tokenmaxxing! The job growth that the billionaire CEOs promised us from AI is finally here!

fsloth 5 hours ago|

Well, there are already offerings like githits (https://news.ycombinator.com/item?id=46105112) that sort of promise to optimize the bang-per-buck of inference.

yapyap 5 hours ago||

wtv

nekzn 5 hours ago||

It’s funny that “maxxing” entered the common vocabulary.

chihuahua 5 hours ago||

If you're not tokenmaxxing, you're getting tokenmogged on the AI leaderboard, and your next review ain't gonna be pretty.

internet2000 5 hours ago|||

A good 80% by volume of the modern vernacular is 4chan language that got sanded down.

nekzn 5 hours ago||

Sanding down is how we got goyslop turned into slop.

harvey9 5 hours ago||

Slop is a word in its own right which got the goy prefix later in life.

amirhirsch 5 hours ago||

I like this too. I have been intentionally -maxxingmaxxing to get the meme out there. It's a good canary to sort out who gets the spicy takes from the pedestrians who probably still copy-paste into the ChatGPT web app like a psychopath.

pocksuppet 5 hours ago||

what the fuck is this timeline I am stuck living in

gigatexal 5 hours ago||

I find it useful enough that if they cut the use altogether I will pay for it out of pocket.

dghlsakjg 5 hours ago||

Would you decide its usefulness based on how high the bill is, or how many things you get done while using it?

The former is the issue, and it is how many companies have been operating. It's like a trucking company ranking driver effectiveness by fuel used instead of by cargo moved.

sottol 5 hours ago|||

Maybe that's the plan :)

But on a more serious note, do we know how much Uber spent per technical employee/month? I assume it is far more than even any of those $200 "max ai" plans.

And the other question is how much the public would be willing to spend. In my estimation, this is as "cheap" as it will ever get (mainstream at least).

KronisLV 5 hours ago|||

> I assume it is far more than even any of those $200 "max ai" plans.

Am in a random small company, colleague spent 100 EUR a day on Sonnet through AWS Bedrock (needed to use an EU region). Paying for tokens will get you in a deep hole financially compared to any of the subscriptions, unless it's like DeepSeek or one of the other models that are priced a bit better, though that's also a tradeoff in what they can/cannot do and also where the data goes. Ended up trying out the Mistral subscription for the US stuff btw, it was fine.
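
As a rough illustration of the gap described above, here is a minimal sketch of the arithmetic. It assumes 21 working days per month and treats EUR and USD as roughly interchangeable for a ballpark figure; neither assumption comes from the thread, and the function names are made up for this example.

```python
# Rough monthly comparison of pay-per-token spend vs. a flat subscription.
# Assumptions (not from the thread): 21 working days per month, and
# currencies treated as roughly interchangeable for a ballpark figure.

WORKING_DAYS_PER_MONTH = 21  # assumed


def monthly_api_cost(daily_spend: float, days: int = WORKING_DAYS_PER_MONTH) -> float:
    """Monthly cost if the daily pay-per-token spend is sustained."""
    return daily_spend * days


def cost_ratio(daily_spend: float, subscription_price: float,
               days: int = WORKING_DAYS_PER_MONTH) -> float:
    """How many flat subscriptions the same monthly API spend would buy."""
    return monthly_api_cost(daily_spend, days) / subscription_price


if __name__ == "__main__":
    api = monthly_api_cost(100)   # the 100-per-day figure mentioned above
    ratio = cost_ratio(100, 200)  # vs. the 200-per-month plan quoted above
    print(f"API: ~{api:.0f}/month, about {ratio:.1f}x the flat plan")
```

Under those assumptions, a sustained 100 a day comes to about ten times the price of a 200-a-month plan, though the two are not like for like, since subscriptions typically come with usage limits.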

Marciplan 5 hours ago|||

BigCos don’t get to do the $200 Max plans; they have unlimited plans but get charged like API usage.

sottol 5 hours ago||

Exactly. But I did find an article ([1]) and spend doesn't seem that high per engineer ($150 to $250 per eng) - at least on average, I assume the costs were skyrocketing towards the end.

> Adoption climbed from 32 percent of engineers in February to 84 percent classified as agentic coding users by March. By spring, 95 percent of Uber engineers used artificial intelligence tools monthly, and roughly 70 percent of committed code originated from those tools. About 11 percent of live backend updates were written by agents with no human in the loop, according to Uber's own disclosures.

> The numbers behind the spend are what make the story instructive rather than anecdotal. Monthly cost per engineer ranged from $150 to $250 on average, with power users running between $500 and $2,000.

My guess is that the reason to rethink AI-spend was probably the exponential growth in cost over time, and tokenmaxxing payoff not being immediately obvious as mentioned in the article.

[1] https://www.forbes.com/sites/janakirammsv/2026/05/17/uber-bu...

mattlondon 5 hours ago|||

Long term, I expect each dev gets their own GPU and runs a model locally. Seems like a more sustainable approach, even if a local model is not absolute SOTA.

ianm218 4 hours ago||

GPUs are much more efficient at parallelizing requests for LLMs, so it's going to be much more efficient to host centrally. For big companies, it might make sense to get their own, though.

iwontberude 5 hours ago|||

Except you won’t, because they will threaten to fire you and force you to route all of your AI through a data protection proxy to stop exfiltration by filtering and tracking prompts/response tokens.

egypturnash 5 hours ago||

Uber COO says he just decided to short a bunch of AI company stock.

epolanski 5 hours ago|

Slightly OT, but I really dislike this reddit WSBization of HN.

Adds nothing insightful to these discussions.

cwillu 5 hours ago|||

“Please don't post comments saying that HN is turning into Reddit. It's a semi-noob illusion, as old as the hills.” --hn guidelines (there are links to examples in the original)

noman-land 5 hours ago|||

It's unfortunately the WSBification of the entire society.

hmokiguess 4 hours ago|

Why do we keep doing this? It's the same as measuring by LoC; we know it's not gonna work. Also, see Goodhart's Law [1].

[1] https://en.wikipedia.org/wiki/Goodhart%27s_law