Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing

Posted by _____k 2 hours ago

Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing(www.businessinsider.com)

119 points | 149 commentspage 2

simonw 2 hours ago|

I'd be interested to know if this is about individual employee AI usage, or use of AI tokens in production features, or both - and assuming both, what the split is.

I can see how Uber could burn unbelievable amounts of tokens if they start running internal features that run a bunch of prompts against every completed ride, or every customer profile, for example.

Or maybe this is about employee usage, but they introduced some stupid "you get evaluated on how many tokens you used" thing a couple of months ago when that was trendy and are just beginning to notice how much that cost?

devin 1 hour ago|

IMO, it's undoubtedly both.

The number of product teams who have shipped expensive-to-operate AI features is wayyyy up there, and for many of the scenarios I've seen, customers simply don't care or are unwilling to pay significant premium for access to it.

At the same time I'm starting to see some direction from people in leadership that I should "use the right model for the job" and things along those lines, which is a very, very different line from what I was hearing 12 months ago.

My continued prediction is that we are going to see a tweak on the SaaS model where the sweet spot moves to metered usage pricing of really fine-grained API-based access for apps which traditionally have been operated solely via the UI. Long term the trend is going to be "we'll house the data, enrich it, maintain it, provide fine-grained API access over it tailored to model usage, and you bring the model" with some services opting to give you the model interaction layer/harness. IOW I don't think SaaS is dead. Far from it. However, I do think that a lot of people are going to be looking to interact with SaaS apps via their own models with APIs that support those use cases better than a lot of those APIs do today.

victor9000 1 hour ago||

Clearly they need more layoffs, and for that matter why keep anyone around? After all, AI will be writing 100% of code in 2026.

nickvec 9 minutes ago||

Surprisingly, Uber hasn’t had a mass layoff since 2020. The company currently has ~34,000 FTEs, which I personally think is insanely bloated for what amounts to a taxi + food delivery app.

killingtime74 30 minutes ago||

By 2025 we will have AGI and software developers don't be necessary. Also next year we will have self driving.

rr808 1 hour ago||

I have Opus 4.7 at work at 15x. Burns through tokens like water. It feels like one of these new mega datacenters is just for me. I'd love to know what the bill is, but we're just encouraged to do as much AI as possible.

bachmeier 1 hour ago||

> Burns through tokens like water.

Pretty sure I know what you're saying, but the visual on this one doesn't match the point you're making.

rr808 56 minutes ago||

lol yeah I'm not a poet.

3eb7988a1663 50 minutes ago||

Just append a reactive metal. "Like water through sodium"

loeg 1 hour ago|||

2^30 tokens costs something like 2^10 dollars, order of magnitude, if that helps ballpark.

tmaly 1 hour ago||

[dead]

cryo32 2 hours ago||

Waiting for tokenedging next.

postsantum 1 hour ago||

^ Philip K. Dick's unreleased book title

SecretDreams 2 hours ago||

Is this when you type the prompt into the text window, but don't hit enter? Make the GPU see the message "x is typing"? Lol.

FartyMcFarter 2 hours ago||

As long as there's an RPC connection established and a partially sent request, I think it would count.

hmokiguess 1 hour ago||

Why do keep doing this? It's the same as measuring by LoC, we know it's not gonna work. Also, see Goodhart's Law[1]

- https://en.wikipedia.org/wiki/Goodhart%27s_law

mustaphah 1 hour ago||

Feels like they are debating internally whether to cut people or AI spending. Very healthy debate. Let's hope they spare people.

InsideOutSanta 1 hour ago||

"He said that, based on talks with Uber's senior engineering leaders, he realized higher token usage did not translate into a proportional increase in useful consumer features."

He's saying that like it's some grand epiphany and not the most self-evident, obvious thing I've heard this month. Some of the literal dumbest people on earth are in charge of these major companies.

amluto 17 minutes ago||

This is also Uber we’re talking about. The company that famously developed a massively engineered ledger to track every event across the entire company, globally consistently, forever, in a single database. This definitely adds enormous value to the bottom line!

Barrin92 45 minutes ago||

>obvious thing I've heard this month

not only this month, but it is the basic statement of the single most well known 50 year old book in software project management lol. At this point we need to wipe the slate clean and start over, the industry is run by illiterates.

JackDanMeier 1 hour ago||

At what point is there a difference between a burn rate and tokenmaxxing? Isn't it the same as during the dotcom bubble?

chihuahua 2 hours ago||

It's amazing that it took months to figure this out. "Well we thought that if engineers are told to maximize costs through AI use, to consume as much as possible of a resource that costs us money, then obviously good things will happen. Imagine my surprise when it didn't turn out that way."

Imagine if engineers were ranked based on their AWS spend. People allocate VMs and fill databases with terabytes of random bits, to get to the top of the AWS leaderboard. If you don't do this, you're ranked at the bottom, and good luck at the next review cycle. Who could have expected that this is not the road to success?

this_user 2 hours ago||

The point of this was always to explore what is possible with AI as quickly as possible. Obviously, there is going to be a lot of waste, but the 5-10% of employees who are truly thinking about it and discovering novel applications are what you are truly after. Because right now, you effectively have a giant, as of yet poorly explored space of potential uses.

Anyone who can find the actually valuable portions of the space early has a potentially huge competitive advantage. Even if the result of the experiment is the negative that AI is actually mostly not that useful, that is still extremely useful information in a time of great uncertainty regarding outcomes.

The bottom line is that this approach may be expensive, but if you have the money to burn, it's far from the worst strategy if you are trying to position yourself correctly for the future.

adrianN 2 hours ago|||

What’s the huge advantage though? Adopting workflows that give big productivity gains is relatively easy even for big corporations. It’s only an advantage if you can keep it secret.

OTOH maybe we’re in for a future of patenting prompts.

uejfiweun 1 hour ago||||

The thing I don't get though, is that most people just don't have that much work they need to do. I can use AI to pretty easily get my work done just via the regular chat interfaces. But because of the tokenmaxxing metrics that leadership tracks, I end up just having the AI deliberate for hours on random things just so that I can boost my token numbers. I think tokenmaxxing for the end goal you described is only realistic when the engineers are truly buried under a backlog of work.

Mtinie 1 hour ago|||

[dead]

davnicwil 1 hour ago|||

I think unfortunately it's not about what seems obvious, or even what seems more likely, but about what seems retrospectively justifiable regardless of outcome.

The incentive structure of this type of decision is 'absolutely under no circumstances existentially mess up'. Ostensibly with respect to the organisation, but in actual reality much more so with respect to the individual(s) involved in the decision.

If everyone else is doing something that kind of obviously makes no sense, and you decide to break from the crowd by instead doing what does make sense, then there's a pretty solid chance of gaining a temporary edge while reality resolves the truth. But those gains probably won't matter all that much for the organisation, or indeed your position within it. It's a solid chance of an unimportant gain.

However on the other hand, there's a tail risk that something very unexpected happens and the thing everyone's doing that makes no sense actually turns out to make sense - sometimes even for entirely unpredictable incidental reasons - and then, well, you're in trouble. Not necessarily 'you' the organisation.. they'll likely be able to catch up and it won't matter that much. But for 'you' personally, the decision maker, it's very much not good.

As a bonus, in the much more likely scenario that the thing that makes no sense turns out to indeed make no sense, you're in the same boat as everyone else, there's no relative loss, and most importantly you don't stick out as someone who did something as risky as to go against the prevailing, albeit pretty clearly nonsensical, sentiment.

So basically, game theory tells you pretty quickly to just go with the thing that makes no sense if you're optimising for some (weighted) cross of what's best for the organisation and yourself as the decision maker.

saghm 2 hours ago|||

Someday maybe Goodhart's Law will be intuitive to people making decisions like this, but not any time soon I guess

dgellow 1 hour ago|||

> It's amazing that it took months to figure this out

We aren’t there yet, so far it is just a COO questioning the investment

roxolotl 2 hours ago|||

The inability of leaders to understand Goodhart’s Law is always a sight to behold. They see a number go up and pat themselves on the back for how well their employees are making it go up without ever wondering if the thing they care about is happening.

solenoid0937 2 hours ago||

You say "amazing that it took months to figure this out" as if the answer to the question is obvious.

But it's not. Some FAANGs are doing amazing things with unlimited tokens. Other companies have no clue what to do with tokens, they've just told their engineers to max them.

It really depends on how you're using the tokens. If you're just using them for Codex and Claude Code - yeah, tokenmaxxing is incredibly dumb.

saghm 1 hour ago|||

In other words, people who are productive get more done when you scale up what they're already doing, and people who aren't productive will not magically become productive when you scale up what they're already doing. That's incredibly obvious, because we've seen how this plays out repeatedly in so many different ways (lines of code, commits, tickets closed, etc.), and it has nothing to do with tokens or even programming, but just how trying to manage people works.

steveBK123 2 hours ago||||

> Some FAANGs are doing amazing things with unlimited tokens. Others have no clue what to do with tokens.

Unlimited tokens is different from “use AI a lot or we will fire you, and we are counting token consumption as usage”. Obviously the latter is stupid and yet it was done in many places.

SpicyLemonZest 1 hour ago|||

I'm not convinced it actually was done in many places, although I understand why in a bad job market people don't trust that it isn't happening in secret. Every time I've heard of a token leaderboard or such it's come with a denial that the company is using it as an employee performance metric.

steveBK123 24 minutes ago||

> it's come with a denial that the company is using it as an employee performance metric.

Surely the anonymous employee feedback polls are totally anonymous too. BigCorp loves you, is family, and would never harm you.

morpheuskafka 2 hours ago||||

> But it's not. Some FAANGs are doing amazing things with unlimited tokens

Giving someone unlimited access to a resources is not the same as directing or incentivizing them to use it for the sake of using it which is what the parent comment criticized.

As for the other FAANGs, Meta and Google have (not good but still) frontier models of their own, so they are very different from a company paying API costs per token.

dgellow 1 hour ago||||

Where can I see those amazing things done by FAANGs?

fsloth 2 hours ago||||

> Some FAANGs are doing amazing things with unlimited tokens.

Would love to know what things!

arkadiytehgraet 42 minutes ago||

OP (solenoid0937) is an unfounded AI-hype peddler and an Anthropic shill (check their comment history), do not expect them to provide an actual example of their wild claims.

SecretDreams 2 hours ago|||

Show me some fang that have made nice outwards facing products through a fully embraced AI workflow?

AI is an accelerator that engineers should know and have access to, but it's not something that should have mandated usage and quotas around. It's also absolutely dangerous for young engineers and the like - it fundamentally denies you of the "learning" aspect. I'm now seeing in interviews young graduates being given AI tasks to complete and they come back with a correct solution and no concept of how it is working.

You learn and reinforce learning by DOING and reading in depth. High level summaries don't teach anything and are the kinds of things only VPs care about. So, unless the intention in the future is for everyone to be a VP using AI to do the work, we need some middle ground here and some real thought around implementation of these tools or there's going to be a generational canyon gap of knowledge between being able to "say" and being able to "do".

rcvassallo83 1 hour ago|

Oof leader of bubble are starting to take a step back?

More comments...