Top
Best
New

Posted by jamdesk 1 day ago

OpenAI unveils its first custom chip, built by Broadcom(techcrunch.com)
Announcement: https://openai.com/index/openai-broadcom-jalapeno-inference-...

https://decrypt.co/371971/openai-broadcom-jalapeno-first-cus...

https://www.cnn.com/2026/06/24/tech/openai-broadcom-jalapeno...

811 points | 461 commentspage 6
m3kw9 1 day ago|
They tested on spark model, i bet it's a mix of that with focus on inference speed. Whatever it is, hopefully it shows up with current models as faster. Token/s is as big thing as anything else, and thats where they can really gain some edge over the competition.
tehjoker 1 day ago||
No information on how significant the reduction in energy per token is. No information on amortized price per request. Increasingly its clear OpenAI must demonstrate order of magnitude reductions in cost to not die, this is investor story time without that information.
rvz 1 day ago||
No surprise here. [0]

[0] https://news.ycombinator.com/item?id=45429514

mdp2021 1 day ago|
Actually, I find the idea of using Cerebras etc. for /training/ (not just inference) surprising: I did not stumble in much data and discussion about "super-CPUs" in that area, where NVidia (with the tools focused on it) has that long-built edge...

Edit: contextually,

> Jalapeño is specifically designed for inference

Imustaskforhelp 1 day ago||
Although this seems to be for inference itself only and not training but inference is a recurring cost and training is a one time cost and so to me, even if Nvidia still gets moat on training, I don't think that it could ever justify its massive evaluations because for example, some chinese models are actually trained on Non-Nvidia models. The moat in that is incredibly thin.

(at the moment), I think that if I were Nvidia, I would be a bit terrified and I imagine the stock to not be doing super great as I can just imagine everyone online might start talking about it for better or for worse.

I am a bit impressed by OpenAI but is this what can be classified as a plan for OAI to salvage itself and all the commitments it has made nearing a 1.4 Trillion dollars from my memory and this article[0] is from 2025

But could OpenAI simply walk out of its commitments when necessary (for example to Nvidia) if this chip works out or what exactly might happen in the future as these commitments are asked to be paid for, its still smart for OAI to diversify with this chip and to have more deeper ways of revenue than just being a simple middleman but I imagine that Nvidia and others have also invested in OpenAI and they must not be happy with this change.

The thing with AI deals are that they have become so complicated that it is hard for me to find the first order impact of things, let alone second or third order impacts and financial accountability seems to be impacted quite heavily because of all of it and there is some sense that it is done so intentionally.

https://techcrunch.com/2025/11/06/sam-altman-says-openai-has...

wilg 1 day ago||
> significantly better performance-per-watt than current state-of-the-art alternatives

An interesting example of how the current market dynamics incentivize low cost and therefore power efficiency and therefore lowering resource use.

mangomanai 1 day ago||
owow...what gonna be next.....thei own robot????
zuzululu 1 day ago||
im very excited that frontier models now have so much money and revenue they are releasing their own chips that could change the relationships and bottom line
gaigalas 1 day ago||
But nvidia's moat is software support, isn't it?
KeplerBoy 1 day ago|
You don't need a whole lot of software support if you just want to serve a single family of LLMs.
gaigalas 1 day ago||
A lot of companies that serve a single family of LLMs seem to prefer nvidia though. Why is that?

It's not just good drivers, which is what moats them for games and ML. It's a multi-decade work of making chips that are nice to program for and software infrastructure around them.

Apple and Google have excelent chips, yet they needed to invest a lot in long-tail software projects to make those chips do actual premium work. Still not state of the art for serving LLMs (although Google is strong in that, mostly because it piggybacked on previous chip-related software work for phones and so on).

SV_BubbleTime 1 day ago||
> A lot of companies that serve a single family of LLMs seem to prefer nvidia though. Why is that?

If you write your tools for CUDA, you’re going to prefer hardware the runs CUSA.

How is there anything more to it than this?

gaigalas 1 day ago||
Cool. That's it.

What will people use to write for Jalapeño matters.

Nvidia has multi-decade heritage. Apple spent almost a decade in MLX. Snapdragon failed partly here. OpenAI announced nothing regarding to that, so this big moat that multiple companies have (nvidia the most prominent) is nil for them.

hari_vardhan 1 day ago||
hi
xyst 1 day ago|
> built by Broadcom

AI is cooked bro. Broadcom is the death sentence of anything.

More comments...