Nvidia is proposing a beast of a CPU system for Windows PCs

Posted by tosh 4 hours ago

Nvidia is proposing a beast of a CPU system for Windows PCs(twitter.com)

88 points | 218 comments

stego-tech 36 minutes ago|

The Unified Memory pool is what will continue to be the “game changer” in systems architecture, especially outside of data centers.

The reality is even cutting edge games and consumer workloads don’t actually take full use of the PCIe bandwidth of the GPU or the bandwidth of its GDDR memory. Even local AI use cases don’t substantially or meaningfully benefit from faster memory, at least to average consumers.

A unified memory pool does two things:

1) Lets systems optimize utilization based on need, rather than be confined to specific pools

2) Reduce overall memory cost, by letting system builders purchase a single type of memory in bulk instead of having to figure out GDDR vs DDR memory placement (important for SFF/portable machines)

So at a time when memory is expensive, unified pools make more sense. Even when memory becomes cheap and plentiful again, it’s just practical at this point to allocate a larger overall pool instead of managing discrete sets.

The one big drawback is security. A shared memory pool means side-channel attacks against memory from the GPU or CPU could potentially compromise the other as well, meaning memory-safe designs are going to be critical to security going forward (which is good for Rust adherents, I figure).

Asmod4n 30 minutes ago||

yeah, you only see double digits in performance degradation from going from pcie 5 to 3 with a 5090 (at x16 speed), with everything else its like in the single digits area.

stego-tech 20 minutes ago||

And the thing we gamers forget is that we’re the outlier. We’re the edge case.

Most consumers will never really care about, let alone see, the difference in PCIe or memory bandwidth impacts from such a shift to unified memory pools. We might (being, at least in my case, a huge nerd), but I’m increasingly of the opinion that if modern blockbuster games are built for upscaling/reconstruction anyhow, then suddenly such sacrifices to performance seem acceptable relative to the gains in efficiency.

jmyeet 21 minutes ago|||

Unified memory is only a feature because NVidia so aggressively uses VRAM for market segmentation.

The 5090 ($2k MSRP but realistically $3-3.5k) is almost the same as the RTX 6000 Pro (~$10k). Same memory bandwidth (1800GB/s). Slightly different CUDA cores (21k vs 24k). Big difference? VRAM (32GB vs 96GB).

NVidia ultimately doesn't want to upset this segmentation so the RTX Spark will never undermine their other offerings. This is why I think Apple has a real market opportunity if they choose to embrace it.

zozbot234 14 minutes ago||

Even low-VRAM cards are actually very useful for running the comparatively smaller dense layers in large local MoE models. This only requires transfering very small amounts of data across the PCIe bus (similar to pipeline parallelism) so it fits nicely around the existing bottlenecks on that hardware.

vlovich123 29 minutes ago||

> (which is good for Rust adherents, I figure).

As a Rust adherent, please do not put words in our mouths or set up unrealistic expectations for other people by linking together concepts at a very shallow level.

Language level memory safety has no answer for hardware security flaws which is what side channel attacks are. No programming language can provide memory privacy if another chip in your machine can read your memory. Just like no programming language can protect your application from a kernel vulnerability of the kernel it’s running on.

stego-tech 23 minutes ago|||

Damn. That wasn’t my intention at all, I was just pointing out that Rust has another reason to see wider adoption vis a vis the usual Valley advertising bullshit of deliberately conflating hardware security with software security. I personally give no fucks what something is written in, only that it’s written well enough that I don’t have to twist arms or babysit yet another sloppy piece of code in my enterprise.

b112 22 minutes ago|||

But... it's rust.

modeless 1 hour ago||

The Qualcomm Snapdragon X2 Elite Extreme trounces Nvidia's chip in single core CPU performance. It beats Intel and AMD's best, too. It has unified memory. It's the only CPU in the same league as Apple's M-series in both CPU performance and power efficiency. And it's available in laptops today, not later this year. People are sleeping on Qualcomm.

ksec 30 minutes ago||

It trounces ARM's old CPU design. The X925 used in this Nvidia chip is 2 years old. X930 or C1 has shipped with Mediatek Dimensity 9500 which is what the Snapdragon 8 Elite Gen 5 / X2 Elite should be compared to. Although Qualcomm still has a lead in performance, but it is increasingly shrinking.

But perhaps more importantly. Nvidia seems to be doing a lot better with its ecosystem. Nvidia has much better distribution channels and partners building on top of their PC Gaming GPU. It also have gaming developers relations that is unmatched by any in the industry.

Qualcomm has so far failed to execute this, both in PC and on there Server CPU side.

Danox 1 hour ago|||

Microsoft is sleeping on Qualcomm with their lousy port of Windows to Arm processors…

criticalfault 1 hour ago|||

and is Qualcomm is sleeping on Linux?

embedding-shape 1 hour ago|||

Seems like not? Judging based on https://github.com/qualcomm-linux something is happening, although I can't say how much. They definitively seem awake at least.

Elixir6419 14 minutes ago|||

one of the biggest issue i see is the devicetree nonsense. It makes every single laptop and bios version very unique and requires a lot of housekeeping. There are also big chunks of work (as i understand it) to be done around hibernate and decent suspend support.

My experience (wanted to use x13s as daily sriver) is that there was good progress for about a year, until jhovold was leading the charge, but something expired and qualcom as far as i can tell forgot that some progress should happen on x1 and x8c as well as x2.

jeroenhd 1 hour ago||||

The problem with these chips on Linux is that something has been happening for months but you still end up needing to download special editions of ARM Linux images to get these devices to work properly.

Some distros still need extracting Qualcomm firmware from Windows to get Linux to work properly. Audio remains a challenge, like x86 Linux decades ago. Apparently camera stuff works these days but produces images of subpar quality.

These issues also occur on normal Linux. My experience with my Lenovo+Intel laptop was that it took three months after release for the firmware to work properly (and the Nvidia drivers took much longer, but that's my fault for buying something containing Nvidia hardware). Intel managed to do what Qualcomm did in months rather than years.

I hope Qualcomm finally sorts this shit out, I really do, but with the prices of computers these days, I'm going to need to see quite the discount before I'll consider buying anything with a Snapdragon.

justincormack 58 minutes ago|||

They run a hypervisor under the OS, and dont support actually running directly on the hardware, its very odd.

stefan_ 58 minutes ago|||

Yes, Ubuntu on the previous gen Snapdragon X is still trash.

reactordev 1 hour ago|||

10000000x this. They have been sleeping on Arm since windows phone. I just don’t see them ever having an original thought again.

They could have had a 128core arm chip by now.

adabyron 48 minutes ago|||

They have original thoughts! It's just that those employees get squashed by other divisions or having to meet short term quarterly profits it seems.

There's also the whole giant trillion dollar company doesn't want to invest and let small ideas grow. They only focus on things that move the needle, which isn't much at the size.

Had Microsoft executed and invested, they could have made a come back imo in both search, mobile & hardware. Unfortunately major lack of leadership or they just don't want those areas.

reactordev 21 minutes ago||

No doubt the individuals have thoughts. It’s just corporate never actually does ANY of them. It’s a MBA mill.

dylan604 40 minutes ago|||

Unless the chip was called Copilot, they are not thinking anything about it. If was called Copilot, they'd have already figured out how to shove it down your throat.

alt227 22 minutes ago|||

Why do people care so much about single core performance? We are all professionals here and I bet most of our workloads are multi core. I get that these new arm chips from Apple and Qualcomm are great at one thing at a time, but for professional workloads high end x64 chips still cannot be beaten on the desktop.

darkwater 1 hour ago|||

Is it well supported under Linux?

diabllicseagull 56 minutes ago|||

I've been keeping an eye on the state of Linux on the first gen of X Elite and it's sad that the potential is not fully materialized outside WoA. Take a look at what peeps are going through:

https://discourse.ubuntu.com/t/ubuntu-concept-snapdragon-x-e...

modeless 1 hour ago||||

Qualcomm has been upstreaming Linux support for some of their chips but they're not working fast enough and I don't think the latest chips are there yet unfortunately.

izacus 40 minutes ago|||

No, not at all, those machines are currently unusable on Linux.

bradfa 1 hour ago|||

Qualcomm is a “fool me once, shame on you, fool me twice, you don’t fool me twice” kind of situation. So many horrible experiences in the past that people are going to be hesitant.

Qualcomm are trying harder now it seems. But it will take time to repair their reputation in the PC market.

thewebguyd 1 hour ago|||

They burned me with the first gen Snapdragon X Elite. Before the various laptops with it were out they promised Linux support. Here we are, years alter, still no fully OOTB support. Ironically, the GPU firmware were just mainlined in the kernel 4 months ago, but they still haven't done the same for the 1st gen X elite.

Tuxedo computers tried and didn't succeed either.

I will never buy Qualcomm again. I avoid them on phones as well by just buying Apple. They do not support their hardware beyond the release.

jeroenhd 57 minutes ago||

> I avoid them on phones as well by just buying Apple

To each their own, but I don't recall Apple ever mainlining any of their drivers on Linux. You're rightfully angry on the laptop side of things, but Apple is much worse than Qualcomm when it comes to open source support for their phones.

Qualcomm probably shouldn't have promised Linux support in the first place. Everyone seems to love Apple's hardware even though you're practically stuck with macOS. Had Qualcomm just stuck to Windows-only, they would've probably received a much better reception by the tech press.

mlinhares 51 minutes ago|||

Apple doesn’t sell general purpose computers outside of their own hardware so this doesn’t make any sense.

dismalaf 54 minutes ago|||

At least Apple tells you they don't support anything except their own OS, Qualcomm just pretends to offer support.

derefr 1 hour ago||||

Can you say more? I don't have any memory of Qualcomm-related scandals(?), but I just read the news; I've never really been a user of their chips.

re-thc 24 minutes ago|||

> Qualcomm are trying harder now it seems.

Not really, the 1st. iteration got stuck in legal land and other delays.

bradfa 14 minutes ago||

They hired a good number of smart people who know how to do open source. So they’re trying. We shall see if it works.

dismalaf 1 hour ago|||

Too bad Qualcomm provides shit drivers for Linux, never updates any of their drivers (had a Samsung/Qualcomm phone with drivers years behind the equivalent Google Pixel phone), etc... They are the absolute worst actor in the entire computing world, don't care how fast their chip is.

SecretDreams 46 minutes ago|||

People aren't sleeping on Qualcomm, they're tired of Microsoft Windows as a janky ass OS.

re-thc 23 minutes ago|||

> People are sleeping on Qualcomm.

Technically speaking, Qualcomm acquired Nuvia, which is where this came from and that company came from ex-Apple engineers wanting to do what Apple said no for their chips.

So it's almost same CPU design (origins).

Innittech 47 minutes ago|||

[dead]

captainregex 1 hour ago||

[flagged]

mauricioc 37 minutes ago||

Do you have a source for this? If not, please stop breaking HN guidelines [0]: "Please don't post insinuations about astroturfing, shilling, brigading, foreign agents, and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data."

[0] https://news.ycombinator.com/newsguidelines.html

infecto 4 hours ago||

"I am not sure how many people will run AI models locally. It still seems like a niche application to me. However, it will make decent machines to play video games."

I don't know who will be the winner but with some of the recent releases from gemma it seems more probable that you may run some models locally if only from a cost perspective, not even considering business security. Not sure how this type of architecture would make for good gaming though, puts into question the whole statement.

"Ranked in the top 2% of scientists globally (Stanford/Elsevier 2025) and among GitHub's top 1000 developers" - side note but this guy puts this everywhere, gives me probably the inverse of what he is marketing for.

root-parent 3 hours ago||

"I am not sure how many people will run AI models locally. It still seems like a niche application to me. However, it will make decent machines to play video games..."

This is the 2026 edition of Ken Olsen: "There is no reason anyone would want a computer in their home"

throw0101a 2 hours ago|||

> This is the 2026 edition of Ken Olsen: "There is no reason anyone would want a computer in their home"

Digging into this:

> In conclusion, there is evidence that Ken Olsen did doubt the need for computers in the home, but the evidence is based primarily on the testimony of David Ahl who was perturbed when the personal computer project he championed at DEC was not supported by Olsen in 1974.

> Olsen’s resistance may have been similar to that expressed by another DEC executive, Gordon Bell. In 1980 Bell thought home terminals would act as gateways to remote computers which would provide appropriate services.

* https://quoteinvestigator.com/2017/09/14/home-computer/

It was supposedly said in 1977: most computers at that time were not small, and so it would not be surprising that people would not expect the general public to desire a large, power-hungry, noise-y apparatus in their house.

wccrawford 25 minutes ago|||

That's exactly the point. Until recently, AI models that could run on home machines were so bad that it was very hard to imagine anyone wanting to.

And, like the overly large machines of 1977, models are getting faster, leaner, and better. It's happening a lot quicker, though.

kristov 53 minutes ago||||

We kinda ended up with terminals connected to mainframes anyway. The terminal being the web browser, and the mainframe being SaS. So it wasn't that far off.

supermatt 6 minutes ago||

the network is the computer

wslh 1 hour ago||||

The simple explanation is that predicting the future is generally impossible. It doesn't matter if it's Olsen or anybody else.

parineum 2 hours ago|||

It doesn't really need this much explanation.

People take these quotes out of context all the time. Said in a business context, there was no need, at that time, for someone to have a personal computer.

There's no business justification in 1977 for a personal computer department at a business. It's similar to the gates quote about RAM (I think it was 64KB?).

These statements aren't meant to be forever quotes. Their business plan quotes.

michaelcampbell 1 hour ago|||

> It's similar to the gates quote about RAM (I think it was 64KB?)

640, and Bill Gates said he either never said that, or at least never remembered having said it. I think there is no evidence anywhere that he did.

https://www.computerworld.com/article/1563853/the-640k-quote...

glimshe 1 hour ago|||

Or maybe he simply made a mistake. Big deal. This doesn't speak negatively of his other achievements.

shermantanktop 1 hour ago||

He had a long career and presumably many successes, and is fallible like the rest of us. But a half-remembered zinger with no context makes for zippier posts I guess.

The early popularity of Minitel, the continued popularity of ssh/tmux, and the web browser itself indicates that bespoke client applications are not the only way. He wasn’t directionally wrong.

joering2 1 hour ago||||

or "640K ought to be enough for anybody."

DonHopkins 1 minute ago|||

[delayed]

shermantanktop 1 hour ago||||

https://quoteinvestigator.com/2011/09/08/640k-enough/

Nobody ever said that, at least not as an assertion or prediction. The actual instances of similar language are from multiple people describing their earlier thoughts before they learned it wasn’t true.

throw1234567891 1 hour ago|||

There’s no public proof this has ever been said, and if it was, if it was not taken out of context.

AaronAPU 3 hours ago||||

That’s too strong of an assertion.

Local models aren’t deterministically equivalent in capabilities to foundation models. Home computers are turing complete; just like a mainframe. They are just slower. Often not slower enough to matter.

sandworm101 2 hours ago||

Most people are ok with slower. An AI that lets you edit a family picture, in say 30 seconds, locally is preferable to one that is instantaneous but requires you to submit that picture to examination/storage/training/sale in someone else's AI ecosystem. If i want to crop my ex out of family photos, i should not have to first give that photo to Microsoft. If want an LLM to write a book report for me, i dont want it also alerting my school. And if i write a memo for a client, and i want an LLM to check the spelling, i dont want that memo leaked either.

parineum 2 hours ago|||

> Most people are ok with slower. An AI that lets you edit a family picture, in say 30 seconds, locally is preferable to one that is instantaneous but requires you to submit that picture to examination/storage/training/sale in someone else's AI ecosystem.

Maybe if you ask them that question, but if you show them two products, they'll definitely prefer the faster one. 30 seconds is a long time to watch a progress bar.

sandworm101 1 minute ago|||

Fast and public, or slow and private. Not everyone wants, or is allowed to, share thier data with the AI world.

spwa4 1 hour ago|||

Plus there's the other question. If this thing is slower ... what's the price? The desktop/mini-pc version of this is $3000, after all. At this performance level what is an acceptable price for the laptops?

People definitely aren't going to accept more expensive + slower ...

Pxtl 1 hour ago|||

I'd like to think so but the existence of Google and Apple and Microsoft's cloud based photo tools with phone integration suggests that's false.

You could run a pretty good home server on $50 of gear and yet we never saw any real adoption of OwnCloud/NextCloud style products as an alternative to Google Drive/Photos or Apple Cloud.

Why should LLM/Transformers be any different? Especially when you need a proper expensive GPU to run them instead of a Raspberry Pi?

com2kid 3 minutes ago|||

After the latest round of cloud storage price increases my non technical wife has been asking if we can do local backups instead...

thewebguyd 1 hour ago|||

Apple's photo tools run on device, and they'll probably ship more on device foundation models at WWDC too.

On-device AI is going to be important, I think. It doesn't have to take the form of a chatbot UI to be useful.

fg137 1 hour ago|||

You seriously think running LLM is the same thing as general computing?

ako 10 minutes ago||

It’s better, it’s useful even for those who don’t have a deep knowledge of computers. I’d expect more AI users than programmers, than ms-word users, than excel users.

jb1991 1 hour ago|||

He’s just a braggart. When you see something like this in somebody’s personal bio on social media, it’s basically a banner that means “take everything I say in the context of me promoting myself.”

smcleod 1 hour ago|||

Qwen 3.6 is far ahead of Gemma for most (but not all) things. I've deployed it out across a number of M5 MacBooks and it's genuinely useful for many tasks. It won't replace an Opus or current gen Sonnet sized model but it's still amazingly good for its size and probably as good as or just a bit before Sonnet 4 era. Far more reliable for tool calling, coding, agentic tasks and faster than the Gemma models especially with MTP.

zozbot234 45 minutes ago|||

Qwen 3.6 is a toy compared to DeepSeek V4 Flash or Pro. These models can now run on Apple Silicon hardware with as little as 32GB RAM for the Flash (with 2-bit quant, which is still quite capable) using SSD offloading, with just-about-reasonable performance for interactive use, and far better performance on longer contexts than Qwen (due to the more efficient KV cache/attention mechanisms in DeepSeek).

Very significant improvements may be viable for unattended inference via large-scale batches, which can reuse sparse experts and thereby mask some of the latency involved - this is quite unique to DeepSeek, again due to its efficient KV cache.

greenavocado 31 minutes ago||

Qwen 3.6 27B still curb stomps Deepseek V4 in coding

Pxtl 1 hour ago|||

I've got a Qwen 3.5 running on a 12GB 3060 and it's dumb as a stump but still smart enough to get some useful work done. Since it's my daily driver desktop I havent jumped to 3.6 since last time I did I quickly ran out of vram and locked the desktop environment.

But yeah, the Qwen line is pretty impressive on commodity hardware.

derefr 1 hour ago||

I must be using LLMs very differently than y'all, because I can't think of a single thing I would rely on an LLM that's "dumb as a stump" to do for me.

To me, LLMs are for asking research questions + exploring design spaces + pointing at codebases to investigate bugs. And those all benefit from the model being as "smart" (in terms of both fluid intelligence and burned-in knowledge) as possible.

I'm guessing there exist problems where "intelligence past a certain point" doesn't matter, so these medium-sized models can match the performance of the bigger models. But what problems might those be?

flatline 1 hour ago|||

The HN crowd is, by and large, not the target audience for his self promotion. I guarantee there is one and this is more or less effective.

falsemyrmidon 1 hour ago|||

> this guy puts this everywhere, gives me probably the inverse of what he is marketing for.

Do you think he's in mensa too?

unmole 3 hours ago|||

> you may run some models locally if only from a cost perspective

I have a hard time believing running a model on a laptop will be cheaper than running it in a datacenter. Why wouldn't economies of scale apply here as with every other computation?

zozbot234 39 minutes ago|||

The datacenter setting has huge economies of scale for low-latency, just-in-time inference using extremely large models, but that's not the only viable use of AI. Batched, unattended inference of possibly smaller and weaker models, while theoretically viable in a datacenter setting, is far from the best use of that hardware. This is where local AI is at its best.

itishappy 51 minutes ago||||

It's cheaper for the AI provider to use your laptop instead of their datacenter.

dgellow 3 hours ago||||

A laptop is really a pretty bad form factor to run LLMs. Worst cooling, more expensive memory that you cannot replace, resell value depreciating fast. It’s fine for tinkering, small scale research, and demos but it’s definitely niche.

The vision NVIDIA is selling is pure marketing IMHO

wazdra 3 hours ago||||

This is assuming that you'll be priced the fraction of computing that you consumed. But you are actually paying for their infrastructure, for the R&D (and also the computation that went into training the model) etc. It is not clear that, for your own small computations, this kind of costs are needed, but you will still pay your share in the investment the provider made so that they could serve everyone's computation needs.

hungryhobbit 1 hour ago|||

But, currently ... you're not. AI companies are operating at a loss, and are being subsidized by their investors.

Local may or may not be cheaper than remote now, depending on the details, but the factors you describe won't affect the math nearly as much as they will once that subsidization ends.

wjnc 2 hours ago|||

In that analogy bigtech AI is currently investing in cleaner air for all of us? We _could_ breath it through their hose, but might as well breath it outside.

jerf 1 hour ago||||

What "every other computation"? I seem to have a lot processing power at my disposal here, between my cell phones, laptops, gaming PCs, various other hardware devices.

You're going to need to analyze the problem much more deeply because it sound like the standards you are implicitly applying would result in "economically, everything should be centrally hosted" but that is clearly not the result that obtains. Even a modern mid-grade cell phone is no slouch; you may not be running a current-gen frontier AI on it but you certainly can do a lot of other rather intense things locally that would have been laughable 10 years ago, like suprisingly high powered games.

TylerE 1 hour ago|||

Because economy of scale isn't really the right metric here. A machine you were you were going to buy anyway essentially has a TCO of $0.

bespokedevelopr 1 hour ago|||

The security aspect is the main driver why I’m seeing so many businesses investing in local hardware. They know the models aren’t as good (caveat that they also can’t run Chinese models) and that’s ok. Places that really care about security and data governance already aren’t on the bleeding edge. They wait for the nice stable lts version, they lock down dev machines in frustrating ways and have lots of IT admin layers.

But they also want to taste the sweet fruit of AI so the only way to do this that a CISO will approve is on local air gapped hardware. It’s a niche but still a billion dollar niche.

thewebguyd 1 hour ago||

Microsoft is working on this with their new execution containers (https://github.com/microsoft/mxc)

GeekyBear 56 minutes ago|||

> However, it will make decent machines to play video games."

Where you will need games to be rewritten for ARM to get full performance, just like on Apple's M series chips.

jayd16 1 hour ago|||

Maybe they just mean from a "it can run a lot of DLSS" perspective.

cyanydeez 1 hour ago|||

128GB seems the sweet spot for local models. I can program and install most GitHub projects with opencode and QWEN 32b with mtp.

anyone whose addicted to token theoughput is losing the operational knowledge and offline capabilities.

if you arent moving to the AMD 395 or MACs then youre hitching aride on the expensive calory ride

throw1234567891 1 hour ago||

If you could buy a 256GB you’d be claiming that 256GB is a sweet spot. But I agree with you. Crack-tokens are not the future.

cyanydeez 49 minutes ago||

no, the fact that MACs and x86 and soon ARM are all going to have 128GB models in every sector, yeah, sure.

But watching everyone flounder because claude goes down or forcing you on API costs.

I'm programming things that'd take me days with a PC that, without OpenAI's VRAM shenagans, would cost you $2k.

It's more than just 'this is what I could do' it's definitely about 'this is what anyone could do with a new PC purchase'.

throw1234567891 39 minutes ago||

You must be unaware that System76 was already selling 192GB machines, mac studios used to be 512GB max. The only reason why we don’t have them anymore is that we are in RAM shortage.

cyanydeez 7 minutes ago||

I'm aware you can have more. the term "SWEET SPOT" references a area that anyone/everyone can get to and isn't some magical expensive unicorn.

You're doing what the IT industry has been addicted to for decades: number goes up.

voidfunc 2 hours ago|||

> "Ranked in the top 2% of scientists globally (Stanford/Elsevier 2025) and among GitHub's top 1000 developers"

This made me laugh. I can only image how insufferable this person is to deal with.

unstatusthequo 1 hour ago|||

I hope a family-level AI appliance is a thing later. Local non-cloud assistant that lives in the house, families interact via voice or phones or whatever. Knows the contextual family stuff you need, etc.

Pxtl 1 hour ago||

We didn't get people buying family-level file servers for the family photo gallery and documents at any real scale, so i doubt we'll see similar for AI especially when the cost is that much higher for GPUs vs an SBC machine.

sandworm101 3 hours ago|||

Lots of people are already running AI locally. They are the people buying up all the consumer-grade nvidea gpus. What are they doing with them? Well, the same things people with home media or email servers are doing: stuff they dont want to share with the general public.

Zetaphor 3 hours ago||

I want to reduce my dependency on companies like Google, OpenAI, and Anthropic. Aside from the concerns of data sharing I'm also not a fan of how they run their operations, for example Anthropic now using xAI's Colossus data center which is poisoning a marginalized community, or OpenAI getting in bed with the military.

Not everything I want to use an LLM for requires "PhD level intelligence", and increasingly I'm finding more uses that involve sharing my personal data.

Yesterday my local model helped me when looking for a doctor who is in-network for my insurance. I threw it a screenshot from the providers search results and it looked up reviews for all of them.

pratnala 2 hours ago|||

Which model are you running?

Zetaphor 1 hour ago||

Qwen 3.6 35B-A3B and 27B both at Q8 on a Strix Halo machine

sandworm101 2 hours ago|||

My local AI is currently upscaling an old british comedy from sub-dvd quality to 1k. (It is not availible other than on DVD.) It looks like it will take about a week for my pair of 5060s to chew through the task.

eszed 1 hour ago||

Which show?

sandworm101 37 minutes ago||

Chelmsford 123

I own the DVDs so I'm OK upscaling/editing my own copies for my own use. But if I ran the task on an ai service I would no doubt trigger copyright issues.

SwtCyber 3 hours ago|||

I think the local-model use case is going to become less niche pretty quickly if the models keep getting smaller and more capable. Even if most people do not care about privacy or offline use, the cost argument is pretty strong

iLoveOncall 4 hours ago||

> "Ranked in the top 2% of scientists globally (Stanford/Elsevier 2025) and among GitHub's top 1000 developers" - side note but this guy puts this everywhere, gives me probably the inverse of what he is marketing for.

Lol yeah seriously, that stinks "I ask AI to generate a huge amount of bullshit and upload it to pad irrelevant stats".

Absolute loser.

nkurz 3 hours ago|||

I agree that it sends the wrong symbol, but actually Daniel is great. He cares tremendously about doing work that is actually real-world useful. I've co-written a few papers with him, and he's really hard working and open to outside suggestions. The danger is that if you send him comments, he'll eventually manage to rope you into writing a new and improved version. Seriously, if you are a non-academic computer scientist with a good idea that you want to publish, he'd be incredibly open to working with you.

As to why he now has this on his blog? I also cringe when I read it. I presume someone told him he should self-promote more, and this is his lame attempt to do so. He's almost certainly the most cited person in his department, but it's entirely possible that none of his colleagues actually know this. Cut him some slack. Self-promotion is not his strength. He's a nerd's nerd, and not a marketer. I'll mention to him that his attempt here might be backfiring when I'm next in contact with him.

hgoel 1 hour ago|||

I kind of get it in the sense that every academic has to make themselves somewhat comfortable with self-promotion even if they don't like it. It's an important part of getting funding, but putting a blurb like that everywhere just hurts his credibility I think.

infecto 1 hour ago||||

I cringe calling it out but it just stood out as it was plastered everywhere and I actually have never seen his links before.

iLoveOncall 2 hours ago|||

> As to why he now has this on his blog?

He doesn't just have it on his blog, he has it EVERYWHERE. Sometimes 2 or 3 times on the same page.

dgacmu 54 minutes ago||||

He's not a loser; he's done some really fun work that many people use daily. I've used his range mapping trick in multiple projects/papers. It's elegant.

It sounds like he's gotten bad advise about how to market himself /or/ this is being marketed to people who have bigger checks to write and whom he believes will be responsive to this kind of marketing. As an academic, it rubs me very wrong - I think it's detrimental to the field when we get into h-index stacking contests or citation count comparisons. But I don't know what incentives he's responding to, which seems important for putting this stuff in context.

(as an aside, it turns out that polars + fastexcel is about 10x faster than pandas + openpyxl for searching that dataset, if anyone else is curious what he was actually talking about. :)

netsharc 3 hours ago||||

I found his website, https://www.lemire.me/en/ , and the "2%" brag is the very first sentence, geez.

Being the top x% is what OnlyFans girls brag about, professor...

And it's not exactly brain surgery, is it? https://www.youtube.com/watch?v=THNPmhBl-8I

Zetaphor 3 hours ago||

> Daniel Lemire’s blog is one of the top 50 most popular blogs on Hacker News, the standard tech news aggregation site.

Citation needed

nkurz 3 hours ago||

https://refactoringenglish.com/tools/hn-popularity/

thg 1 hour ago||

For posterity: It's rank 34 at the time of this comment

SkiFire13 1 hour ago||||

That lines looks very cringe indeed, but the guy has some crazy good blogposts on SIMD stuff.

dagmx 3 hours ago||

This feels fluff to me on the part of the author (whose work I don’t want to trivialize) but I don’t think they’ve actually looked deeper than a paper spec sheet on this.

1. Yes it has the same number of cores as a 5070 mobile. It’s also running at a shared peak of 2/3 the bandwidth and a shared peak of 2/3 the TDP. The GPU by itself will likely perform at half the dedicated units performance

2. Apple may not have SVE2 but they do have the AMX (private) and SME. I don’t see why he thinks the SVE2 will give him more performance than the SME.

3. He mentions a single core type but doesn’t mention the total makeup. We already have known for a year how the DGX Spark compares to Apple chips. For CPU it’s roughly equivalent to an M3 Pro and for GPU compute (not rasterization) it’s between an M4 Pro and M4 Max without considering bandwidth.

The real advantage to these is that they run CUDA. That’s it. Otherwise when they launch they’ll be 2-3 generations behind where Apple is and 1 gen behind AMD.

The other super power of the DGX Spark was the NIC for pairing them together. But that’s been removed here too.

storus 51 minutes ago||

> GPU compute (not rasterization) it’s between an M4 Pro and M4 Max without considering bandwidth

You are likely thinking about token generation which is dependent on memory bandwidth where Apple has an edge. Spark's GPU compute is way higher than even M5 Max (17 FP32 TFlops), around 2x FP32 TFlops... It's literally 6144 CUDA cores like desktop 5070, slowed down by slow memory and lower TDP (29.7 vs 31 FP32 TFlops on 5070).

wmf 26 minutes ago|||

Lemire is very narrowly interested in CPU SIMD so within that niche it may be interesting. As you said, overall the Spark is good but not great.

llm_nerd 1 hour ago||

It is absolutely fluff, and the only reason this worthless tweet is on the front page of HN is that this audience has a habit of canonizing certain people, and treating each of their bowel movements as prophetic.

Guy suddenly became aware of a chip that the rest of the industry long knew about, seems completely unaware of the competitors, and posts about how it's a BEAST and will be a GAME CHANGER.

Like the DGX Spark was a game changer? Eh, it has mostly been a massive disappointment. An overpriced nvidia laptop isn't going to change the equation an iota.

proxysna 3 minutes ago||

It's just a DGX spark with faster memory and a windows boot?

ftchd 6 minutes ago||

https://xcancel.com/lemire/status/2062880075117113739

epolanski 1 minute ago||

Not gonna lie, I'm buying one of the 128GB ram ones for local inference if price is human.

mariopt 31 minutes ago||

I think most people are not understanding what this kind of laptop will provide.

Before we get local AI, we'll be using hybrid AI.

Running big models locally is unrealistic ($$$$$) but, if you imagine an Agentic Workflow where some bits run on the cloud and other smaller tasks locally, it's an amazing deal. You don't need Opus/Code/DeepSeek/Kimi/etc to do basic stuff that models like Gemma4:12b/Qwen-27b can do locally with much less latency.

Having a laptop where I can use a remote big model and combine it with 5 local domain specific models, is something I would love to do today. Imagine using OpenCode and you've a small model deciding which tasks run locally, then decides if you've a good local model for XYZ task or if we use a cloud model.

My main concern is: Is this hardware powerfull enough to allow local quick models switch? Unlikely but I hope I'm wrong

GuestFAUniverse 2 hours ago||

And who in 2026 is still anal-fixated on a "Windows" PC?

It's just a personal computer. It normally runs multiple operating systems just fine.

Windows PC sounds like people talking about tech who are either payed by M$, or embed pictures into Word documents to send them.

Nobody has to kill the fun those OS agnostic machine allow, by artificially bind them to a shitty OS.

zdragnar 1 hour ago||

Enterprise, of course. They probably buy more PCs than the rest of the market combined.

Even for personal use, I'd imagine the amount of people dual booting Windows and something else are a very tiny minority.

Saying "Windows PC" is a pretty reasonable way to distinguish between "made by Apple" and "made by someone else" because the market of PCs that aren't made by Apple and don't come with Windows is really, really tiny.

To be honest, this seems like a strange hill to take such an aggressive stance upon.

jeroenhd 48 minutes ago|||

Hopefully anyone who wants to run anything other than Windows on an Nvidia-produced device has learned their lessons at this point. Although, a cursed Nvidia Hackintosh would be extremely funny.

For normal people, there are three computer operating systems: Windows, Apple, and ChromeOS. Nvidia isn't going with ChromeOS and Apple hates their guts, so Windows is the only normal operating system they can market.

Their marketing makes clear that these devices aren't the piddly Chromebooks that ruined the desktop experience for so many people (expensive Chromebooks were nice, but rare in practice).

Qualcomm promised Linux support, failed to deliver, and now anybody burnt by their promise won't want to buy their hardware again. If they promise a Windows PC, people won't have reason to complain when Linux or FreeBSD or SerenityOS won't boot on there. Given Qualcomm's failures here, Nvidia is probably doing the right thing.

kmac_ 12 minutes ago|||

Windows is dying a death by a thousand small, user-unfriendly decisions. This is genuinely sad because the technology underlying Windows is actually very robust and flexible.

So, the partnership is maybe natural, but not prospective. Also, note how Linux is getting popular among gamers. Of course, it's way behind Windows, but the direction of the change is clear.

I'm convinced that Nvidia is not primarily targeting the consumer market and that the ultimate goal for its CPUs is the server space. The company invests effort where the money is, and consumer products account for only a fraction of its total revenue. Maintaining a presence in the consumer market seems more like a way to avoid a complete pivot than a strategic priority.

dylan604 33 minutes ago|||

> Although, a cursed Nvidia Hackintosh would be extremely funny.

I did this for years. We ran Resolve color correction suites with external chassis to place multiple Nvidia GPUs in it at a fraction of the cost of the shitty TrashCanMac that was available. Lots of people continued to use the 2012 Cheese Grater MacPro with its older CPUs. The only way to get modern (at the time) compute in a Mac was to use a Hackintosh. Since it wasn't for personal use, not having things like AppStore, Messages, Music, etc wasn't a big deal, so building a Hackintosh was easier.

I built one for personal prosumer use around the time of the 1080s that allowed me more machine for the dollar than Apple offered. Once the M-series chips came out and they were capable of what the Hackintosh was doing for me put me off of building anything newer.

jayd16 1 hour ago|||

A big push specifically for Windows ARM from Nvidia seems like relevant information.

bigyabai 1 hour ago||

> It's just a personal computer

Your x86 machines were, but these are ARM SOCs. Many of them don't even support UEFI, let alone the upstream Linux kernel.

rvba 1 hour ago||

Getting rid of UEFI is bad?

bigyabai 1 hour ago||

Can you quote where I said that?

dylan604 32 minutes ago||

Sir! I'm not an LLM.

dofm 3 hours ago|

Here is the press release for the actual machine:

https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-...

I have been somewhat surprised at the lack of commentators observing that this is Microsoft and above all NVIDIA launching a device that is fundamentally at odds with the metered cloud model of AI.

When you look at the other announcements and murmurings (better offline BYOK for Copilot, talk of an unmetered AI future) I think it’s clear that these two firms understand that cloud-only AI is not sustainable or inherently in their interests. But their willingness to undermine OpenAI with a product like this is notable.

thewebguyd 58 minutes ago||

Yeah, "unmetered intelligence" was probably the most used phrase at MS BUILD this past week. They are going hard into local AI

dofm 34 minutes ago||

I don't think you can interpret this as anything other than a sanctioned rebuke, right? Everyone has a strong visceral sense for what that means.

Copilot just got proper "offline" BYOK support, didn't it? Presumably that was one of the things they were talking about. Though I imagine that has something to do with the fact that Zed has supported that properly for months.

tantalor 3 hours ago||

Maybe. Or they are simply hedging their bets.

More comments...