Open source AI must win

Posted by vednig 9 hours ago

Open source AI must win(opensourceaimustwin.com)

914 points | 285 commentspage 2

lying4fun 1 hour ago|

I’m sorry I can’t read past the first paragraph

FrojoS 1 hour ago||

What about pirated models?

eunos 2 hours ago||

My grim view is that it's just one incident away from some evil freaks to use ablated offline model for some nasty acts to have lawmakers lose their mind and try to regulate open source models and even consumer GPU. Think the latest 3d printers restriction.

zozbot234 2 hours ago|

> some evil freaks to use ablated offline model for some nasty acts

If this is a serious concern, why hasn't some red teaming effort demonstrated this possibility already? The fact of the matter is that ablation can't give a model world knowledge it doesn't have as part of training, it can only make the model confabulate. The "nasty" areas of concern are most notable for their world-knowledge requirements, which is where local models are at their weakest anyway.

eunos 2 hours ago||

> why hasn't some red teaming effort demonstrated this possibility already?

I'm sure they have but as usual we are a reactive society than proactive. Only when incident has occurred then we have momentum to act.

FabCH 3 hours ago||

While it is not at all practical to train an LLM with tens or hundreds of billions of parameters on hobbyists hardware, what if there are other architectures that perform just as well but are easier to train by 1000 volunteers?

I always wondered if 1000 1M parameter models fine-tuned to specific tasks with a small router could perform as well as 100B models.

And I know this is roughly how MoE works, but current MoE models still require training the model as a whole, and big players don’t have an incentive to change that.

But OpenSource community does…

logicchains 2 hours ago|

It is practical, albeit not as efficient: https://arxiv.org/abs/2603.08163 . But organizing enough people with decent-enough GPUs is the challenge.

borzi 2 hours ago||

I'm already coding more with DeepSeek than Opus, I'm doing my part :)

em-bee 8 hours ago||

what is Open Source AI even?

to me Open Source, like Free Software, is something i can run on my own computer. any AI system that runs on a computer that i do not control is by my definition not Open Source.

so how then can Open Source AI win? it can't even compete. even if we collect enough money and create a dedicated Open Source organization to build and run a community owned AI datacenter, how does that help?

so what exactly is the demand here?

nl 8 hours ago||

When kubernetes was released there were very few people who could run it, and even less that could run it usefully.

Right now there a few people who can run a 1T model at home, even less who can run a 5T model and probably single digits who can run a 10T model.

But if an open source 10T model was available you can be sure people would find new ways to quantize it, new ways to configure hardware and and new ways to think about problems that would make it useful.

1T+ models (Deepseek v4, Kimi K2.6 etc) are available as open weights now, and for ~$5000-$10000 you can run them usefully at home. 2 years ago no on was contemplating that.

$250K to run a 10T model might be possible now. There are many companies that will pay that, and that will push the tools and techniques downwards for the rest of us.

verdverm 6 hours ago||

case in point: https://spark-arena.com/leaderboard

cortesoft 6 hours ago|||

> any AI system that runs on a computer that i do not control is by my definition not Open Source.

This is not true at all. It would be open source if you could download it and run it anywhere that is capable, and are free to move it and modify it as much as you want.

Just because you don't have a computer at home powerful enough doesn't mean it isn't open source.

rustcleaner 5 hours ago||

I think he means theoretically in possibility space, without relying on a based insider leaking a 'closed' frontier model to bittorrent or hyphanet.

sheeshkebab 8 hours ago|||

Qwen models are actually very competitive with frontier models, and you can run them on your local computer. Gotta have a decent graphics card and by that time the current cost of the rig may not justify it over paying $100/month for cloud model but it’s all out there.

nirui 6 hours ago|||

Qwen is still controlled by Alibaba, one company. We can't let the future be in the hands of a few companies, can we?

Fun fact: Qwen was not initially a Apache Licensed project, it was based on a custom license from Alibaba that restricts commercial use: https://github.com/QwenLM/Qwen/blob/ba2d85a13b28ed1ee0dde2d6.... There's no guarantee that they won't just switch it back later.

Kudos for them for switching to Apache License, of course. BUT, they're still a for-profit company. So as DeepSeek btw.

rustcleaner 5 hours ago||||

>Gotta have a decent graphics card and by that time the current cost of the rig may not justify it over paying $100/month for cloud model but it’s all out there.

Never, ever, subscribe. When you subscribe, they win. They cornered the silicon market to force you to subscribe. Don't be a sub, or at least keep your sub tendencies in the bedroom. ;^)

NamlchakKhandro 7 hours ago|||

Fluctuating token costs make it worth it

itkovian_ 8 hours ago|||

Projects like pluralis agora solve this problem. Really what you want is the model to be collectively owned and governed, not local

singpolyma3 8 hours ago|||

LLMs that you can run locally on hardware that is not out of range to acquire is already a thing for some time.

bitwize 7 hours ago||

Recently I fired up Gemma4-26B-A4B on my 8-year-old PC... and it ran surprisingly well!

But I am going to need a much beefier machine to get it to the point where it can do any but very trivial dev tasks acceptably fast, and I'm going to need a much beefier model, perhaps one not so aggressively quantized, to keep it on task without the wheels completely falling off. Already we're talking serious money outlay, perhaps still within my programmer salary to accommodate, but just barely. And we're not even where near the performance characteristics a frontier model can support.

verdverm 6 hours ago||

DGX Spark runs this sized model (I personally like qwen36moe better than gemma4moe) at speeds fast enough for interactive coding sessions. Algorithmic advances like DiffusionGemma ~4x token gen speeds (https://deepmind.google/models/gemma/diffusiongemma/)

matheusmoreira 8 hours ago|||

We can run open weight models on our own machines.

em-bee 8 hours ago||

yes, but a model that runs on my own machine will never have the capacity of a model that runs in a datacenter. as i said, it can't compete with that.

thewebguyd 8 hours ago|||

If RAM prices ever come down, you can have a machine that can run a capable local model.

Qwen 2.5 72B is surprisingly capable, almost on par with GPT-4o if not a little better. You can run it on a 128GB Mac Studio with 8-bit quantization. You need about 77GB for the weights and ~15GB for your context window & cache.

Pricing remains to be seen, but there's also those new nvidia laptops coming out the surface laptop ultra should have 128GB RAM w/ Blackwell GPU, they're saying 1 petaflop of AI compute, if you can tolerate Windows (no idea if it'll boot Linux until the hardware is out).

These models are roughly ~1 year or less behind the frontier models. We really just need hardware to catch up and alleviate the price pressure on RAM.

rustcleaner 5 hours ago||

>If RAM prices ever come down

Maybe an unpopular opinion here (seening how Y-combinator is his baby), but I think OpenAI and Sam Altman should be financially decimated for cornering the DRAM market. What he's done is a step or two removed from what the Hunt brothers did. His buy-up of future DRAM silicon has measurably harmed personal computing, and he should not get to walk away with a 'win' from it.

randbyte 6 hours ago|||

> a model that runs on my own machine will never have the capacity of a model that runs in a datacenter.

I don’t think so. A local run model only needs to serve one or a few people. It seems possible to run a DeepSeek v4 model at full capacity on a server costing 200k usd. Very expensive but not impossible.

Factor in hardware and software improvements over time, and the fact that most people may just need to run a smaller and quantized model, it should take a pc at 10k usd scale.

melozo 7 hours ago|||

Huh? Open source is a quality of the software, not specific to the hardware used to run the model. The demand is that model weights are openly available for anyone to run and fine tune without restriction. Has nothing to do with the hardware it runs on.

ls612 8 hours ago||

Call it open weights if you must. But even with OSS just because you have the source code doesn't mean your machine is high performance enough to run it usefully this has always been true.

inciampati 5 hours ago||

There is nothing more surreal in AI chat than entering your own name and being told you are a banned topic. Open source models must win. There is no alternative.

TowerTall 3 hours ago||

And it will, but be patient. I took linux 25 years to conquer the world.

One day an open source model reaches "good enough" level. Maybe around the level the current frontier has and most people will use that

tomashubelbauer 30 minutes ago|

I don't even need today's frontier, give me a local model I can run on my Mac comparable to Claude 4.5 as of December last year and I'll probably lose any interest in new hosted LLM advancements altogether.

never_inline 6 hours ago||

I think articles this light on content should not be upvoted to front page.

3s 5 hours ago||

It's a perfect prompt for a rich HN discussion so while in general I agree with you, in this case the discussion is what matters.

raffael_de 1 hour ago||

This is almost always the case. Discussion quality went down during the last few years but HN is still _the_ place to attract people who really know what they are talking about.

ls612 6 hours ago||

I think that the events of this evening (really of this past week) are almost unprecedented in the history of tech. Sometimes a clear and concise message is more important than nuanced analysis.

earth2mars 7 hours ago|

This should be the top post. Not Anthropic or OpenAI marketing plots. This is existential.

echelon 7 hours ago|

It's too late.

You can one-shot a port of Linux to Rust and stop contributing to open source.

The value of software is going to tend towards zero. The value of the software developer the same.

Anthropic is now a kingmaker. It gets to decide which businesses get the expensive private model that can generate entire business functions at the drop of a hat. If you can't afford the price tag, then competition in the market is not for you.

Computing is no longer "personal". It's for big biz only.

slopinthebag 6 hours ago||

> You can one-shot a port of Linux to Rust and stop contributing to open source.

Touch grass brother. Seriously.

More comments...