Open source AI must win

Posted by vednig 2 days ago

Open source AI must win(opensourceaimustwin.com)

1575 points | 471 commentspage 3

b3ing 1 day ago|

I agree but we are dependent mostly on Chinese models at this point to pull it off.

vorticalbox 1 day ago|

where they came from doesn't matter so much so long as they are open weights and I can run on servers I control.

puttycat 1 day ago||

The final weights in themselves don't tell you anything about what went into the training process, e.g censorship.

WillAdams 1 day ago||

The win I'd really like to see would be for remuneration of training data, and for a provenance of all the data used by a given LLM.

softwareseko 1 day ago|

[dead]

inciampati 2 days ago||

There is nothing more surreal in AI chat than entering your own name and being told you are a banned topic. Open source models must win. There is no alternative.

manoDev 2 days ago||

Don't worry, open source AI will win. There's a reason everybody is desperate to IPO fast and get an exit, their competitive advantage is not lasting long.

ramcrissesangry 2 days ago||

As an person whos getting into tech and already developing a game, the fact that laptop prices since 2020 have increased by 20-40% is insane. It's delaying the time to create my game. I researched the reason for the cost spike, and most of it is from the excessive money put in ai Technically, the owners of AI could slow down the amount of GPUs and RAM they buy because AI has almost reached its most usable peak. Everything they add just introduces more bugs, so instead of building more AI centers, they should focus on improving the main AI model with bug fixes. There's no need to give it more unnecessary power. Most people don't care; the entire business is run by a few old men who think AI is everything and invest huge sums of money to show other AI companies they need to improve to get more funding from old people. We just need to find something new and innovative for older investors to focus on, so not everything is about investing in AI like Roblox, OpenAI, Google, etc. The extreme amount of reasoning power given to AI is causing bugs, and the moments when AI had outbursts towards people are related to this.

echelon 2 days ago||

> because AI has almost reached its most usable peak

It doesn't seem to be showing any signs of stopping. Have you used Fable 5? It's a fantastically capable model and trumps anything that came before it. Seedance 2.0 is categorically the best video model, and it's only a few months old.

> the entire business is run by a few old men

Startups tend to skew young, and in this case it's no different. Most of the leaders of AI companies are decades younger than the CEOs in other types of industries.

> who think AI is everything and invest huge sums of money to show other AI companies they need to improve to get more funding from old people.

They're spending capital to win market share and to try to build a moat. One of the most important things in business is building a durable way to keep competitors from taking your market. You spend enormous capital to win customers, and it would suck if other businesses could watch what you did, spend less money, and come in and take everything away. The money being spent is an attempt to have a durable lead.

It's working. Enterprise contracts are deep and sticky tendrils that work through governments and large companies. Both OpenAI and Anthropic have massive partnerships with Fortune 500s, the DoD, you name it - and these contracts will last and print enormous amounts of money. This makes it incredibly hard for other players to enter the market and build a cash flow with which to compete and thrive.

> find something new and innovative

This is easier said than done. It's an incredibly hard problem. It took decades to find the last big technological waves: the PC, the internet, broadband, smartphones. Now AI. These are generational step function increases. The groundwork can be decades old, but it takes time to proliferate before it can become a big business.

Other possibilities include fusion, green tech, quantum computing (useful for crypto breaking, etc.), AI drug discovery, etc. If you go into research one day, try to find an interesting field with potential for commercialization - that could make you very wealthy if you find something you enjoy working on, with lots of greenfield opportunity, that is ripe for turning into products.

Good luck with your game! You should post it here on HN when you finish. You'll get lots of great reviews, comments, and early players. :)

ramcrissesangry 2 days ago||

thx I will consider what you sent.

rustcleaner 2 days ago|||

They want to corner the compute market and destroy the personal [sovereign] aspect, so that you are forced to subscribe and pay them regularly [indefinitely] and the US security state can surveil you. Never subscribe, and never buy products from companies who subscribe. Starve them, bankrupt them! We do it by not subscribing!

dakolli 2 days ago||

Why have you sent this same message multiple times?

ramcrissesangry 2 days ago||

I didn't know how this worked I thought it deleated it, at first.

aberzun 1 day ago||

It will win - in the sense that AI too will become a freely available resource. You can't stop progress.

My bet is that once cost-efficiency becomes a priority, we will figure out ways to get away from the expensive GPU infrastructure on figure out how to architect models for CPUs. I still remember that Microsoft paper about ternary weights.

egonschiele 2 days ago||

I have been working on this exact problem, and I suppose now is as good a time as any to talk about it.

To make any agent "good", there are two components: the model and the harness. Very few companies can train models, but anyone can build a harness. How much does the harness matter? Can I build a harness that's good enough that I can use open source models with opus level performance? That's the question I've been trying to answer by building better harnesses. None of the existing frameworks have the functionality I need to build a good harness. The features I need are language-level... and so I started building a language called Agency[0].

It's been six months and its going well. Some of the things Agency can do are wild:

- It can pause and serialize execution at any point, making HITL easy

- It has some neat safety capabilities such as handlers[1] and PFA[2]

- You can bundle up any agent as an HTTP or MCP server[3]

- I'm now working on a built-in optimizer to optimize agents (think DSPy).

Obviously, it's a huge undertaking, but having worked with the Agency for six months, I can't imagine going back to another framework. It makes things so easy. I'm working on its built-in agent now [4]. My goal it to get it to be as good as Claude Code, but using open source models. It's still early days, lots of rough edges, but if this sort of thing interests you, I'd love to have a few more people test it out.

[0] https://agency-lang.com

[1] https://agency-lang.com/guide/handlers.html

[2] https://agency-lang.com/guide/partial-application.html

[3] https://agency-lang.com/cli/serve.html

[4] https://github.com/egonSchiele/agency-lang/blob/main/package...

bozdemir 1 day ago||

I think we need it to prevent slavery happening again.

blueblisters 2 days ago|

I'm assuming this is popular because of Fable restrictions. AFAIK, open source is not excluded from ITAR / EAR restrictions (or other export restriction in other countries).

So the real solution you're looking for is technology that can't be arbitrarily gatekept by a sovereign nation.

cududa 2 days ago|

I’ve been exceptionally displeased with Claude Code since end of February and switched completely to Codex in April. The blasé way in which one person (Borris) capriciously changes the system prompt multiple times a day, also no longer writing his own prompts (whatever that means).

That, the 5 different secret levers you have to pull to make it not stupid, the fact you hs e to go to the guy’s twitter account to find all the un-dumbing features and flags that aren’t documented anywhere else. That they decrease thinking budgets silently when they run out of compute instead of announcing the rationing, and gaslighting users at every step of discovery. The fact that internally they have their own coding harness and don’t use Claude Code primarily. The lack of formal evals and consideration for millions of users collective hundreds of millions of hours of investment in their workflows — that’s all off the top of my head, let me tell you how I really feel about what they did to Claude Code..

I adore gpt5.5 and maintain my own codex fork - but I have no idea how long I’ll get this performance / cost - I know it won’t be forever. I’d like to know precisely how much it’ll cost in hardware to run a gpt5.5 open source model locally. Hell a lifetime license to a model I can run locally is also be open to.

But I like building my own tools, from software to physical shop tools. I like being able to rely on my tools.

More responding here to the assertion that this is blowing up due to Fable.

More comments...