Top
Best
New

Posted by amarble 1 day ago

There is minimal downside to switching to open models(www.marble.onl)
372 points | 298 commentspage 3
ZeroGravitas 12 hours ago|
It seems the best self-hosted and the worst models served by big providers has some considerable overlap in quality.

Whatever reason people have to run those (cheaper? backwards compatibility once you get something running) surely applies to the open models too, maybe even more so.

myzek 14 hours ago||
Any tips on which model to use and how to use them? I have 64 RAM and 16 VRAM (I know it's not a lot, it's a gaming GPU) and I'm trying to find a good model to use but it's a bit of a struggle
petesergeant 11 hours ago||
Headline: "The is minimal downside"

Article: "I’m hoping it’s going to be minimal"

peter_retief 15 hours ago||
What open models are "recommended"?

I like the Linux analogy, I struggled with Linux way back.

Animats 14 hours ago||
OK, now what? Someone offers open models as a service? That's basically a time-sharing computing business - people at terminals sharing remote computing resources. If you buy your own H100 it will be idle while you're typing or reading or thinking. So sharing makes sense.

But it doesn't have to be an "AI company". It's just a compute service. The companies that offer web hosting could get into this.

flexagoon 12 hours ago||
> The companies that offer web hosting could get into this.

They already do. DigitalOcean is one of the providers on OpenRouter, for example

HarHarVeryFunny 6 hours ago||
There are lots of companies providing open models as a service. DeepInfra and Fireworks AI for example. Even Amazon for that matter.
PcChip 21 hours ago||
Is it just me or is half the article missing?

I enjoyed the first part though

DANmode 21 hours ago||
But, what model are you using?

and what hardware are you using?

0gs 21 hours ago|
yeah, on a 96GB Mac Studio and Gemma+Qwen, it's definitely fully doable. fully doable but not really for coding on 16GB. but svelter models and cheaper (eventually) hardware are coming!
nezuzen 21 hours ago|||
"cheaper (eventually) hardware" Best case 2-3 years from now. Otherwise it will take a major global recession to get us anywhere near last year's prices.
marcus_holmes 18 hours ago||||
Macs are expensive hardware, but I'm always seeing people running LLMs on them. Is anyone running on cheaper generic hardware and Linux?
numpad0 5 hours ago|||
Qwen3.6-35B-A3B-Q4_K_M.gguf spread across few 8-16GB GPUs is cheap as reward points for a comparable Mac if you don't mind heat, noise, and not-blazing-fast generation speeds.

Most ATX cases only has 7 PCIe I/O shields and can't take more than 3x double slot cards, but many gaming systems can take 2x double slot full length 16GB cards, and they should be fine for many purposes. Cooling is most easily done by a squirrel cage fan mounted with a 3D printed bracket at the back.

Cheap parallel action crimping tools for Molex 5556 works too - PCIe 8-pin is NOT 5557, it's differently keyed, so the specifically PCIe intended housings have to be used for cables, if you are crimping your own.

No one is mining crypto anymore, and crypto PSUs are being dumped dirt cheap, should you want a stable bulk 12V supply.

brucehoult 18 hours ago|||
A Mac is cheaper than a high end GPU with the same amount of RAM.
marcus_holmes 17 hours ago|||
ah, right, so it's about Apple Silicon being fast enough to use instead of a GPU?
brucehoult 16 hours ago||
They use the GPU but an Apple Silicon GPU has the same high speed access to all the RAM on the machine as the CPU does, rather than having its own walled-off maybe 16 GB VRAM in mainstream gaming GPUs or 24 GB in RTX 4090 or RTX 5090 (MSRP $1999 but in practice $3000-$4000 at the moment). Nvidia A100 (80GB VRAM) apparently cost $15,000 or so.

Not only does Apple's unified memory give the GPU more RAM to use, but it also eliminates copying things between CPU RAM and GPU RAM.

A Mac Mini with 48 GB RAM costs $1799. A Mac Studio with 96 GB RAM is $3999 — until March you could get a Mac Studio with 512 GB RAM for $3999, all of which could be used for your AI model.

https://www.tomshardware.com/tech-industry/apple-pulls-512-m...

Some are coming up used at silly prices.

https://www.trademe.co.nz/a/marketplace/computers/desktops/a...

NB NZ$44,999 is "only" US$25,772.

0gs 7 hours ago||
i believe you meant something like US$9999 for the 512GB. otherwise, i'm going to feel like QUITE the fool for choosing the 96GB variant at the same price
fsuts 14 hours ago|||
And use less power
Gigachad 19 hours ago||||
I suspect hosted and local will converge when hardware prices come down and API prices go up. The massive rate of datacenter build out will be unsustainable. Right now the hosted models are massively cheaper than buying the hardware and running it yourself which signals that hosted is very subsidized.
fluidcruft 19 hours ago|||
If you don't have that hardware thr math of buying a depreciating computer is challenging if you are satisfied with the $100/month plans ($1200/year). A 96GB Mac Studio is ~$4k. I think if you have the hardware already as a sunk cost then yes it makes sense. But I'm not sure it is worth spending $4k for today's hardware vs waiting for newer hardware in a few years.
OtomotO 15 hours ago||
I am absolutely pro local and true open source models.

Personally I haven't seen any productivity gain since Opus 4.5 times.

But: I can't fully get behind the opinion that (so called) "open source models" are simply superior and will be in the future, because when I asked some models who they are, they answered with "I am Claude from Anthropic", which could mean they have been trained by exfiltrating Claude.

I have NO moral objection to this, as Anthropic and "Open""AI".also trained their models on anything they could get their hands on.

It's more about the question: can and will these models be updated, even if Anthropic et al fail. Who's gonna pay for training then? What's their incentive? Have we reached a plateau?

fuck_google 12 hours ago|
[dead]
cpill 18 hours ago||
I think once the hardware process comes down and these mini DGXs become cheaper, and by then open models still be smaller and better, there is going to be less and less reason to use the providers. CEOs are already complaining that they are costing too much. There are also large organisations like Banks which can't use external services and are already looking at internal housing. it's a good thing so the big AI companies just went IPO as once the self hosting trend kicks in they are going bust.
aussieguy1234 19 hours ago|
>There was a time not too long ago when using Linux entailed some professional risk1. First there was compatibility: you may not have been able to render a Word document or PowerPoint correctly, and you might have had to trust Open Office’s export capability to render docs the way you wanted

For a while during this era, I used to port my laptops windows installation into a virtual machine that can run on Linux. It took a bit of hacking away but I could usually do it in a day or two. Then its all Linux with the windows vm being used for the microsoft stuff.

More comments...