Posted by T-A 22 hours ago
Who confirms those requests are legit?
Why the emphasis on sovereign? Open is good enough. No?
Why do we need capabilities in Europe? Because Trump and Xi can't be trusted to keep providing us with new frontier models in the next years.
Nothing below that really seems to be good for anything other than training for specific tasks. I have not been impressed by the earlier Apertus 8B model, which doesn't feel like it really responds to nudges.
I am a strong believer in smaller models, so I might try one of these out of curiosity to see if it might do useful things in limited contexts.
> Fully open model: open weights + open data + full training details including all data and training recipes
There are equally open, much more useful models out there: https://artificialanalysis.ai/?models=nvidia-nemotron-3-ultr...
That doesn't mean much to the many people I know of who refuse to use a technology that they see as being unethically created using the work of others without compensating them.
I continue to hope that someone will train a "vegan" model on licensed or out-of-copyright data so those people can experience the benefits of this class of technology.
(I compare them to vegans because, like vegans, I think their ethical position is credible and has merit even though I do not choose the same ethical framework for myself.)
How many normal people do you know who use "ChatGPT"? A lot, probably.
How many even know what "Gemma" is, let alone have downloaded llama.cpp, a GGUF file from Hugginface, and run "llama-server" from a text console with all the correct command arguments? How many are thinking about this use case when speccing out their next computer? Where is the breathless marketing copy boasting x tok/s?
We are sleepwalking into slavery.
Yes, I realise this isn't "running a local model", but it's using models that can be grabbed and run locally. For my pipelines, I feel far more confidence when I use an open model (even one like GLM-5.2 that would be expensive for me to run) since I have a backup plan if the hosted/cloud option becomes unworkable for me. If that happens to me with Opus, I have zero options.
This choice is made for us. The deciding factors will be convenience and economics.
My sense is that just like Web 2.0 SaaS we are destined for servitude.
A better strategy is to play an assymetrical game IMO. Don't let your would-be master write the rules by which you play.
What do you mean by this? Do you have an example in the given context?
You would also be shocked what's possible on a 64GB Mac Studio, which isn't that unattainable.
I can see this as a future battleground but access to frontier models (which you cannot run locally) seems a lot more relevant today.
It's important that people get used to the idea that your interactions with a language model are a highly personal thing. LLMs can perceive and categorize us in ways we can't even imagine, far more violently than the simple algorithmic feeds which have already corroded public discourse so much. LLMs can control us. LLMs warp the information landscape more radically than even the internet did. Even now you are likely underestimating their role in future society.
The principles of software freedom are becoming existentially important.
Of course the frontier will always be unattainable, but that's like pointing out that I couldn't buy my own Cray supercomputer.
That’s a bit hyperbolic…
Yep. I'm an old time Linux sysadmin, but I am COMPLETELY baffled as to what I can or cannot run on my 32GB R9700 with 128GB main CPU memory.
If I want something Claude or Codex like what do I use that would be useful? If I want a chat system, what do I use? Images--apparently ComfyUI for setup but after that what do I do?
I don't even mind spinning up something in the cloud for a bit, but I need to know how I'm going to get data up and down without racking up massive bandwidth charges.
I'd love to do some tinkering, but the field is moving so fast and so full of charlatans that cleaning the dross out is almost impossible.
I don't have recommendations for images because I haven't played with those.
The jokes write themselves.