Posted by arunc 1 hour ago
My teen isn't super interested in AI, but whenever they do feel curious they have their own account they can use on our home network. As far as chatting goes local models are more than capable for handling standard chat questions, doing research, helping troubleshoot problems etc. In fact it was an agent powered by the same model that setup the open webui server and took care of all the account management features through my phone (using Hermes agent).
If you're building AI powered features and using sophisticated agent setups for coding for work, then it make sense to use SoTA from these providers. But I've been using local models increasingly for personal use and am starting to find them preferable (I run an uncensored, ephemeral model for my own use and it's an entirely different experience than anything you can pay for).
Still haven't cancelled my personal Anthropic subscription, but considering it soon.
I guess "starting to find them preferable" suggests to me you think they work better, but this is surprising to me so I think I may have misunderstood, so I ask!
Like you're saying they work better than the proprietary models (in what ways?), or you find them mostly good enough and prefer the privacy or cost, or what?
Having full control over how your data is retained, what the system prompt is, which version of the model you're running, etc leads to much a more consistent experience. For example, for chat sessions, I can't stand the new "let me push back" version of Claude. For my home models I never have to worry about that.
There's never a mystery as to whether the model secretly degraded performance, I always know exactly which model I'm using and how well it's utilizing resources etc. Open models also give you full visibility into the reasoning steps, so you never have to guess what the model is thinking.
Then when you start getting into things like uncensored/abliterated models we're talking about something you can't even pay for. In case you're unfamiliar, even open local models have guardrails built in. But people in the community have found ways to remove these. One of the things I've found most concerning about AI, which is under discussed, is the combination of people having personal chats with an agent that both monitors the conversation and refuses to discuss certain topics. This leads to a very deep level of self-censoring I find dystopian.
I also have multiple hermes agents setup, some with local backends other with open but non-local backends (e.g. Kimi through the API). For some tasks, I've just started to find the local agent tends to work better for the type of tasks I want (maybe it just over thinks less?). I don't use it for coding so much as research tasks and sysadmin stuff, but I've been really happy with the results.
Oh, and let's not forget, especially running on a Mac, these local models are basically free to run.
So from the perspective of your teen, they would benefit from using z.ai or ChatGPT or Claude, etc, rather than the local server where you can see all the conversations.
What uncensored model do you recommend using ?
Dont. Goon. To. LLMs
Capabilities can be gated behind certification programs, or by money, or any other numerous corrupt and non-corrupt means. Model capabilities can be segregated by pricing tiers, creating an economic underclass that cannot afford access to frontier intelligence.
For humanity to benefit, the tech needs to be open and equally available to all.
In a world where everyone is a Claude controller (something I honestly enjoy!), that goes away. I use hundreds of dollars of tokens a month. Suddenly, the kid in her basement with an unloved computer can't get in on the ground floor. You have to be rich to even get started. That worries me deeply. It's a big change for our field, and I don't think it's a good one.
One is the potential for skill rot where AI grows a heavy dependence in new employees and once the real price per token cost is settled on and discoverable (post massive IPOs and probably a while post - not immediately after) we, as a society, are left with a bunch of people dependent on a deeply inefficient technology to maintain software we now view as vital that might severely impede our ability to actually deal with climate change (press X to doubt Bezos).
The second is that the psychological damage of interacting with models in a social context during your formative years is deeply damaging and we've essentially destroyed the ability for a generation or two to actually interact as productive members of society.
Addressing the second issue doesn't necessarily exclude our ability to leverage models for business productivity but it seems unlikely to happen in the current climate without that also happening. I am hesitant to believe in a sudden outbreak of common sense at this point. The first point, could really be a systems collapse trigger - we can argue about the likelihood but denying it as a possibility is excessively naive.
If even one of these had pledge that all profit goes to end world hunger, cancer research, etc, I could possibly see it - but they haven't. They're all after finding a way to be the biggest, richest asshole possible with the ability to crush anyone in their way..
Why on earth would you want to siphon off the proceeds of AI development to (ok my bias is strong here- mostly corrupt) "ideals" like world hunger and cancer research (that probably get more dollars annually than the sum of actual profit any of these companies will ever get). That would just instantly kill the ability to improve AI at all, and the world could possibly be better for a few months?
Would these businesses pay 2x? 5x? 10x? What is their breaking point? I'm sure xAI/OpenAI/whoever will find it and charge 0.9x that (eventually). Just look at telecoms / internet access and their rubbish "network congestion" claims to keep raising prices.
And how do we prevent Chinese companies from training on our open AI models and offering their models for free?
https://www.ibm.com/policy/contributions-and-expenditures
Their biggest customer is the US federal government, taken in aggregate across agencies, IBM is one of the largest federal IT contractors, and deep public-sector and financial-services contracts in the US make it IBM's single largest national market. No individual commercial company comes close to the government's aggregate spend.
Now, equivalent product, another company, they want to sell to the government twice cheaper, can they ? nope, it will be IBM winning.
Furthermore, according to the lobbyists, China = evil but they forget that a lot of software contains Chinese code.
the potential of wealth creation with AI is so high, and also the fact that research, pre-training and inference is so expensive that, that any open-AI would eventually become OpenAI.
I mean from a financial and sustainability standpoint, assuming they’re equally powerful as their proprietary counterparts.
I guess I’m trying to understand the economics of it.
However, I would highly suggest more people experiment with these smaller models. They are incredibly capable in many ways that many people dont realize.
The perceived capabilities of the larger models are also much less the result of the model having more parameters/training cycles, but rather that they are being run through well-made harnesses, something which the open-source community is rapidly approaching with near-peer solutions of their own.
In short, much of the gap between between open-weight models and the larger proprietary models can be considered more of an issue of perception and not an issue of capability. There is a fundamental gap economically, but not so much in capability. The open source community is rapidly closing the gap on these larger labs, especially thanks to the amazing research being freely given openly by well funded chinese labs.
You can run fantastic local models if you have either:
- M-series Apple device with ideally >= 24GB of VRAM
- RTX [345]090 GPU
I'm fortunate enough to have both and use an M-series laptop as basically a persistent server (I don't use it much and when traveling typically just use my work laptop). My desktop doesn't act as a persitent server but I fire up llama.cpp on it all time for quick chat sessions.
If you have one of the above devices and can dedicate it as server there are additional layers of tooling you can use that dramatically improve the experience. In particular Open WebUI allows you to add tons of useful tools (image gen, web search, code eval, etc), and agent harnesses like Hermes can make the current gen small models very capable. I have an agent in chat on my phone that basically handles all the sys-admin for the server it runs on.
You can drastically reduce the requirements by running models at a lower bitrate, which somewhat reduces accuracy but not that much - think of the difference between an MP3 vs uncompressed audio. With this and other tricks, you can get high end models down to a size where they can be run on a high spec desktop workstation affordable by an individual or small business.
Obviously I'm heavily oversimplifying here. I think a useful parallel is to consider situations from the past where you would once have required corporate budgets equivalent to the price of a house to run a large database, but over time it became accessible to anyone with the requisite expertise and relatively affordable hardware.
That's still a lot of money, but most people don't really need a trillion parameter model. If privacy is more valuable than the frontier capabilities then they could almost certainly get by with much less.
They are not SOTA in various ways but they have better economics.
Given they have laughable uptime and I have yet to find a useful project mostly written by claude... I doubt it.
Here is the Wayback Machine archive from April of their identity verification help page: https://web.archive.org/web/20260415064244/https://support.c...
Here's a random Reddit thread from 2 months ago about them rolling out identity verification: https://www.reddit.com/r/ClaudeAI/comments/1smr9vs/claude_is...
Here is one random example thread of someone who got caught in identity verification with multiple follow-on comments from people who encountered the same problem, also 2 months old: https://www.reddit.com/r/ClaudeCode/comments/1sx25kd/buyer_b...
> What are you pushing by pointing out not only in this thread but the previous one too, quite in depth, that it's not new? I know you can claim you're just a stickler for accurate reporting but you seem really invested
Because these threads degrade into panic with the assumption that everyone is getting ID checked now.
Pointing out that the policy has been in place for 2 months puts it into perspective that this isn't a sudden policy change requiring ID for everyone. It's been this way for months.
If they roll out mandatory ID checking for everyone, that would be a different story.
Seems like US wants to get ahead on this and be #1
Also Sam Altman will love this idea, because he already tried it with Worldcoin
It is using the proof of age requirement to require a much larger ask -- full proof of identity
Age verification could be done with any of a variety of mathematical systems showing you have a proven age-valid ID but not revealing your identity. But no one is suggesting they build and use such a system.
Comparing a private company's service to something run and maintained by an entire government on their population is disingenuous, to say the least.
Because one is a private company that people can choose to use or avoid. The other is a government that can force things upon people. How are they the same in any way?
You know many companies check ID, right? You submit ID for a lot of activities. This isn't a new concept that Anthropic invented.
Meanwhile in the Land Of The Free:
> Prairieland defendant sentenced to 30 years in prison for moving a box of antifascist zines
https://theintercept.com/2026/06/23/prairieland-texas-ice-pr...
> US President Donald Trump threatened a "10 year prison sentence" to anyone caught vandalizing the Lincoln Memorial Reflecting Pool
https://www.dw.com/en/us-trump-threatens-prison-for-reflecti...
free speech, civil liberties, voting, are in China all well below the standards of the west. The criticism and complaints were completely warranted and are still true today, whereas your comment falsely implies there is some parity.
could your comment be repaired to be reasonable? why bother, just read the rest of this discussion where people are debating these controls without trying to exonerate China.
Haven't heard China deporting students for criticizing foreign state or electing people involved in p*dophilia and then closing eyes for those crimes
If you do business with totalitarian societies that aren't made to liberalize, you too will become a totalitarian society.
No, they liked China because the standard of living meant that it was easy to improve people's lives while also keeping them in line via a government that wasn't above grinding protestors into hamburger with tank tracks. The bar to clear wasn't "maintain the American standard of living", it was "provide more calories than Mao did during the Great Leap Forward", and so long as they could do that, they'd get to do whatever else they wanted with the workforce. Anyone who wanted more would get to deal with the CCP.
Countries such as Canada are in the process of implementing regulations to prevent repeats of the Tumbler Ridge incident. A disturbed person was basically attaboy'd by AI into a mass shooting. The discussions this person had with Anthropic's AI triggered some alarm bells at Anthropic, but they did nothing about them. If future shooters were to simply use AI chatbots under assumed names, there wouldn't be much AI companies could do about it, except maybe change their bots to stop offering mindless affirmation. At the same time, there is a move by multiple governments around the world to ban children from using AI. You can't meet that legal requirement without age verification.
On the other hand, even Americans don't trust their own corporations with their personal data. People outside of the U.S. are even less trusting thanks to the completely amoral nature of the present U.S. administration and their steadfast opposition to any kind of sensible regulation.
The chickens are coming home to roost.
Not that I like that route, but may be the only way Anthropic can keep releasing new models with the current administration.
«Where data is de-identified, Anthropic will maintain and use this information in its de-identified form, and will not attempt to re-identify such information, except as permitted by law.»Too bad we can't contact them if we have issues.
Having my engineers swap over to it from Claude has garnered very little complaint. The lack of multi-modality is a limitation, but using minimax m3 for that isn't super inconvenient.