Posted by cloudking 12 hours ago
Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?
I've been working on an ops-style tool for local LLM inference: proxying, API keys, request logging, model rewriting, and much more.
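For readers unfamiliar with what "model rewriting" in such a proxy might look like, here is a minimal sketch. It is not the commenter's tool. The alias table, the key set, the `handle_request` function, and the OpenAI-style JSON body with a `model` field are all assumptions made for illustration.

```python
import json
import logging

logger = logging.getLogger("llm_proxy_sketch")

# Hypothetical alias table: client-facing model name -> locally served model.
MODEL_ALIASES = {"gpt-4o": "qwen-local"}

# Hypothetical set of accepted client keys.
VALID_KEYS = {"dev-key-123"}


def handle_request(headers: dict, body: bytes) -> tuple[int, bytes]:
    """Return (status, body) for one chat request.

    On success the body is the rewritten payload to forward upstream;
    on failure it is a small JSON error message.
    """
    # API key check: expects an "Authorization: Bearer <key>" header.
    # (Header lookup here is case-sensitive; a real proxy would normalize it.)
    auth = headers.get("Authorization", "")
    key = auth.removeprefix("Bearer ").strip()
    if key not in VALID_KEYS:
        return 401, json.dumps({"error": "invalid API key"}).encode()

    payload = json.loads(body)
    requested = payload.get("model", "")

    # Model rewriting: swap the requested name for the local one, if aliased.
    payload["model"] = MODEL_ALIASES.get(requested, requested)

    # Request logging: record what was asked for and what will be served.
    logger.info("model %r -> %r", requested, payload["model"])
    return 200, json.dumps(payload).encode()
```

The idea is that an editor or agent hard-coded to ask for a hosted model name can be pointed at the proxy unchanged, and the proxy decides which local model actually answers.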
Qwen 3.6 35B-A3B on a Framework 13 with 32GB of memory.
Running llama.cpp, 15 tokens per second. Outputs code and text faster than I can parse.
Also, the lack of enterprise tooling to help select an appropriate model and run a local LLM does not help.
My Homelab AI Dev Platform
I'm waiting to swap out my last-gen Intel iMac for a new M5 mini of some kind, with an eye to running some models locally. I envision a mini (heh) arms race of simply swapping out an M(X-1) for an M(X) annually as this field shakes out.