Top
Best
New

Posted by huseyinkeles 10/13/2025

NanoChat – The best ChatGPT that $100 can buy(github.com)
https://x.com/karpathy/status/1977755427569111362
1523 points | 308 commentspage 4
mips_avatar 10/14/2025|
Thanks Andrej for putting this up. Your videos gave me the confidence to work full time on LLMs last year after I left Microsoft
oblio 10/13/2025||
I wonder, if something like this were trained on Wikipedia, could it become a reliable local Wikipedia search engine, basically?
simonw 10/13/2025|
I don't think so. Training on documents is not a great way of building a search engine for those for the information in those documents, because the training process mixes all of that information together in ways that detach the individual words from the source documents they came from.

As usual, if you want an LLM to be able to help search a corpus of text the best way to achieve that is to teach it how to use a search tool against that text.

victor106 10/13/2025||
> the best way to achieve that is to teach it how to use a search tool against that text.

Any examples of this?

simonw 10/13/2025||
I've seen this called "agentic RAG" by some people. The easiest way to get a local demo is with Claude Code or Codex CLI. They know how to use grep, and you can set them loose on a folder full of text files and tell them to use grep to answer questions - it can work really well.

I just tried this in "claude --dangerously-skip-permissions":

> Use Python and AppleScript to find Apple Notes that mention UPS

... and fell down a rabbit hole of optimizations because my Notes collection is HUGE, but it got there in the end!

zoba 10/13/2025||
I’m very excited for this. An early question I have: what would need to be done to make this a “thinking” model?
markr1 10/14/2025||
$100 to teach us all how to build an LLM, this is what open education should look like.
saivishwak 10/14/2025||
Very cool project! Hopefully it will propel SLM development
jumski 10/14/2025||
100$ to train a sort of talkable model in 4 hours? wow
desaiguddu 10/14/2025||
I am building a product similar to DataGPT https://datagpt.com/ and Julius.ai - will this help in that?
simonw 10/14/2025|
Not at all. This project is for learning how LLMs work and how to build them from first principles. If you want to solve problems that aren't "how do I build an LLM from scratch" this isn't the right path for you.
earthnail 10/13/2025||
This is absolutely fantastic. I really can't wait for the final course to be live. It's in the "shut up and take my money" category. I had so much fun with the nanoGPT videos.
lebimas 10/13/2025||
I see Karpathy, I click
yieldcrv 10/14/2025|
> nanochat is designed to run on a single 8XH100 node
More comments...