Top
Best
New

Posted by armanified 1 day ago

Show HN: I built a tiny LLM to demystify how language models work(github.com)
Built a ~9M param LLM from scratch to understand how they actually work. Vanilla transformer, 60K synthetic conversations, ~130 lines of PyTorch. Trains in 5 min on a free Colab T4. The fish thinks the meaning of life is food.

Fork it and swap the personality for your own character.

842 points | 126 commentspage 6
zephyrwhimsy 5 hours ago|
[dead]
adamsilvacons 13 hours ago||
[dead]
maxothex 11 hours ago||
[dead]
solsafe_dev 12 hours ago||
[dead]
Morpheus_Matrix 1 day ago||
[flagged]
agdexai 13 hours ago||
[dead]
ethanmacavoy 23 hours ago||
[flagged]
zephyrwhimsy 9 hours ago||
[dead]
Alexzoofficial 20 hours ago||
[flagged]
agenexus 23 hours ago|
[flagged]
More comments...