Top
Best
New

Posted by simonpure 1 day ago

Autoresearch: Agents researching on single-GPU nanochat training automatically(github.com)
120 points | 33 commentspage 2
AlexCoventry 1 day ago||
Wow, Gemini suggested a very similar experiment to me yesterday. Guess I know where it got the idea from, now. :-)
lostmsu 1 day ago||
Non-zero based chart makes it look like it was very successful.
krasikra 18 hours ago||
[dead]
aplomb1026 1 day ago||
[dead]
decker_dev 1 day ago||
[dead]
kubb 1 day ago||
[flagged]
tomhow 2 hours ago||
Please don't fulminate or post snarky, shallow dismissals on HN. The guidelines make it clear we're trying for something better here. https://news.ycombinator.com/newsguidelines.html
hustwindmaple 1 day ago|||
I suspect Ant is already doing this for Claude. Takes a sh*t ton of compute though.
mips_avatar 11 hours ago||
nanochat is super capable, the d34 (2.2b) variant is competitive with qwens of that size. Andrej is I assume building out the improvements in preparation for bigger training runs. We desperately need a truly open model, so i think this is incredibly important.
naomi_kynes 23 hours ago|
[dead]