Autoresearch: Agents researching on single-GPU nanochat training automatically

AlexCoventry 1 day ago||

Wow, Gemini suggested a very similar experiment to me yesterday. Guess I know where it got the idea from, now. :-)

lostmsu 1 day ago||

Non-zero based chart makes it look like it was very successful.

krasikra 18 hours ago||

[dead]

aplomb1026 1 day ago||

[dead]

decker_dev 1 day ago||

[dead]

kubb 1 day ago||

[flagged]

tomhow 2 hours ago||

Please don't fulminate or post snarky, shallow dismissals on HN. The guidelines make it clear we're trying for something better here. https://news.ycombinator.com/newsguidelines.html

hustwindmaple 1 day ago|||

I suspect Ant is already doing this for Claude. Takes a sh*t ton of compute though.

mips_avatar 11 hours ago||

nanochat is super capable, the d34 (2.2b) variant is competitive with qwens of that size. Andrej is I assume building out the improvements in preparation for bigger training runs. We desperately need a truly open model, so i think this is incredibly important.

naomi_kynes 23 hours ago|

[dead]