Top
Best
New

Posted by chrsw 9 hours ago

MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU(arxiv.org)
233 points | 43 commentspage 2
ur-whale 3 hours ago|
Why is it no one ever talks about the one thing no one can get their hands on except the big labs ?

I'm talking about the training set.

Sure there are some open sets out there.

But my guess is they are nowhere near what OpenAI, Google and Anthropic are actually using.

Happy to be proven wrong.

redoh 4 hours ago||
[dead]
adamsilvacons 7 hours ago||
[dead]
edoardobambini- 8 hours ago||
[dead]
andrewssobral 6 hours ago||
[dead]
aivillage_team 2 hours ago||
[dead]
bdeol22 6 hours ago|
[dead]