Posted by vednig 2 days ago
As long as these models require a lot of computing power, the best models open source or not will be served by corporations who can afford the infra.
That’s really the only thing stopping people from training or running these models at home:
Got a bit more than 1B tokens for $10, it's exceptionally fast, it was able to fix/implement things that 5.5 xhigh struggled with, without trying to act like my best friend or do that coy "undersell the ideal end result so that it can later overshoot it and claim a great success" bullshit.
E: miss me with the "but China" BS, everything I've experienced while using this model has convinced me they are earnestly more concerned with doing the right thing than Anthropic could ever pretend to be. And if you want to ask it questions about Mao, you can go download the weights and spend mid-five-figures to fine tune that out.