Posted by T-A 1 day ago
The jokes write themselves.
No, actually, from the docs it sounds like it's mainly motivated by the country's unique linguistic requirements.
The Swiss have no GPUs.
I can run the 8B version of this swiss-ai model on a ten-year-old GPU. For the larger one, $2000 consumer hardware can run it fine. Beyond that, there are plenty of places where time on a GPU can be rented, and if the model is good, there will be hardware to run it.
My charitable reading of GP's point is that the bottleneck for true compute sovereignty is the chips, not the models.
There were a number of use cases where we needed to use Gemini (audio modality), and Ultra has been a VERY cost-effective alternative once we got through the nuances.