Posted by MallocVoidstar 22 hours ago
Card: https://deepmind.google/models/model-cards/gemini-3-1-pro/
A .1 model number increase seems reasonable for more than doubling the ARC-AGI 2 score and improving on so many other benchmarks.
What would you have named it?
Basically, what does the word "Preview" mean if newer releases happen before a Preview model is stable? In prior Google models, Preview meant that there'd still be updates and improvements to said model prior to full deployment, something we saw with 2.5. Now the designation has no meaning or reason to exist if they abandon a 3.0 that is still in Preview in favor of a newer model.
Gmail was in "beta" for 5 years.
That is why I'd prefer for them to finish the rollout of an existing model before starting work on a dedicated new version.
Wonder how GP feels about the minor bumps for other model providers?
For a stable deployment, Google needs enough hardware to guarantee inference, and having two Pro models running at once makes that even more challenging: https://ai.google.dev/gemini-api/docs/models
Useless.