Posted by EvanZhouDev 1 day ago
https://microsoft.ai/pdf/MAI-Code-1-Flash-Model-Card.PDF
Launching seven new MAI models: https://microsoft.ai/news/building-a-hillclimbing-machine-la...
Here Microsoft is comparing against Claude Haiku, the smallest and least capable model from Anthropic.
Seriously tho, wtf is going on over at Meta? Anyone working there currently want to describe the vibe of the org when it comes to being a frontier company?
Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting.
They also did some more interesting work like showing very small models can be coherent as long as you have very simple children's book style training data (TinyStories is pretty famous).
Lots of these ideas are still used. Learning facts at scale with active reading is an ICLR 2026 paper from Meta AI that does a lot of similar work.
If you watch the Build keynote with Satya, you'll notice that the design of the slides changed to Serif typography and warmer colors when Mustafa/Microsoft AI segment came on which was completely different from the rest of the keynote. Now it makes sense!
Where does the Pascal case inspired variant come from? Is it a reference to something? Is it like "M$" was used back in the days?
This model might have a perfect speed:
for i in range(100):
print(random.choices(words))(gestures wildly while changing lanes in his Fiat 500)