Top
Best
New

Posted by programLyrique 21 hours ago

Leanstral 1.5: Proof abundance for all(mistral.ai)
334 points | 93 commentspage 2
strongly-typed 18 hours ago|
Lean is such a wonderful language. So hyped by these releases.
nullc 19 hours ago||
It would be nice if special purpose models provided a some diverse examples of exactly the input required to get its expected performance on a mix of problem types. Maybe also a document intended for LLMs to read that advises on prompt construction.

I've found that you can get wildly different quality results from these sorts of models due to seemingly insignificant differences in prompt construction. It would be much easier to guess at what it wants if I could just see some RL transcripts -- and so the model author is in a much better position to provide initial advice.

ChrisArchitect 17 hours ago||
[dupe] https://news.ycombinator.com/item?id=48738938
rtaylorgarlock 17 hours ago|
Have you ever been downvoted for calling 'dupe?' I once was downvoted after calling dupe on a link posted thrice. HN is an interesting place to hang out, that's for sure.
ChrisArchitect 6 hours ago||
oh of course, unfortunately people's self-interest in pushing their own submissions and not being told they missed a story causes adverse reactions, but a bit of sacrifice worth keeping the site fresh and discussions focused.
yashthakker 17 hours ago||
[flagged]
TokenLens 6 hours ago||
[dead]
moonset 19 hours ago||
I gave Codex with GPT-5.5 High this prompt:

    Identify bugs in [datrs/varinteger](https://github.com/datrs/varinteger) . Do NOT look at the GitHub issues, just inspect the source
It also found the bug that Leanstral 1.5 found and the authors highlighted. I think this bug wasn't especially tricky; it's just a case of too few eyeballs on this repo.

Congrats on the release regardless! Excited for the direction Lean + automated AI proofs are headed.

Disclosure: I work at OpenAI.

adev_ 17 hours ago||
> It also found the bug that Leanstral 1.5 found and the authors highlighted

This is a little bit like someone pointing the moon and you look at the finger.

The formal proof domain goes way beyond just finding bugs.

It has tons of usages in term of functional safety, protocol validation, cryptography, etc...

The fact Mistral tackle this kind of problem is both smart and not so surprising.

Smart because it is niche enough that they do not front face the big competitors (yet).

No so surprising because the French labs have a well known and long time expertise with formal proof tools (Coq and all its Ocaml associated tools). It has been historically mainly pushed by the aerospace and train industries (Airbus, Dassault, Alsthom).

ilc 17 hours ago|||
Given that they directly compare to GPT-5.5 in their documentation. This comes off as puppy kicking to me. They state it is not SOTA, even IN its domain!

Honestly: Think twice before dragging your firm into what you say.

Disclaimer: I speak for myself. Not any firm I am associated with.

moonset 2 hours ago|||
Sorry, I recognize that these are different classes of model and didn't mean to punch down. I'm genuinely excited for the work Mistral is doing in this space!
noperator 17 hours ago|||
Leanstral 1.5 has 6B active parameters. How many parameters does GPT-5.5 have?
lioeters 16 hours ago||
We can run Leanstral locally on commodity/consumer hardware. Nuff said.
InsideOutSanta 13 hours ago|||
GPT-5.5 is what, a trillion+ parameter model? I think the insight here is that you can do this with a tiny model.
8note 18 hours ago||
the mechanism is whats interesting, rather than whether it could do it.

this sounds like a great tool to add to the toolbelt, as part of the "how do we handle all the code output from LLMs" problem

zuzululu 17 hours ago|
I applaud mistral's efforts but reading this release made me realize that Europe is far far behind and that once the gap is solidified I don't think its recoverable in the same way Canada's brain drain had on its economy

The best and the brightest from Europe have no incentive to build in Europe when they can do it in America and be compensated and treated far better

bjelkeman-again 15 hours ago||
At this point I wouldn’t move to USA if you paid me double the salary. There are more things in life than money.

That said, if (or when) the progress of the LLMs flatten out, then I think even Europe can catch up in a few years. If they don’t, and that seems unlikely to me, if the required compute needs to increase at the rate it does today, then I am not sure any of us can predict where society ends up.

InsideOutSanta 13 hours ago|||
I think there is a non-zero chance that Europe stumbled into an optimal scenario where they avoided all the losses incurred by US companies, but still benefit from the research. Having said that, doing so would require moving at the exact right time.
9dev 11 hours ago||
I wouldn't underestimate the capability of European companies to build rock-solid industries on established science. We may not be the pioneers wedging new product categories into the market, but there are millions of highly qualified people here, working hard every day.

For example, Mercedes autonomous driving team is moving ahead at glacial speed, but the system they have so far is excellent and reliable. I'd prefer that over the sad joke Tesla is promoting any day.

zuzululu 12 hours ago|||
Sure some people like to make less money and pay more taxes for ideological reasons. I respect that.
tensor 10 hours ago||
I like to make less money and pay more taxes for higher quality of life reasons. Sure I could go to America and have more many in my pocket by eating mcdonalds every day, but I'd rather eat amazing high quality food and have less money in my pocket.
zuzululu 1 hour ago||
what country is that ? I'm just curious as most European coworkers I have spoken to have not so nice things to say about EU and their country.
InsideOutSanta 13 hours ago|||
Fortunately for Europe, the US is doing its best to make itself both an undesirable and unavailable immigration target.
pbkompasz 11 hours ago|||
treated far better != earn more money
vrganj 6 hours ago|||
As somebody who moved to the US and then back to Europe again, you may get more money there, but you'll pay for it with a much worse environment and lifestyle.

Life's not all about the number on your bank account. Once you're past a certain level (which a competent engineer can easily reach in Europe as well), the marginal utility of money diminished quite quickly.

accurrent 14 hours ago||
[flagged]