Top
Best
New

Posted by meetpateltech 4 hours ago

Mistral OCR 4(mistral.ai)
295 points | 77 comments
andrewmutz 2 hours ago|
A tangential observation: the video on the linked page wasn't what I expected. I thought Mistral was a european AI company, so I didnt expect the video to be filmed in San Francisco featuring three people who don't seem to be european.

I'm not against them being a global organization, that's wonderful. I was just surprised. I expected a parisian office and european accents.

rjzzleep 2 hours ago||
Unfortunately Europeans are terrible customers for making money. They ask a lot of questions and they're very stingy with their wallets. Americans on the other hand ...
touwer 1 hour ago|||
You're american?
throwa356262 30 minutes ago|||
Oh come on!

Mistral has a successful business model and is actually making money.

rsynnott 2 hours ago|||
~Any borderline-large European tech company will have an office on the US west coast, for sales if nothing else. And probably sales engineering. The timezone difference is eight to ten hours; there is really no way around it.

(I did work for one which had an office in Vancouver, instead; same tz.)

euio757 2 hours ago|||
Mistral just hired as CMO a Seattle based former Amazon/Google VP¹ , so seems their US based presence is growing.

¹ The one locally famous for being sued by Amazon for non compete back when non compete were a thing: https://www.geekwire.com/2020/amazon-sues-former-aws-marketi...

madduci 1 hour ago|||
And US users spend much more than their EU counterpart
flashfaffe2 2 hours ago|||
To the best of my knowledge, most of the founding team started their careers in the US ( meta,etc..) and their primary investors are US VCs. In that regard, they smartly benefit on both side : US funding and European brains
dominotw 1 hour ago||
There is even like an american flag flying high in the background
ericyd 14 minutes ago||
I’ve always thought the US Postal Service is such a technological marvel. They somehow manage to identify and route billions of pieces of mail and I have to imagine their tech is significantly more primitive than this. Not only that but US addresses are absurdly non-standardized, you can often write the same address multiple ways and have it deliver to the same location. I’m sure there’s plenty of published knowledge in this area, but whenever I see announcements about OCR it feels like this should be a solved problem if it’s been accomplished at the scale of USPS for many years.
alberth 10 minutes ago|
Great video by Tom Scott on this subject here:

https://www.youtube.com/watch?v=XxCha4Kez9c

beklein 21 minutes ago||
All AI labs really need to stop using truncated y-axes for benchmark bar charts...

https://mistral.ai/_astro/cm-engish_ZhlvoT.webp?dpl=6a3a94bd...

themanmaran 1 hour ago||
It's cheap at $4/1k, but I'm hesitant to even benchmark this one again since the previous versions were all "98% accurate based on internal benchmarks of 4 pdfs" and ended up falling short of almost everything else on the market [1].

Even in this one, they just report that OlmOCRBench and OmniDocBench have "known limitations" and that's why they report flagship numbers from their internal benchmark.

https://getomni.ai/blog/benchmarking-open-source-models-for-...

coulix 2 minutes ago|
True, same conclusion, but the few samples I tried showed some real improvements since dec 2025 version.
sreekanth850 1 hour ago||
Tested with Malayalam, normal handwriting got accurate but a slight different style got detected as kannada. Have samples if required, which sarvam got done with 99% accuracy leaving one text error.
civet_java 42 minutes ago|
I'm curious what's been your experience with Sarvam outside of Indic languages - Indian English (perhaps mixed with romanised indic verbiage) and also documents with complex layouts (figures, tables, etc).

I've been quite curious but hesitant about Indian offerings, particularly because they seem to be priced a little higher than what I would think they should be (I could be wrong and simply be misrembering though).

mdrzn 3 hours ago||
It'll be interesting to see how this ranks against https://github.com/baidu/Unlimited-OCR
cdnsteve 3 hours ago|
Right, just announced https://x.com/BaiduAI_News/status/2069322806748410291
trilogic 17 minutes ago||
Mistral keeps reminding us that doesn´t just brew great coffee they can build great AI too. Hats off to the team. Mistral O.C.R. (Only Cool Results)
bastawhiz 1 hour ago||
The comparisons rank it against GPT and Gemini but not Claude. Is Claude's vision support simply not competitive when it comes to OCR tasks?
abi 1 hour ago|
I think until Fable, Claude's vision was significantly worse than GPT and Gemini in my personal experience. I eval almost every vision model since I work on screenshot to code conversion project: https://github.com/abi/screenshot-to-code.
utopiah 3 hours ago||
"A note on out-of-scope use. OCR 4 is a document-understanding model, not a decision-maker. It is not intended for medical diagnosis, legal advice or judgment, high-stakes financial decisions, safety-critical systems, real-time/latency-sensitive processing, or non-document inputs (raw audio, video, etc.). "

Can't wait for the "oh so innovative" manager who will suggest during the next meeting "Ok... but what if WE used it for high-stakes financial decisions on non-document inputs like a photo from my phone?"

I guarantee you somebody on HN is going to comment about this "idea" next week.

weird-eye-issue 3 hours ago||
Why would anybody do that you would simply get terrible results compared to dozens of other more capable models. It's for converting to text not answering questions. Just seems like you need some sort of weird angle to bring out an anti AI stance
utopiah 49 seconds ago|||
Guess you haven't met management yet.
alex43578 2 hours ago|||
I think his comment is referring to a scenario where a decision is made on financial numbers that are misrecognized. E.g. 9.0% actual is OCR’d as 90%
leoc 2 hours ago||
“I delegated critical financial decisions to my OCR software, and you won’t believe what happened next.”
mcbetz 3 hours ago|
Little on differences other than bounding boxes and double the price compared to their previous OCR v3 model from December - https://mistral.ai/news/mistral-ocr-3/ - other benchmarks were used back then.
More comments...