/Tech1d ago

LlamaIndex co-founder Jerry Liu says Mistral OCR 4 scored 60.7 on ParseBench, trailing MinerU but beating AWS Textract

It costs $0.40 per page and lacks chart parsing.

343283219943.6K

#906

Original post

Jerry Liu@jerryjliu0#906inTech

We benchmarked Mistral OCR against other frontier and open-weight models on ParseBench 📊

For a model at its price point, it is quite competitive! - It wins on semantic formatting - understanding strikethroughs, superscripts/subscripts, title hierarchy, links - It is competitive on content faithfulness (reading order + hallucinations + omissions) and visual grounding (bounding boxes) - It does ok on tables and doesn't really have chart capabilities.

Of course, some of the frontier models + OCR providers like Azure Doc Intelligence + AWS Textract are a bit more expensive.

Check out our full leaderboard on ParseBench: https://www.parsebench.ai/

9:50 AM · Jun 24, 2026 · 29.3K Views

Sentiment

Users praise Mistral OCR for its competitive ParseBench results, semantic formatting, chart annotations, and high request limits because these deliver strong performance at low cost.

Pos

100.0%

Neg

0.0%

6 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

PARSEBENCH.AIVia

#906

Posts from X

Most Activity

VIEWS10.7KBOOKMARKS52LIKES114RETWEETS11REPLIES8

Jerry Liu@jerryjliu0

We've provided some updated results on Mistral OCR that make use of the annotation feature for charts.

The overall score is ahead of GPT-5.5 and just behind Gemini 3.1 Pro, which is quite impressive for a model at its price range.

It does a great job on content faithfulness, semantic formatting, and visual grounding. It does an ok job on tables and an ok job (though to be fair, non-zero) on charts.

There's been some great advancements on visual understanding capabilities lately. See screenshot below for performance, will update the main benchmark page soon: https://www.parsebench.ai/

Jerry Liu@jerryjliu0

We benchmarked Mistral OCR against other frontier and open-weight models on ParseBench 📊

Of course, some of the frontier models + OCR providers like Azure Doc Intelligence + AWS Textract are a bit more expensive.

Check out our full leaderboard on ParseBench: https://www.parsebench.ai/

1d10.7K11452

pandora@pandoraxtw

@jerryjliu0 Thanks a lot for evaluating the model! By the way, we provide an annotation feature for charts and other types of images - it should considerably boost the Charts score as the default model is optimized for OCR only 👀

Pricing is a bit different, 5$/1k instead of 4$.

1d1455

Jerry Liu@jerryjliu0

@pandoraxtw will take a look

1d105

Arcturus 🌥️@Arcturus_f

@jerryjliu0 Can you add https://x.com/Baidu_Inc/status/2069358973753729165 to the leaderboard please? (It's based on deepseek ocr)

1d50

Urjit@urjit_

@jerryjliu0 finally

1d1662

Ferbin@Ferbin08

@jerryjliu0 Good benchmark. Real-world OCR is mostly blurry faxes, handwritten margins, merged table cells. How does it do on those?

1d302

Rafal Potasz@nobodyrpot

@jerryjliu0 I like mistral ocr, havent tried v4 yet. I love their relatively high concurrent-requests limit. Especially if you're on the PAYG plan. Personally I like using it purely for the OCR step and then I use 3.1 Flash-Lite for the extraction 👀

1d291

Ankur A. Patel@aapatel09

Semantic formatting is where most OCR pipelines quietly fail in production. In lending docs - promissory notes, HELOCs, appraisals - a misread superscript on a rate or a missed strikethrough on a clause is a real liability. Curious how ParseBench handles tables with merged cells and footnotes, that's the hardest case most financial systems hit consistently.

1d661

Angelo D'Ambrosio@Bakaburg1

@jerryjliu0 Do you have a Pareto plot?

1d98

Sophia Yang, Ph.D.@sophiamyang

@fahdmirza Thanks for the video!

1d271

V0LYX@0xV0LYX

@jerryjliu0 interesting to see it trade blows with models that cost way more per page

the semantic formatting wins are the sleeper feature

1d83

Jerry Liu@jerryjliu0

@Arcturus_f yes we've evaluated! will post the notes soon

1d251

pandora@pandoraxtw

@jerryjliu0 Another thing that could help for tables, is to toggle html mode, we support different table formats.

1d16

Strata@ChainZenit

@jerryjliu0 the jump in performance at that price point is wild

1d10

GenXRewired@genxrewired

@jerryjliu0 @jerryjliu0 Price-point competitiveness on semantic formatting is the real story — most enterprise document workflows fail not on raw accuracy but on whether the output is structured enough to act on. Mistral is hitting the threshold that matters.

1d4