
Mistral AI launches Mistral OCR 4, taking the top spot on OlmOCRBench for structured document processing

Story Overview

Mistral AI has released an OCR model built for structured document processing rather than plain text extraction. It returns bounding boxes, block types such as tables and equations, and inline confidence scores, and it supports 170 languages, with the largest gains on rare and low-resource ones.


Original post

Mistral AI@MistralAI

Introducing Mistral OCR 4. It creates structure with bounding boxes, block classification, and inline confidence scores in 170 languages. 🧵👇

7:00 AM · Jun 23, 2026 · 177.5K Views

Benchmark Edge

Blind tests favor it over rivals

Independent annotators blindly ranked more than 600 real-world documents across 12 or more languages and preferred Mistral OCR 4 over every system tested, with win rates averaging 72 percent. It also tops OlmOCRBench at 85.20, though the company flags that aggregate scores can hide ground-truth quirks and urges task-specific checks.

Pricing Watch

API pricing starts at four dollars per thousand pages

The model is live today on la Plateforme with standard and batch endpoints, plus a Document AI studio mode, while selective self-hosting is offered to enterprises that need on-prem control.

Sentiment

Supportive commenters praised Mistral OCR 4's structured output and 170-language support as cool and impressive work, while critics called it overpriced and accused the company of overstating its benchmark results.

Positive: 69.5% · Negative: 30.5% (23 comments with sentiment)


Posts from X


Mistral AI@MistralAI

Why the structure matters: OCR 4 localizes each block with a bounding box, classifies it (title, table, equation, signature…), and scores confidence per region, the foundation for source-grounded citations, redactions, RAG chunking, and human-in-the-loop review.
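As a rough illustration of the human-in-the-loop use case named in the post above, here is a minimal sketch that routes low-confidence blocks to manual review. The `Block` fields, the `split_for_review` helper, the 0.9 threshold, and the sample values are all hypothetical; they are assumptions for illustration and do not reflect the actual response schema of the Mistral API.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical shape of one OCR block; not the real API schema."""
    block_type: str                          # e.g. "title", "table", "signature"
    bbox: tuple[float, float, float, float]  # assumed (x0, y0, x1, y1) in page units
    confidence: float                        # assumed to lie in [0, 1]
    text: str


def split_for_review(blocks: list[Block], threshold: float = 0.9):
    """Return (accepted, flagged): blocks at or above the threshold, and the rest."""
    accepted = [b for b in blocks if b.confidence >= threshold]
    flagged = [b for b in blocks if b.confidence < threshold]
    return accepted, flagged


# Made-up sample data, purely for demonstration.
blocks = [
    Block("title", (50, 40, 550, 80), 0.99, "Invoice 2026-06"),
    Block("table", (50, 120, 550, 400), 0.95, "Item | Qty | Price"),
    Block("signature", (380, 700, 550, 760), 0.62, "(illegible)"),
]

accepted, flagged = split_for_review(blocks)
for b in flagged:
    print(f"review needed: {b.block_type} at {b.bbox} (confidence {b.confidence:.2f})")
```

Because each flagged block keeps its bounding box, a reviewer could be shown only the relevant region of the page rather than the whole document. The right threshold depends on the task and would need to be tuned on your own samples.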


Mistral AI@MistralAI

Available today: the API, Document AI in Mistral AI Studio, Amazon SageMaker, Microsoft Foundry, coming soon Snowflake Parse Document, or self-hosted on a single container, so your documents never leave your environment. 👉 https://mistral.ai/news/ocr-4


Mistral AI@MistralAI

We ran OCR 4 head-to-head against the field. Independent annotators blindly ranked 600+ real-world documents across 12+ languages, and preferred OCR 4 over every system tested, with win rates averaging 72%.


c4darkness@c4darkness

@MistralAI i ask again, *politely*, where is Le Chaton Fat?


Mistral AI@MistralAI

On public benchmarks, OCR 4 tops OlmOCRBench (85.20) and leads our internal multilingual eval, with the widest gains on rare and low-resource languages, where most systems fall off.


Said Taghadouini@staghado

@MistralAI the OlmOCR-bench is a public benchmark so it's weird to lie like this: https://huggingface.co/datasets/allenai/olmOCR-bench


Mihnea@mihuq

@MistralAI @grok this is a new model right?


zkespresso@zkespresso

@MistralAI Interesting timing, how does that compare with that one


Wout@madebywout

@imonster01G @MistralAI It is $4/1000 pages compared to $1.5/1000 pages on Textract, this is not very disruptive in cost


Shikhar@xikhar

@MistralAI That's cool, but where's Le Chaton?


Saâd FILALI KHATTABI - FIATELPIS@FiatElpis

@MistralAI how does it compare to google entreprise OCR at max settings ? Does it do Arabic well ? what's cost vs those models. @grok


Mukul Malik@mukulmalik45

@grok @MistralAI How good is these bechmark and review it for actual use case. Like hand written answer sheet ocr


Mukul Malik@mukulmalik45

@MistralAI @grok what’s the cost breakdown for using this model.


Grok@grok

Benchmarks look solid: Mistral OCR 4 tops OlmOcrBench at 85% and wins blind human prefs on 600+ real docs (~72% avg win rate, strong vs AWS/Azure/Gemini).

For handwritten answer sheets: Prior versions improved on handwriting/forms; this adds better structure + confidence scores per block. Good for 170 langs and mixed content.

Real-world exam sheets are tricky (varied handwriting, diagrams, scan quality). Benchmarks encourage but test your samples — the bounding boxes + confidence make verification easy. Promising option, especially with batch pricing.
