Type something to search...
Mistral OCR 4, an OCR model for document analysis

Mistral OCR 4, an OCR model for document analysis


Introduction

Mistral AI announced Mistral OCR 4 on June 23, 2026, its next-generation optical character recognition model. The model introduces bounding boxes, typed block classification, and confidence scores per word and per page. It supports 170 languages and can be deployed on-premises for environments with strict data privacy requirements.

Check the Mistral OCR 3 article HERE.


What changes with OCR 4

OCR 4 introduces three new capabilities compared to previous versions :

  • Bounding boxes : Precise text localization within the document for custom downstream processing.
  • Block classification : Automatic identification of each block type, titles, tables, equations, and signatures.
  • Confidence scores : Per-page and per-word scores to drive targeted human verification workflows.

Supported formats are PDF, DOC, PPT, and OpenDocument. On reference evaluations, OCR 4 sets new records : 85.20 on OlmOCRBench and 93.07 on OmniDocBench. A human preference evaluation conducted on over 600 documents across 12 languages reveals an average win rate of 72% against competing systems.

image


Integration modes

OCR 4 is accessible through two approaches :

  • Pure extraction : Direct access to bounding boxes, block types, and confidence scores for custom downstream logic.
  • Document AI : Adds JSON schema structuring, image annotation, and custom prompts for business use cases without development.

The model is also integrated with the Mistral Search Toolkit for structured document search and RAG pipelines.


Pricing and availability

OCR 4 is available under three pricing models :

  • API : 4$ / 1 000 pages
  • Batch API : –50% at 2$ / 1 000 pages
  • Document AI : 5$ / 1 000 pages

OCR 4 is available via :

  • Mistral API and Mistral Studio : Direct access as of June 23, 2026
  • Amazon SageMaker : Available on the Marketplace
  • Microsoft Foundry : Integration available
  • Snowflake : Parse Document integration in rollout
  • Self-hosting : Option available for environments with data privacy constraints

Use cases
  • Invoices, purchase orders, KYC : Structured extraction with targeted human validation powered by confidence scores and layout preservation.
  • Document RAG : Citation-ready, block-structured content to feed precise knowledge bases.
  • Agentic workflows : Automated form and invoice processing in end-to-end AI pipelines.
  • Enterprise search : Large-scale indexing of complex multilingual documents.

Customer testimonials

Several partners shared their results :

  • Anaqua : Ivan Mihailov states that Mistral OCR 4 is “roughly 4x faster per page than their incumbent provider”.
  • Rogo : Aidan Donohue reports reaching “equivalent accuracy at roughly 8x lower cost” compared to the parsers they benchmarked.

image


Why now ?

Mistral positions OCR 4 as an accelerator for AI adoption in document-heavy enterprise environments. As long as critical documents on paper and PDF remain unstructured, AI use cases (agents, analytics, automation) hit a wall at extraction. Bounding boxes and confidence scores make it possible to build more reliable pipelines, with human review reserved for low-confidence passages.


How to get started ?
  • Test in Mistral Studio (PDF/Image -> text/JSON) to validate quality on your own documents.
  • Prototype via the API in pure extraction mode ; enable the Batch API for high volumes (cost ÷2).
  • Explore Document AI for business use cases without development using JSON schema structuring.
  • Contact Mistral for self-hosting if your data privacy constraints require it.

Conclusion

With OCR 4, Mistral delivers a multilingual, multi-format OCR solution suited to enterprise document requirements. Bounding boxes, block classification, and confidence scores open the door to more precise, more controllable extraction pipelines adapted to human-in-the-loop validation workflows.


Sources

Mistral AI - OCR 4

Test Le Chat by Mistral AI


Did you enjoy this post ? If you have any questions, comments or suggestions, please feel free to send me a message from the contact form.

Don’t forget to follow us and share this post.

Related Posts

Mistral Large 24.11 transforms industries with cutting-edge AI

Mistral Large 24.11 transforms industries with cutting-edge AI

Introduction Microsoft recently announced the release of Mistral Large 24.11, an advanced language model (LLM) available in the Azure AI model catalog. This new version sets a new benchma

Read More
Mistral OCR, new benchmark in character recognition

Mistral OCR, new benchmark in character recognition

Introduction In March 2025, Mistral AI announced the launch of Mistral OCR, an optical character recognition (OCR) API that sets a new standard in document understanding. This advance

Read More
Anthropic introduces Claude 4, the more powerful and durable AI

Anthropic introduces Claude 4, the more powerful and durable AI

Introduction In an artificial intelligence market dominated by OpenAI, Google, and Microsoft, Anthropic continues to forge its own path. With the launch of the Claude 4 family, th

Read More
Mistral OCR 3, a precise, structured and affordable OCR

Mistral OCR 3, a precise, structured and affordable OCR

Introduction In December 2025, Mistral AI announced the launch of Mistral OCR version 3, an Optical Character Recognition (OCR) API that sets a new standard for document understanding

Read More
Mistral Voxtral Transcribe2, real-time transcription

Mistral Voxtral Transcribe2, real-time transcription

Introduction Mistral AI has just unveiled Voxtral Transcribe 2, its second generation of speech transcription models with cutting-edge transcription quality, ultra-low latency and advan

Read More
How to download Cisco Webex recorded calls via API

How to download Cisco Webex recorded calls via API

Introduction Cisco Webex Contact Center offers advanced call recording capabilities, essential for quality, compliance, and continuous service improvement. Supervisors can easily listen t

Read More
Microsoft SharePoint celebrates 25 years of evolution

Microsoft SharePoint celebrates 25 years of evolution

Introduction Microsoft has announced the 25th anniversary of Microsoft SharePoint, a platform that now plays a central role in Microsoft 365. Since its initial release in 2001, SharePoi

Read More
Mistral Small 4, one model for reasoning, visioning, and coding

Mistral Small 4, one model for reasoning, visioning, and coding

Introduction Managing multiple specialized models within a single AI pipeline adds deployment complexity and multiplies infrastructure costs. Mistral AI announced on March 16, 2026 the la

Read More
Voice-native agents in Foundry in Public Preview

Voice-native agents in Foundry in Public Preview

Introduction Microsoft announced on March 16, 2026 the Public Preview of Voice Native Agents in Microsoft Azure AI Foundry, a native combination of the Voice Live API and the *Found

Read More
Anthropic unveils Claude Opus 4.7, with a new tokenizer

Anthropic unveils Claude Opus 4.7, with a new tokenizer

Introduction Anthropic announced on April 16, 2026 the general availability of Claude Opus 4.7, the direct successor to [Claude Opus 4.6](https://maxime.hiez.ca/en/blog/2026-02-13-ai-an

Read More
Anthropic releases Claude Opus 4.8

Anthropic releases Claude Opus 4.8

Introduction Anthropic made Claude Opus 4.8 generally available on May 28, 2026, as the direct successor to Claude Opus 4.7. Pricing remains unchanged at 5$ / 1M input tokens and 25$

Read More
Anthropic launches Claude Fable 5 and Claude Mythos 5

Anthropic launches Claude Fable 5 and Claude Mythos 5

Introduction Anthropic announces the launch of Claude Fable 5 and Claude Mythos 5, two models built on the same foundation, with capabilities surpassing those of any previously releas

Read More
Anthropic unveils Claude Sonnet 5, outperforming Opus 4.8

Anthropic unveils Claude Sonnet 5, outperforming Opus 4.8

Introduction Anthropic announced on June 30, 2026, the availability of Claude Sonnet 5, now the default model on the Free and Pro plans of Claude.ai. The model is designed for com

Read More