Mistral OCR, a new benchmark in character recognition

Maxime Hiez
Mistral AI
18 Apr, 2025

Introduction

In March 2025, Mistral AI announced Mistral OCR, an optical character recognition (OCR) API that the company presents as a new standard in document understanding. The model processes and transcribes complex documents quickly and, in the benchmark results reported further down, more accurately than the other services it was compared with.

Mistral OCR key features

Complex document understanding

Mistral OCR excels at understanding complex document elements, including interleaved images, mathematical expressions, tables, and advanced layouts such as LaTeX formatting. The model enables in-depth understanding of rich documents such as scientific articles with graphs, equations, and figures.

Multilingual and multimodal

The model is natively multilingual and multimodal, meaning it can process documents in multiple languages and formats. It supports PDFs, images, and uploaded documents, and can extract structured content while preserving the document hierarchy and formatting.
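As a minimal sketch of how the two input kinds (PDF documents and images) might be distinguished in a request, the helper below builds the `document` part of an OCR request body from a URL. The field names (`document_url`, `image_url`) reflect my understanding of the Mistral API and should be checked against the official documentation; the helper name and the list of image extensions are hypothetical and purely illustrative.

```python
import os
from urllib.parse import urlparse

# Illustrative set of image extensions (an assumption, not an official list).
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp"}


def build_document_chunk(url: str) -> dict:
    """Return the 'document' field of an OCR request for the given URL.

    Images are sent as an 'image_url' chunk; everything else (for example
    a PDF) is sent as a 'document_url' chunk. Field names are assumed.
    """
    # Look only at the path so that query strings do not hide the extension.
    extension = os.path.splitext(urlparse(url).path)[1].lower()
    if extension in IMAGE_EXTENSIONS:
        return {"type": "image_url", "image_url": url}
    return {"type": "document_url", "document_url": url}


pdf_chunk = build_document_chunk("https://example.com/paper.pdf")
image_chunk = build_document_chunk("https://example.com/scan.PNG?page=1")
print(pdf_chunk["type"], image_chunk["type"])
```

Choosing the chunk type from the file extension is only a convenience for this sketch; a real integration would more likely rely on the content type of the file.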

Top-notch performance

In published benchmark tests, Mistral OCR outperforms other leading OCR models. In addition to text, it extracts the images embedded in a document. Results are returned in Markdown format for easy parsing and rendering.
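Because the output is Markdown, reassembling a full document is mostly a matter of concatenating the per-page results. The sketch below assumes a response shaped as a list of pages, each with an `index` and a `markdown` field; that shape and the sample data are assumptions made for illustration, not an excerpt from a real API response.

```python
def pages_to_markdown(ocr_response: dict) -> str:
    """Join the Markdown of every page, in page order, into one document."""
    pages = sorted(ocr_response.get("pages", []), key=lambda page: page.get("index", 0))
    return "\n\n".join(page.get("markdown", "").strip() for page in pages)


# Hypothetical response, with pages deliberately out of order.
sample_response = {
    "pages": [
        {"index": 1, "markdown": "## Results\n\n| a | b |\n|---|---|\n| 1 | 2 |"},
        {"index": 0, "markdown": "# Title\n\nIntro text."},
    ]
}

document_markdown = pages_to_markdown(sample_response)
print(document_markdown)
```

Sorting on the page index keeps the output stable even if pages arrive out of order, and the resulting string can be passed directly to any Markdown renderer.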

Mistral OCR highlights

Complex document understanding
Natively multilingual and multimodal
Best-in-class benchmark results
Fastest in its class
Structured and rapid output
Selectively available for self-hosting for organizations handling highly sensitive or classified information

Comparison with other OCR models

Mistral OCR stands out for its accuracy on complex documents. In the benchmark results below, it obtains the highest score in every category: overall, mathematical content, multilingual text, scanned documents, and tables.

Model	Overall	Math	Multilingual	Scanned	Tables
Google Document AI	83.42	80.29	86.42	92.77	78.16
Azure OCR	89.52	85.72	87.52	94.65	89.52
Gemini-1.5-Flash-00	90.23	89.11	86.76	94.87	90.48
Gemini-1.5-Pro-002	89.92	88.48	86.33	96.15	89.71
Gemini-2.0-Flash-00	88.69	84.18	85.80	95.11	91.46
GPT-4o-2024-11-20	89.77	87.55	86.00	94.58	91.70
Mistral OCR 2503	94.89	94.29	89.55	98.96	96.12
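To make the gaps in the table easier to read, the short script below computes, for each category, the best competing score and the margin by which Mistral OCR 2503 exceeds it. The figures and model names are copied from the table as they appear there; the variable and function names are illustrative.

```python
CATEGORIES = ["Overall", "Math", "Multilingual", "Scanned", "Tables"]

# Scores copied from the comparison table, one list per model.
SCORES = {
    "Google Document AI": [83.42, 80.29, 86.42, 92.77, 78.16],
    "Azure OCR": [89.52, 85.72, 87.52, 94.65, 89.52],
    "Gemini-1.5-Flash-00": [90.23, 89.11, 86.76, 94.87, 90.48],
    "Gemini-1.5-Pro-002": [89.92, 88.48, 86.33, 96.15, 89.71],
    "Gemini-2.0-Flash-00": [88.69, 84.18, 85.80, 95.11, 91.46],
    "GPT-4o-2024-11-20": [89.77, 87.55, 86.00, 94.58, 91.70],
    "Mistral OCR 2503": [94.89, 94.29, 89.55, 98.96, 96.12],
}

REFERENCE = "Mistral OCR 2503"


def margins_over_runner_up(scores: dict, reference: str) -> dict:
    """For each category, return (best competitor, margin of the reference model)."""
    result = {}
    for position, category in enumerate(CATEGORIES):
        competitors = {name: row[position] for name, row in scores.items() if name != reference}
        runner_up = max(competitors, key=competitors.get)
        margin = round(scores[reference][position] - competitors[runner_up], 2)
        result[category] = (runner_up, margin)
    return result


margins = margins_over_runner_up(SCORES, REFERENCE)
for category, (runner_up, margin) in margins.items():
    print(f"{category}: +{margin:.2f} points over {runner_up}")
```

The margin is positive in all five categories, with the smallest gap on multilingual text and the largest on mathematical content.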

Using Mistral OCR

Mistral OCR is available through the API under the model name mistral-ocr-latest. It is priced at 1,000 pages per dollar, and batch processing yields approximately twice as many pages per dollar. The API is available today on la Plateforme, Mistral AI's developer platform.
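The sketch below shows one way a call and a cost estimate might look using only the Python standard library. The endpoint URL, the request body shape, and the `MISTRAL_API_KEY` environment variable are assumptions based on my understanding of the Mistral API and should be verified against the official documentation; the batch rate of 2,000 pages per dollar is a rounding of "approximately twice". The function names are hypothetical.

```python
import json
import os
import urllib.request

OCR_ENDPOINT = "https://api.mistral.ai/v1/ocr"  # assumed endpoint
PAGES_PER_DOLLAR = 1000        # rate stated in the article
BATCH_PAGES_PER_DOLLAR = 2000  # "approximately twice", rounded (assumption)


def estimate_cost(pages: int, batch: bool = False) -> float:
    """Estimate the cost in dollars of processing a number of pages."""
    rate = BATCH_PAGES_PER_DOLLAR if batch else PAGES_PER_DOLLAR
    return pages / rate


def build_ocr_request(document_url: str, api_key: str) -> urllib.request.Request:
    """Prepare (without sending) a POST request for a document available by URL."""
    body = {
        "model": "mistral-ocr-latest",
        "document": {"type": "document_url", "document_url": document_url},
    }
    return urllib.request.Request(
        OCR_ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def run_ocr(document_url: str) -> dict:
    """Send the request and return the decoded JSON response (needs a valid key)."""
    request = build_ocr_request(document_url, os.environ["MISTRAL_API_KEY"])
    with urllib.request.urlopen(request) as response:
        return json.load(response)


print(estimate_cost(25_000))              # standard processing
print(estimate_cost(25_000, batch=True))  # batch processing
```

Separating request construction from sending keeps the example testable without network access; `run_ocr` is only called when an API key is actually available.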

Conclusion

Mistral OCR represents a significant advancement in optical character recognition, offering a new level of document understanding capabilities. With its accuracy, speed, and multilingual and multimodal versatility, Mistral OCR is ideal for organizations seeking to harness the potential of unstructured information.

Sources

Mistral AI - OCR

Test Le Chat by Mistral AI
