Anthropic unveils Claude Sonnet 4.5, more advanced

Maxime Hiez
Anthropic
30 Oct, 2025

Introduction

Anthropic, a leading player in artificial intelligence, has announced the release of Claude Sonnet 4.5, touted as the world’s best coding model and a significant leap forward for building autonomous agents and AI’s use of computers. The release is accompanied by a series of product enhancements (Claude Code, VS Code extension, checkpoints, Agent SDK) and a suite of tools to enable developers to leverage the new capabilities. The company emphasizes coding performance, long-term endurance, and improved alignment and security.

What Sonnet 4.5 offers

Coding performance : Sonnet 4.5 dominates the SWE-bench Verified benchmarks and shows significant gains on real-world programming and code editing tasks.
Endurance : Anthropic reports that the model can maintain focus on long tasks—over 30 hours on multi-step scenarios—a game-changer for persistent agents.
Computer utilization : Sonnet 4.5 makes significant progress on OSWorld (a real-world computer usage benchmark), now reaching 61.4% compared to 42.2% a few months earlier.
Ecosystem and product features : Checkpoints in Claude Code, a refreshed terminal editor, a native VS Code extension, code execution and file creation directly within the Claude conversation, and the availability of Claude for Chrome for select users.

New product features

Checkpoints & developer experience

Claude Code receives checkpoints—state saves that allow instant reversion to a previous point—and a redesigned terminal. These elements facilitate iterative experimentation and reduce the risk of work loss during long agent-coding sessions.

Context editing & memory tool for agents

The new context editing feature and memory tool in the API allow agents to handle even longer and more complex tasks by maintaining and modifying context in a structured way. This is a key driver for the model’s promised endurance.

Claude Agent SDK

Anthropic publishes the Claude Agent SDK, the infrastructure used to build Claude Code. The SDK provides primitives for memory management, sub-agent coordination, and permission systems—essential building blocks for creating robust agents in production.

Imagine with Claude

A research preview, Imagine with Claude, showcases the model generating software in real time (no pre-written code) — a demonstration of Sonnet 4.5’s ability to create tools and applications on the fly. This experiment was temporarily made available to Max subscribers.

Performance and benchmarks

Anthropic publishes detailed results :

SWE-bench Verified : Sonnet 4.5 achieves top scores (reported tests indicate 77.2% under certain configurations), and internal procedures (parallel sampling, replay, and internal scoring) optimize results for high-compute configurations.
OSWorld : Major progress on using a computer tasks (currently 61.4%), reflecting the ability to navigate, complete spreadsheets, and execute complex sequences of actions.
Gains were also assessed in reasoning, mathematics, and specialized performance for finance, law, medicine, and STEM, based on internal evaluations and customer feedback.

Safety and alignment : ASL-3 and classifiers

Anthropic positions Sonnet 4.5 as the most aligned frontier model to date :

Reduction of problematic behaviors (sycophancy, deception, power-seeking, encouragement of delusions).
ASL-3 Mechanisms : Sonnet 4.5 is deployed under the AI Safety Level 3 framework, with classifiers designed to detect potentially dangerous inputs/outputs (including CBRN risks). These safeguards can sometimes generate false positives; however, Anthropic indicates that it has reduced these false positives by a factor of 10 since their initial description, and by a factor of 2 since Opus 4.
Mitigation : When a conversation is interrupted by a classifier, Anthropic offers to continue on Sonnet 4 (less sensitive) and provides allowlist processes for industries with specific needs (cybersecurity, biological research).

Availability and pricing

Availability : Sonnet 4.5 is available everywhere starting today via the Claude API (claude-sonnet-4-5) and integrated into products (Claude Code, Claude apps).
Partner Platforms : Amazon Bedrock, Google Vertex AI, GitHub Copilot (public preview), Vercel, etc. — broad distribution to facilitate enterprise integration.
Pricing : Anthropic indicates that the price remains unchanged from Sonnet 4: 3$ / 15$ per million tokens (depending on the announced pricing configuration).

Note : Prices in USD before applicable taxes.

Limitations & points of consideration

False positives from classifiers : Although reduced, they can disrupt legitimate use cases and require operational workflows (fallback, allowlist).
Cost & integration : Intensive use (1 million token contexts, continuous agent execution) requires careful consideration of costs and architecture.
Production testing : Lab gains must be validated in your own business scenarios (CI/CD, pipelines, codebase complexity).

Practical recommendations

First, pilot coding cases (test automation, skeleton generation, code review) to measure the gains.
Leverage the Claude Agent SDK to prototype controlled agents (memory management, permissions).
Plan interrupt handling (classifiers) : fallback workflows, allowlists for sensitive areas.
Monitor costs and contextual configurations (200K vs. 1M tokens) based on contextual memory requirements.

Conclusion

Claude Sonnet 4.5 represents a significant milestone for Anthropic: a model focused on coding, agency, and the extended use of a computer by AI, delivered with product tools and an SDK to industrialize these capabilities. The model combines performance gains, extended endurance, and enhanced security mechanisms (ASL-3 and classifiers). For engineering teams and organizations looking to automate complex workflows or deploy AI agents in production, Sonnet 4.5 is a serious option—one that should be managed with careful consideration of integration constraints, costs, and mechanisms for mitigating security disruptions.

Sources

Anthropic - Claude Sonnet 4.5

Chat with Claude Sonnet 4.5

Did you enjoy this post ? If you have any questions, comments or suggestions, please feel free to send me a message from the contact form.

Don’t forget to follow us and share this post.

Tags :

Nearly 70% of Fortune 500 companies use Copilot

Maxime Hiez
Copilot
20 Nov, 2024

Introduction At Microsoft Ignite 2024, Microsoft highlighted why nearly 70% of Fortune 500 companies now use Microsoft 365 Copilot. This mass adoption reflects a growing trend in the indu

How to disable self-service on Copilot licenses

Introduction Microsoft has activated a setting in the tenants (by default) to allow any user to purchase a Microsoft Copilot license through the *Microsoft 365 Copilot self-service pursha

Improved Teams video quality with Super Resolution

Maxime Hiez
Teams
06 Feb, 2025

Introduction Microsoft continues to innovate to provide users with the best possible virtual communication experience. One of the latest advancements is the introduction of *Super Resolutio

Le Chat by Mistral AI, your personal AI assistant

Maxime Hiez
Mistral AI
10 Feb, 2025

Introduction I told you last December about the French AI, Mistral AI, the most popular model in Europe in which Microsoft invested 15 million euros in the startup. The mobile app has jus

New Yealink MeetingBoard 65 and 85 for Teams rooms

Maxime Hiez
MTR
13 Feb, 2025

Introduction The new Yealink MeetingBoard 65 and 85 are an innovative and comprehensive solution designed to transform meeting rooms into intelligent collaboration spaces. These all-in-on

Maximize the use of the Copilot prompt gallery

Maxime Hiez
Copilot
19 Feb, 2025

Introduction Microsoft 365 Copilot continues to revolutionize the way organizations work by integrating advanced artificial intelligence capabilities into everyday tools. One of the key f

How to get started with Copilot in Excel

Maxime Hiez
Copilot
20 Feb, 2025

Introduction Microsoft 365 Copilot is a major innovation that integrates artificial intelligence directly into the applications you use every day, like Excel. Copilot helps you automate t

Microsoft Purview for Azure Data Lake and Blob Storage

Maxime Hiez
Purview
21 Feb, 2025

Introduction Microsoft announced that Microsoft Purview protection policies for Azure Data Lake and Blob Storage are now available in all regions. This advancement allows organization

Facilitator, new AI agent for taking notes in meetings

Maxime Hiez
MTR
08 Mar, 2025

Introduction Microsoft recently announced a new feature for Teams Rooms: Facilitator ; an AI agent that takes notes during Teams meetings. This feature is currently in pre-public release

Enterprise Connect 2025 : Yealink SkySound CM50 Dante kit

Maxime Hiez
MTR
20 Mar, 2025

Introduction Enterprise Connect is an annual conference that brings together communications technology professionals, innovators, and others. This event showcases technological advances i

Mistral OCR, new benchmark in character recognition

Maxime Hiez
Mistral AI
18 Apr, 2025

Introduction In March 2025, Mistral AI announced the launch of Mistral OCR, an optical character recognition (OCR) API that sets a new standard in document understanding. This advance

Introducing the Logitech Rally Board 65

Maxime Hiez
MTR
28 Apr, 2025

Introduction The Logitech Rally Board 65 is an all-in-one video conferencing solution designed to simplify meetings and collaboration in business environments. With its 65-inch touchscree

Mistral Code, the European AI development assistant

Maxime Hiez
Mistral AI
09 Jun, 2025

Introduction French startup Mistral AI, already recognized for its open source language models, has just unveiled Mistral Code, an intelligent development assistant designed for businesse

Anthropic introduces Claude 4, the more powerful and durable AI

Maxime Hiez
Anthropic
27 Jun, 2025

Introduction In an artificial intelligence market dominated by OpenAI, Google, and Microsoft, Anthropic continues to forge its own path. With the launch of the Claude 4 family, th

New Yealink MeetingBar A50 for Teams Rooms

Maxime Hiez
MTR
16 Jul, 2025

Introduction In an increasingly hybrid work world, businesses are looking for video conferencing solutions that are powerful, easy to deploy, and seamlessly integrated into their *Microsoft

Mercedes-Benz, your car becomes a rolling office

Maxime Hiez
Teams
21 Jul, 2025

Introduction In an automotive market increasingly focused on smart and connected mobility, Mercedes-Benz is taking a giant leap forward. With the new generation of the CLA model, the Ge

Anthropic unveils Claude Opus 4.1, faster and more reliable

Maxime Hiez
Anthropic
08 Aug, 2025

Introduction Anthropic, a leading player in artificial intelligence, has announced the release of Claude Opus 4.1, a significant update to its flagship model (Claude Opus 4). Designed

OpenAI unveils GPT-5, its latest smarter model

Maxime Hiez
OpenAI
11 Aug, 2025

Introduction OpenAI has taken another step forward in the evolution of artificial intelligence with the launch of GPT-5, its most powerful language model to date. Designed to be smarter

What's new for Copilot in August 2025

Maxime Hiez
Copilot
03 Sep, 2025

Introduction Microsoft releases a monthly update to Microsoft 365 Copilot to keep admins and users up-to-date on productivity-enhancing features in Microsoft 365. The August 2025 release

How to enable DSPM for AI with Purview

Introduction With the rise of generative AI models, the phenomenon of Shadow AI (the use of artificial intelligence tools and services not approved or controlled by organizations) is incr

How to add a disclaimer in Copilot

Introduction Microsoft has enabled a setting in tenants that allows administrators to display the Microsoft 365 Copilot disclaimer in bold, and to attach a shortcut pointing to a usage po

Mistral Voxtral Transcribe2, real-time transcription

Maxime Hiez
Mistral AI
05 Feb, 2026

Introduction Mistral AI has just unveiled Voxtral Transcribe 2, its second generation of speech transcription models with cutting-edge transcription quality, ultra-low latency and advan

How to enable DLP for AI websites with Purview

Introduction Last week, I showed you how to enable DLP to prevent printing of financial data using Microsoft Purview, in order to prevent accidental or malicious data leaks (*Data Loss

Anthropic unveils Claude Opus 4.6, a benchmark for finance

Maxime Hiez
Anthropic
13 Feb, 2026

Introduction Artificial intelligence is rapidly growing in the finance industry, but one reality remains : real-world financial analyses are rarely clean, linear, or perfectly defined. They

How to enable Claude AI as a model in Copilot

Introduction Since its launch, Microsoft 365 Copilot has established itself as a cornerstone of enhanced enterprise productivity, leveraging advanced AI models to reason, analyze, and aut