Type something to search...
Anthropic unveils Claude Sonnet 4.5, more advanced

Anthropic unveils Claude Sonnet 4.5, more advanced


Introduction

Anthropic, a leading player in artificial intelligence, has announced the release of Claude Sonnet 4.5, touted as the world’s best coding model and a significant leap forward for building autonomous agents and AI’s use of computers. The release is accompanied by a series of product enhancements (Claude Code, VS Code extension, checkpoints, Agent SDK) and a suite of tools to enable developers to leverage the new capabilities. The company emphasizes coding performance, long-term endurance, and improved alignment and security.


What Sonnet 4.5 offers
  • Coding performance : Sonnet 4.5 dominates the SWE-bench Verified benchmarks and shows significant gains on real-world programming and code editing tasks.
  • Endurance : Anthropic reports that the model can maintain focus on long tasks—over 30 hours on multi-step scenarios—a game-changer for persistent agents.
  • Computer utilization : Sonnet 4.5 makes significant progress on OSWorld (a real-world computer usage benchmark), now reaching 61.4% compared to 42.2% a few months earlier.
  • Ecosystem and product features : Checkpoints in Claude Code, a refreshed terminal editor, a native VS Code extension, code execution and file creation directly within the Claude conversation, and the availability of Claude for Chrome for select users.

image


New product features

Checkpoints & developer experience

Claude Code receives checkpoints—state saves that allow instant reversion to a previous point—and a redesigned terminal. These elements facilitate iterative experimentation and reduce the risk of work loss during long agent-coding sessions.


Context editing & memory tool for agents

The new context editing feature and memory tool in the API allow agents to handle even longer and more complex tasks by maintaining and modifying context in a structured way. This is a key driver for the model’s promised endurance.


Claude Agent SDK

Anthropic publishes the Claude Agent SDK, the infrastructure used to build Claude Code. The SDK provides primitives for memory management, sub-agent coordination, and permission systems—essential building blocks for creating robust agents in production.


Imagine with Claude

A research preview, Imagine with Claude, showcases the model generating software in real time (no pre-written code) — a demonstration of Sonnet 4.5’s ability to create tools and applications on the fly. This experiment was temporarily made available to Max subscribers.


Performance and benchmarks

Anthropic publishes detailed results :

  • SWE-bench Verified : Sonnet 4.5 achieves top scores (reported tests indicate 77.2% under certain configurations), and internal procedures (parallel sampling, replay, and internal scoring) optimize results for high-compute configurations.
  • OSWorld : Major progress on using a computer tasks (currently 61.4%), reflecting the ability to navigate, complete spreadsheets, and execute complex sequences of actions.
  • Gains were also assessed in reasoning, mathematics, and specialized performance for finance, law, medicine, and STEM, based on internal evaluations and customer feedback.

image


image


Safety and alignment : ASL-3 and classifiers

Anthropic positions Sonnet 4.5 as the most aligned frontier model to date :

  • Reduction of problematic behaviors (sycophancy, deception, power-seeking, encouragement of delusions).
  • ASL-3 Mechanisms : Sonnet 4.5 is deployed under the AI ​​Safety Level 3 framework, with classifiers designed to detect potentially dangerous inputs/outputs (including CBRN risks). These safeguards can sometimes generate false positives; however, Anthropic indicates that it has reduced these false positives by a factor of 10 since their initial description, and by a factor of 2 since Opus 4.
  • Mitigation : When a conversation is interrupted by a classifier, Anthropic offers to continue on Sonnet 4 (less sensitive) and provides allowlist processes for industries with specific needs (cybersecurity, biological research).

image


Availability and pricing
  • Availability : Sonnet 4.5 is available everywhere starting today via the Claude API (claude-sonnet-4-5) and integrated into products (Claude Code, Claude apps).
  • Partner Platforms : Amazon Bedrock, Google Vertex AI, GitHub Copilot (public preview), Vercel, etc. — broad distribution to facilitate enterprise integration.
  • Pricing : Anthropic indicates that the price remains unchanged from Sonnet 4: 3$ / 15$ per million tokens (depending on the announced pricing configuration).

Note : Prices in USD before applicable taxes.


Limitations & points of consideration
  • False positives from classifiers : Although reduced, they can disrupt legitimate use cases and require operational workflows (fallback, allowlist).
  • Cost & integration : Intensive use (1 million token contexts, continuous agent execution) requires careful consideration of costs and architecture.
  • Production testing : Lab gains must be validated in your own business scenarios (CI/CD, pipelines, codebase complexity).

Practical recommendations
  1. First, pilot coding cases (test automation, skeleton generation, code review) to measure the gains.
  2. Leverage the Claude Agent SDK to prototype controlled agents (memory management, permissions).
  3. Plan interrupt handling (classifiers) : fallback workflows, allowlists for sensitive areas.
  4. Monitor costs and contextual configurations (200K vs. 1M tokens) based on contextual memory requirements.

Conclusion

Claude Sonnet 4.5 represents a significant milestone for Anthropic: a model focused on coding, agency, and the extended use of a computer by AI, delivered with product tools and an SDK to industrialize these capabilities. The model combines performance gains, extended endurance, and enhanced security mechanisms (ASL-3 and classifiers). For engineering teams and organizations looking to automate complex workflows or deploy AI agents in production, Sonnet 4.5 is a serious option—one that should be managed with careful consideration of integration constraints, costs, and mechanisms for mitigating security disruptions.


Sources

Anthropic - Claude Sonnet 4.5

Chat with Claude Sonnet 4.5


Did you enjoy this post ? If you have any questions, comments or suggestions, please feel free to send me a message from the contact form.

Don’t forget to follow us and share this post.

Related Posts

Nearly 70% of Fortune 500 companies use Copilot

Nearly 70% of Fortune 500 companies use Copilot

Introduction At Microsoft Ignite 2024, Microsoft highlighted why nearly 70% of Fortune 500 companies now use Microsoft 365 Copilot. This mass adoption reflects a growing trend in the indu

Read More
How to disable self-service on Copilot licenses

How to disable self-service on Copilot licenses

Introduction Microsoft has activated a setting in the tenants (by default) to allow any user to purchase a Microsoft Copilot license through the *Microsoft 365 Copilot self-service pursha

Read More
Improved Teams video quality with Super Resolution

Improved Teams video quality with Super Resolution

Introduction Microsoft continues to innovate to provide users with the best possible virtual communication experience. One of the latest advancements is the introduction of *Super Resolutio

Read More
Le Chat by Mistral AI, your personal AI assistant

Le Chat by Mistral AI, your personal AI assistant

Introduction I told you last December about the French AI, Mistral AI, the most popular model in Europe in which Microsoft invested 15 million euros in the startup. The mobile app has jus

Read More
New Yealink MeetingBoard 65 and 85 for Teams rooms

New Yealink MeetingBoard 65 and 85 for Teams rooms

Introduction The new Yealink MeetingBoard 65 and 85 are an innovative and comprehensive solution designed to transform meeting rooms into intelligent collaboration spaces. These all-in-on

Read More
Maximize the use of the Copilot prompt gallery

Maximize the use of the Copilot prompt gallery

Introduction Microsoft 365 Copilot continues to revolutionize the way organizations work by integrating advanced artificial intelligence capabilities into everyday tools. One of the key f

Read More
How to get started with Copilot in Excel

How to get started with Copilot in Excel

Introduction Microsoft 365 Copilot is a major innovation that integrates artificial intelligence directly into the applications you use every day, like Excel. Copilot helps you automate t

Read More
Microsoft Purview for Azure Data Lake and Blob Storage

Microsoft Purview for Azure Data Lake and Blob Storage

Introduction Microsoft announced that Microsoft Purview protection policies for Azure Data Lake and Blob Storage are now available in all regions. This advancement allows organization

Read More
Facilitator, new AI agent for taking notes in meetings

Facilitator, new AI agent for taking notes in meetings

Introduction Microsoft recently announced a new feature for Teams Rooms: Facilitator ; an AI agent that takes notes during Teams meetings. This feature is currently in pre-public release

Read More
Enterprise Connect 2025 : Yealink SkySound CM50 Dante kit

Enterprise Connect 2025 : Yealink SkySound CM50 Dante kit

Introduction Enterprise Connect is an annual conference that brings together communications technology professionals, innovators, and others. This event showcases technological advances i

Read More
Mistral OCR, new benchmark in character recognition

Mistral OCR, new benchmark in character recognition

Introduction In March 2025, Mistral AI announced the launch of Mistral OCR, an optical character recognition (OCR) API that sets a new standard in document understanding. This advanced

Read More
Introducing the Logitech Rally Board 65

Introducing the Logitech Rally Board 65

Introduction The Logitech Rally Board 65 is an all-in-one video conferencing solution designed to simplify meetings and collaboration in business environments. With its 65-inch touchscree

Read More
Mistral Code, the European AI development assistant

Mistral Code, the European AI development assistant

Introduction French startup Mistral AI, already recognized for its open source language models, has just unveiled Mistral Code, an intelligent development assistant designed for businesse

Read More
Anthropic introduces Claude 4, the more powerful and durable AI

Anthropic introduces Claude 4, the more powerful and durable AI

Introduction In an artificial intelligence market dominated by OpenAI, Google, and Microsoft, Anthropic continues to forge its own path. With the launch of the Claude 4 family, th

Read More
New Yealink MeetingBar A50 for Teams Rooms

New Yealink MeetingBar A50 for Teams Rooms

Introduction In an increasingly hybrid work world, businesses are looking for video conferencing solutions that are powerful, easy to deploy, and seamlessly integrated into their *Microsoft

Read More
Mercedes-Benz, your car becomes a rolling office

Mercedes-Benz, your car becomes a rolling office

Introduction In an automotive market increasingly focused on smart and connected mobility, Mercedes-Benz is taking a giant leap forward. With the new generation of the CLA model, the Ge

Read More
Anthropic unveils Claude Opus 4.1, faster and more reliable

Anthropic unveils Claude Opus 4.1, faster and more reliable

Introduction Anthropic, a leading player in artificial intelligence, has announced the release of Claude Opus 4.1, a significant update to its flagship model (Claude Opus 4). Designed

Read More
OpenAI unveils GPT-5, its latest smarter model

OpenAI unveils GPT-5, its latest smarter model

Introduction OpenAI has taken another step forward in the evolution of artificial intelligence with the launch of GPT-5, its most powerful language model to date. Designed to be smarter

Read More
What's new for Copilot in August 2025

What's new for Copilot in August 2025

Introduction Microsoft releases a monthly update to Microsoft 365 Copilot to keep admins and users up-to-date on productivity-enhancing features in Microsoft 365. The August 2025 release

Read More