Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
OSWorld, a benchmark that tests how AI models perform on real-world computer tasks, scored Sonnet 4.5 at 61.4%, whereas Sonnet 4 scored 42.2% four months earlier. The Claude for Chrome extension, which is ...
Anthropic launches Claude 4.5, a powerful AI model that outperforms GPT-5 in coding, aiming to dominate the enterprise ...
Claude Sonnet 4.5 is here, and it's not only Anthropic's best coding model yet but also its safest AI system to date.