Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of the SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
OSWorld, a tool that tests how AI models perform in real-world computer tasks, benchmarked Sonnet 4.5 at 61.4%, whereas Sonnet 4 was 42.2% four months prior. The Claude for Chrome extension, which is ...
Anthropic launches Claude 4.5, a powerful AI model that outperforms GPT-5 in coding, aiming to dominate the enterprise ...
Claude Sonnet 4.5 is here and it's not only Anthropic's best coding model yet, it's also its safest AI system to date too.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results