MMLU-Pro holds steady at 85.0, AIME 2025 slightly improves to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...
US startup Anthropic on Monday announced the launch of its new generative artificial intelligence model, Claude Sonnet 4.5, ...
Northwestern Medicine scientists have developed a comprehensive atlas of genetic coding sequences in both healthy adult ...
Claude Sonnet 4.5 is here, and it's not only Anthropic's best coding model yet but also its safest AI system to date.
Meta has released Code World Model (CWM), a 32-billion-parameter AI model for researchers that simulates code execution to ...
HyperionDev has announced the expansion of its coding and data bootcamps to meet national priorities on youth employment and ...