News

Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
More improvements will be rolled out in the coming weeks, the company said. The price for Opus 4.1 remains the same as that ...
Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 ...
Anthropic has released Claude Opus 4.1, which is said to deliver better coding and agent performance with improved safety.
Explore Claude Opus 4.1, Anthropic’s groundbreaking new AI model with advanced coding, multilingual, and problem-solving capabilities. Opus AI ...
Anthropic claims the new AI model “improves Claude’s in-depth research and data analysis skills, especially around detail ...
Anthropic launched Claude Opus 4.1 today, an upgraded version of its flagship AI model that achieves 74.5% accuracy on ...
Anthropic's Claude Opus 4.1 achieves 74.5% on coding benchmarks, leading the AI market, but faces risk as nearly half its $3.1B API revenue depends on just two customers.
Anthropic retired its Claude 3 Sonnet model. Several days later, a post on X invited people to celebrate it: "if you're ...
Anthropic Claude 4.1, released today, is an upgrade to Anthropic's Claude Opus 4 model with key improvements. This was ...
xAI's Grok 4 has dominated Day 1 of Google's Kaggle Game Arena, a new chess tournament testing the strategic reasoning of top ...