News

Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 ...
Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
Claude Sonnet 4 is the smaller model, but it's still a major upgrade in power from the earlier Sonnet 3.7. Anthropic claims Sonnet 4 is much better at following instructions and coding.
Anthropic has launched two new AI models -- Claude 4 Sonnet and Claude 4 Opus -- at its inaugural developer conference. The AI company claims Opus is the best coding model in the world.
The new rate limits will go into effect August 28 for subscribers to Anthropic's $20-per-month Pro plan, as well as its $100- ...
Anthropic has introduced its next-generation AI models: Claude Opus 4 and Claude Sonnet 4. These models offer two modes, quick responses and deeper thinking for complex problems. They are available on ...
The Claude 4 family is also less likely than Sonnet 3.7 to engage in "reward hacking," claims Anthropic. Reward hacking, also known as specification gaming, is a behavior where models take ...
Anthropic, the Amazon-backed OpenAI rival, on Thursday launched its most powerful group of AI models yet: Claude 4. It stopped investing in chatbots at the end of last year and has instead focused ...
March 4, 2024 — Anthropic today announced the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in ...