2025-12-02
Anthropic
8 related
Study: using the SCONE-bench benchmark of 405 blockchain smart contracts, Claude Opus 4.5, Claude Sonnet 4.5, and GPT-5 developed exploits together worth $4.6M
AI models are increasingly good at cyber tasks, as we've written about before. But what is the economic impact of these capabilities?
2025-11-19
Anthropic
8 related
Anthropic's Claude Sonnet 4.5, Haiku 4.5, and Opus 4.1 are now available in public preview in Microsoft Foundry for Azure customers
Today we announced that Microsoft and Anthropic are expanding our partnership. As part of the partnership, Claude Sonnet 4.5, Haiku 4.5 …
2025-10-22
The Argument
In an experiment, GPT-4o, Claude Sonnet 4.5, and DeepSeek-V3.2-Exp expressed secular, Western liberal values regardless of the language of the questions
mildly surprising to me that the answer was ‘no’! (h/t @otis_reid) [image] Matthew Yglesias / @mattyglesias : Chatbots espousing cosmopolitan liberal values in all languages could have some interestin...
2025-10-01
Transformer
2 related
Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly
at a rate *much* higher than previous AI models. In one instance, while being tested the model said “I think you're testing me ... that's fine, but I'd prefer if we were just honest about what's happe...
Loading articles...