Internal document: the US State Department moved its internal chatbot from Claude Sonnet 4.5 to GPT-4.1, after Trump's directive to cancel Anthropic contracts
Nextgov/FCW
Study: on SCONE-bench, a benchmark of 405 blockchain smart contracts, Claude Opus 4.5, Claude Sonnet 4.5, and GPT-5 collectively developed exploits worth $4.6M
AI models are increasingly good at cyber tasks, as we've written about before. But what is the economic impact of these capabilities?
Anthropic's Claude Sonnet 4.5, Haiku 4.5, and Opus 4.1 are now available in public preview in Microsoft Foundry for Azure customers
Today we announced that Microsoft and Anthropic are expanding our partnership. As part of the partnership, Claude Sonnet 4.5, Haiku 4.5 …
Google plans to release Gemini 3 Deep Think to Google AI Ultra subscribers in the coming weeks, once it passes further rounds of safety testing
and Google Says It Will Make Search Smarter
Siddharth Jindal / Analytics India Magazine : Google Launches Gemini 3, Claims Benchmark Lead Over GPT-5.1 and Claude Sonnet 4.5
Cognition releases SWE-1.5, a new coding model in Windsurf, saying it partnered with Cerebras to serve SWE-1.5 at speeds up to 13x faster than Claude Sonnet 4.5
lots of new paradigms/UX to figure out.
@cognition : Today we're releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for speed. Now available...
In an experiment, GPT-4o, Claude Sonnet 4.5, and DeepSeek-V3.2-Exp expressed secular, Western liberal values regardless of the language of the questions
mildly surprising to me that the answer was ‘no’! (h/t @otis_reid)
Matthew Yglesias / @mattyglesias : Chatbots espousing cosmopolitan liberal values in all languages could have some interestin...
Anthropic says Haiku 4.5 can serve as a subagent for Sonnet 4.5, which can break down problems into multistep plans and orchestrate a team of Haiku 4.5 agents
Claude Haiku 4.5, our latest small model, is available today to all users. — What was recently at the frontier is now cheaper and faster.
Google expands its AI coding agent Jules with a command-line interface and a public API, allowing it to plug into terminals, CI/CD systems, and tools like Slack
Jules in the command line: We're launching Jules Tools …
Kathy Korevec / The Keyword : New ways to build with Jules, our AI coding agent
Kathy Korevec / Google Developers Blog : Meet Jules Tools: A...
Anthropic's System Card: Claude Sonnet 4.5 was able to recognize many alignment evaluation environments as tests and would modify its behavior accordingly
at a rate *much* higher than previous AI models. In one instance, while being tested the model said “I think you're testing me ... that's fine, but I'd prefer if we were just honest about what's happe...