A look at the state of AI agents, the evolution of thinking models, the staggering need for inference compute in the coming years, automated research, and more
— Dr. Vannevar Bush, As We May Think, 1945 — If we consider life to be a sort of open-ended MMO, the game server has just received a major update.
Analysis: Claude Code currently authors 4% of all public GitHub commits and is on track to cross 20% of all daily commits by the end of 2026
Anthropic debuts Cowork for Claude, built on Claude Code, for automating complex tasks with minimal prompting, as a research preview for Claude Max subscribers
ZDNET's key takeaways: Anthropic is launching Cowork for Claude as a research preview. It's built upon Claude Code and can automate complex tasks.
How AI can work across scales, from individuals to organizations to economies, like steel and the steam engine before it, as AI arrives as “infinite minds”
companies with over 1 million people. Running an organization will start to feel like vibe coding. Sarah Guo / @saranormous : an inspired view of the organizations to come @alth0u ...
Anthropic says Opus 4.5 outscored all humans on a take-home exam it gives to prospective performance engineering candidates, within a prescribed two-hour limit
Michael Nuñez / VentureBeat :
Anthropic launches Claude Opus 4.5, saying it is “the best model in the world for coding, agents, and computer use” and “meaningfully better at everyday tasks”
Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient …
Anthropic details three infrastructure bugs that intermittently degraded Claude's responses between August and early September, and explains how it fixed them
This is a technical report on three bugs that intermittently degraded responses from Claude. Below we explain what happened …
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...