_sholtodouglas

100% agree with his conclusions - Eric consistently predicts where the field is going.

2026-02-07 View on X

Evjang.com

A look at the state of AI agents, the evolution of thinking models, the staggering need for inference compute in the coming years, automated research, and more

— Dr. Vannevar Bush, As We May Think, 1945 — If we consider life to be a sort of open-ended MMO, the game server has just received a major update.

View original

2% to 4% in a month. Ludicrous. [image]

2026-02-07 View on X

SemiAnalysis

Analysis: Claude Code currently authors 4% of all public GitHub commits and is on track to cross 20% of all daily commits by the end of 2026

View original

Claude Code for all other knowledge work. Many of our best engineers no longer write code manually; they multiplex across multiple cc sessions. Soon this will be true for everything else.

2026-01-13 View on X

ZDNET

Anthropic debuts Cowork for Claude, built on Claude Code, for automating complex tasks with minimal prompting, as a research preview for Claude Max subscribers

ZDNET's key takeaways — Anthropic is launching Cowork for Claude as a research preview. — It's built upon Claude Code and can automate complex tasks.

View original

This is beautiful. “AI is steel for organizations.” We'll be able to build structures of dizzying complexity that would buckle with the technology of our time.

2025-12-23 View on X

@ivanhzhao

How AI can work across scales, from individuals to organizations to economies, like steel and the steam engine before it, as AI arrives as “infinite minds”

companies with over 1 million people. Running an organization will start to feel like vibe coding. Sarah Guo / @saranormous : an inspired view of the organizations to come @alth0u ...

View original

This was a truly eerie threshold for me

2025-11-25 View on X

VentureBeat

Anthropic says Opus 4.5 outscored all humans on a take-home exam it gives to prospective performance engineering candidates, within a prescribed two-hour limit

Michael Nuñez / VentureBeat

View original

This was a truly eerie threshold for me

2025-11-25 View on X

Anthropic

Anthropic launches Claude Opus 4.5, saying it is “the best model in the world for coding, agents, and computer use” and “meaningfully better at everyday tasks”

Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient …

View original

I'm so excited about this model. First off - the most important eval. Everyone at Anthropic has been posting stories of crazy bugs that Opus found, or incredible PRs that it nearly solo-d. A couple of our best engineers are hitting the ‘interventions only’ phase of coding.

2025-11-25 View on X

Anthropic

Anthropic launches Claude Opus 4.5, saying it is “the best model in the world for coding, agents, and computer use” and “meaningfully better at everyday tasks”

Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient …

View original

Dario's essays and long debate Slack threads are one of my favorite parts of Anthropic's culture. They're open, detailed - and incredibly raw. Everyone at the company ends up having a good sense of how the company is making decisions and what matters. It's the kind of thing that

2025-11-25 View on X

Anthropic

Anthropic launches Claude Opus 4.5, saying it is “the best model in the world for coding, agents, and computer use” and “meaningfully better at everyday tasks”

Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient …

View original

We're sorry - and we'll do better. We're working hard on making sure we never miss these kinds of regressions and on rebuilding your trust in us.

2025-09-18 View on X

Anthropic

Anthropic details three infrastructure bugs that intermittently degraded Claude's responses between August and early September, and explains how it fixed them

This is a technical report on three bugs that intermittently degraded responses from Claude. Below we explain what happened …

View original

A taste of what we've been thinking about recently :) Try it out! It's still a little raw, and we expect it to have sharp edges - but it represents incredible algorithmic progress on test-time compute. Also check out the thoughts - it's fun, and a little humanizing.

2024-12-20 View on X

TechCrunch

Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning

Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...

View original

I really like the thoughts in this problem, a cute example of out-of-the-box thinking. As models get stronger, taking them seriously will continue to be the right way to understand both the current gen - and what will be possible in even 3 months. [image]

2024-12-20 View on X

TechCrunch

Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning

Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...

View original