2025-12-12
OpenAI announces GPT 5.2 — openai.com/index/intro... Highlights: — “Most advanced model for professional work and long-running agents” — 70.9% on GDPval - wins/ties vs human experts across 44 occupations — SWE-Bench Pro: 55.6% (new SOTA)
OpenAI
OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost
OpenAI eyes January exit from “code red” John Werner / Forbes : The Wonder And The Promise Of GPT 5.2 Is Here Benj Edwards / Ars Technica : OpenAI releases GPT-5.2 after “code red”...
2025-10-16
Claude Haiku 4.5 is now out. — $1/$5 per million input/output tokens — Should replace sonnet 4 in performance — 4-5x faster than sonnet 4.5 — www.anthropic.com/news/claude...
Anthropic
Anthropic says Haiku 4.5 can serve as a subagent for Sonnet 4.5, which can break down problems into multistep plans and orchestrate a team of Haiku 4.5 agents
Claude Haiku 4.5, our latest small model, is available today to all users. — What was recently at the frontier is now cheaper and faster.
2025-07-18
Sama begins the livestream with “we have a banger for you today” — www.youtube.com/watch?v=1jn...
The Verge
OpenAI debuts ChatGPT Agent, which can control an entire computer and perform multi-step tasks, powered by a new dedicated model, rolling out to paid users
One employee uses it to automate his weekly parking requests at OpenAI's San Francisco office.
2025-07-15
Amazon is officially in the vibe coding game. This is significant because aws also has a lot of inference + compliance stuff, so they may be able to make big gains on the enterprise front. — kiro.dev/blog/introd...
GeekWire
Amazon launches Kiro, an IDE that aims to bridge the gap between rapidly vibe-coded prototypes and production systems with specs, testing, and documentation
A new agentic IDE that works alongside you from prototype to production — DE Tim Anderson / DEVCLASS : Hands on with Kiro, the AWS preview of an agentic AI IDE driven by specific...