An analysis of 100T+ tokens from the past year shows reasoning models now represent over half of all usage, open-weight model use has grown steadily, and more
this is not a model I hear much about. [image] @openrouterai : We collaborated with @a16z to publish the **State of AI** - an empirical report on how LLMs have been used on OpenRouter. After analyzing...
Allen Institute for AI, or Ai2, unveils Olmo 3 models that it says outperform open models like Stanford's Marin and commercial open-weight models like Llama 3.1
Artifacts for the Olmo 3 release. … Note 🧱Base version of Olmo 3 32B. Michal Sutter / MarkTechPost : Allen Institute for AI (AI2) Introduces Olmo 3: An Open Source 7B and 32B LLM Family Built on the D...
OpenAI debuts GPT‑5-Codex, a version of GPT‑5 optimized for agentic coding in Codex and says it spends its “thinking” time more dynamically than previous models
If You Ask Nicely Frederic Lardinois / The New Stack : OpenAI Launches a New GPT-5 Model for Its Codex Coding Agent David Gewirtz / ZDNET : OpenAI has new agentic coding partner for you now: GPT-5-Cod...
GPT-5 hands-on: it exudes competence but doesn't feel like a dramatic leap ahead of other LLMs, and the pricing is aggressively competitive with other providers
And It Changes Everything Tyler Cowen / Marginal Revolution : GPT-5, a short and enthusiastic review GPT-5 : GPT-5 — Our hands-on review of OpenAI's newest model based on weeks of testing — The Ve...
OpenAI says GPT-5 is a unified system with an efficient model for most questions, a reasoning model for harder problems, and a router that decides which to use
All You Need To Know Lakshay Kumar / Business Today : What is GPT-5? How OpenAI is upgrading your ChatGPT experience Tsveta Ermenkova / PhoneArena : You can now chat with a PhD-level AI that knows whe...
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM
gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good OpenAI on GitHub : ...
Mira Murati's Thinking Machines Lab raised a $2B seed led by a16z at a $12B valuation; Nvidia, Accel, ServiceNow, Cisco, AMD, and Jane Street also invested
bsky.app/profile/wire... [embedded post] @akhilrao : i feel like i've known Murati is at a startup called Thinking Machine Labs for months. maybe idk what “stealth” means [embedded post] Michael / @f...
Sources: Anthropic's revenue hit a pace of $4B/year, up almost 4x from the beginning of 2025; source: Cursor maker Anysphere hired two of Claude Code's leaders
What's notable is that Cursor just poached the lead engineer and PM for Claude Code. I guess this is now an industry trend. Threads: Dare Obasanjo / @carnage4life : Anthropic is at $4B/year revenue w...
Google launches Gemini CLI, an agentic AI tool that lets developers make natural language requests in terminals by connecting Gemini models to local codebases
This repository contains the Gemini CLI, a command-line AI workflow tool … The Keyword : Gemini CLI: your open-source AI agent Vinay Patel / International Business Times : Google's Gemini CLI Lands in...
OpenAI announces an 80% price drop for its o3 model and a “flex” mode for synchronous processing that charges $5 for input and $20 for output per million tokens
just cheaper. https://platform.openai.com/ ... [image] Kevin Weil / @kevinweil : Because you all asked: we're going to double the rate limits for o3 for Plus users. Rolling out as we speak. Now go do ...