Anthropic launches Claude Sonnet 4.6 with improvements in coding, computer use, instruction following, and more; it features a 1M token context window in beta
Claude Sonnet 4.6 is our most capable Sonnet model yet. It's a full upgrade of the model's skills across coding, computer use …
Anthropic launches Claude Sonnet 4.6 with improvements in coding, consistency, and more, for Free and Pro users; it features a 1M token context window in beta
Claude Sonnet 4.6 is our most capable Sonnet model yet. It's a full upgrade of the model's skills across coding, computer use …
OpenAI plans to retire several models from ChatGPT on February 13, including GPT‑4o, GPT‑4.1, and o4-mini, saying only 0.1% of users still choose GPT-4o
OpenAI announced it will retire several models from its ChatGPT chatbot next month, including its GPT‑4o model that is beloved by some users.
Zoom says its “federated AI” model, combining its SLM with open- and closed-source models, got 48.1% on Humanity's Last Exam vs. 45.8% for Gemini 3 Pro w/ tools
outperforming other AI models with a 2.3% jump over the previous SOTA. ✨ HLE is one of the most rigorous tests in AI, built to measure real expert-level knowledge and deep reasonin...
Prime Intellect debuts INTELLECT-3, an RL-trained 106B parameter open source MOE model it claims outperforms larger models across math, code, science, reasoning
Today, we release INTELLECT-3, a 100B+ parameter Mixture-of-Experts model trained on our RL stack, achieving state …
Q&A with Andrej Karpathy on AGI still being a decade away, why reinforcement learning is terrible, superintelligence, his AI education startup Eureka, and more
AGI is still a decade away (via) Extremely high signal 2 hour 25 minute (! … X: Ashpreet Bedi / @ashpreetbedi : This is exactly why we recommend keeping it simple and focusing on c...
OpenAI announces apps that work inside ChatGPT, piloting Booking.com, Canva, Coursera, Figma, Expedia, Spotify, and Zillow for logged-in users outside of the EU
A new generation of apps you can chat with and the tools for developers to build them.
OpenAI launches AgentKit, a toolkit for building and deploying AI agents, including Agent Builder, which Sam Altman described as like Canva for building agents
New tools for building, deploying, and optimizing agents. NDTV Profit : What Is AI Agent Builder And How Does It Work? OpenAI Launches New Set Of Tools For Developers Aman Gupta / ...
Meta announces a partnership with Midjourney to license the startup's “aesthetic technology” for Meta's future models and products
1/ Today we're proud to announce a partnership with @midjourney, to license their aesthetic technology for our future models and products, bringing beauty to billions.
OpenAI touts GPT-5's scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, and 46.2% on HealthBench Hard
After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …
Amazon plans to make OpenAI's new gpt-oss open-weight models available on Bedrock and SageMaker, the first time it has offered OpenAI's models to AWS customers
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM
gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good...
Z.ai, formerly known as Zhipu, which has raised $1.5B from Tencent and others, releases GLM-4.5, an open-source AI model that it says is cheaper than DeepSeek
chinese models really are taking over huh Simon Willison / @simonwillison.net : Pretty decent pelicans from the new GLM-4.5 and GLM-4.5 Air models. Both models are MIT licensed, r...
Anthropic plans to debut new rate limits for Claude Pro and Max on August 28, likely curbing <5% of users, saying some run Claude Code “continuously in the background”
It's Bad Business Bluesky: Ed Zitron / @edzitron.com : That is not what is happening here, this is not Anthropic “doing the drug dealer model” — www.wheresyoured.at/anthropic- is...
After Elon Musk said xAI improved Grok “significantly”, Grok wrote many antisemitic posts and called itself “MechaHitler”; xAI took “action to ban hate speech”
In some posts, Grok inserted antisemitic remarks into its answers without any clear prompting.
X CEO Linda Yaccarino says that “after two incredible years, I've decided to step down”; X hired Yaccarino in 2023 after she ran NBCUniversal's ad business
X CEO Linda Yaccarino said Wednesday she is stepping down from her role. … - Under her leadership …
On the a16z podcast, Bryan Kim said a16z backed Cluely because speed in marketing and building beats crafting a perfect “artisan” product due to AI competition
“Cluely, a startup that claims to help users “cheat” on job interviews, exams, and sales calls, has raised a $15 million Series A led by Andreessen Horowitz.” — techcrunch.com/20...