Chinese AI startup Z.ai releases its GLM-4.6V open-weight vision models, with support for native function calling, available in 106B- and 9B-parameter versions
The release includes two models in “large” and “small” sizes: — GLM-4.6V (106B), a larger 106-billion parameter model aimed at cloud-scale inference
OpenAI makes its Realtime API generally available with features like MCP support and debuts gpt-realtime, its most advanced speech-to-speech model, in the API
[video] @liodakis : Congrats to @pbbakkum on shipping gpt-realtime! It's been awesome watching him and the multimodal team sweat the details and get to a GA quality multimodal model. It's crazy to thi...
DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19
Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging Face : DeepSeek-V3.1 ...
A look at Apple's AI plans for WWDC; sources say it is testing its 3B, 7B, 33B, and 150B models via an internal Playground tool, and will name macOS 26 as Tahoe
but 4 major Apple Intelligence updates are on the horizon Sam Cross / T3 : macOS 26 leaks ahead of WWDC - or at least its full name does Craig Donaldson / Pocket-lint : WWDC 2025 could be light on AI ...
Alibaba releases open-source reasoning model QwQ-32B on Hugging Face and ModelScope, claiming comparable performance to DeepSeek-R1 but with lower compute needs
Introduction QwQ is the reasoning model of the Qwen series. Paul Barker / InfoWorld : Alibaba says its new AI model rivals DeepSeeks's R-1, OpenAI's o1 Jose Antonio Lanz / Decrypt : Alibaba's Latest A...
OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025
12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI model to tackle compl...
Anthropic launches Message Batches API in beta, for batches of up to 10K queries that are processed within 24 hours and cost 50% less than standard API calls
a cost-effective way to process vast amounts of queries asynchronously. You can submit batches of up to 10,000 queries at a time. Each batch is processed within 24 hours and costs 50% less than standa...
Mistral announces Mistral Large 2, the new generation of its flagship model, with 123B parameters; commercial usage requires a separate license
Today, we are announcing Mistral Large 2, the new generation of our flagship model. Tobias Mann / The Register : Mistral Large 2 leaps out as a leaner, meaner rival to GPT-4-class AI models MD Ijaj Kh...
Nvidia and Mistral release Mistral NeMo, a 12B-parameter language model with a 128K-token context window, available under the Apache 2.0 open-source license
Mistral NeMo: our new best small model. A state-of-the-art 12B model … Jonathan Kemper / The Decoder : Mistral releases three new LLMs for math, code and general tasks X: Prince Canuma / @prince_canu...
OpenAI launches GPT-4o mini, a smaller, cheaper offshoot of GPT-4o, replacing GPT-3.5 Turbo, with support for all multimodal inputs and outputs coming soon
each step more refined Aiming for the ‘perfect training set’ - the ultimate concentrate of all human knowledge and creativity Whether Gemini 1.5 Jeff Harris / @jeffintime : hidden gem: GPT-4o mini sup...