Alibaba releases the Qwen3-VL vision models, the Qwen3Guard “safety moderation” models, and three closed-weight models, including Qwen3-Max with 1T+ parameters
Julian Nabil / Forbes Middle East : Alibaba Introduces Qwen3-Max AI Model With Over 1T Parameters
Markus Kasanmascheff / WinBuzzer : Alibaba...
Alibaba's Hong Kong-listed shares hit a nearly four-year high after CEO Eddie Wu announced plans to increase AI spending beyond the $53B target over three years
Alibaba Group Holding Ltd.'s shares surged to their highest in nearly four years after the company revealed plans to ramp up AI spending past …
Alibaba debuts Qwen3-Max-Preview, its largest AI model with over 1T parameters, showcasing strong benchmark performance; the model is not open source
Chinese e-commerce giant Alibaba's “Qwen Team” of AI researchers has done it again. After a busy summer in which the AI lab released …
DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19
Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀
Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon
Hugging F...
DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …
Alibaba releases Qwen-Image, an open-source AI image generation model focused on accurately rendering text, with support for alphabetic and logographic scripts
After seizing the summer with a blitz of powerful, freely available new open source language and coding focused AI models …
Alibaba researchers detail ZeroSearch, a technique that allows LLMs to develop advanced search capabilities via simulation, claiming it cuts costs by up to 88%
Researchers at Alibaba Group have developed a novel approach that could dramatically reduce the cost and complexity …
Xiaomi unveils open-source AI reasoning model MiMo, joining other Chinese tech leaders hoping to make a splash in the burgeoning AI field endorsed by Beijing
Xiaomi debuted MiMo a day after Alibaba unveiled the latest version of its own flagship model, amplifying a race between China's tech players …
Alibaba releases Qwen2.5-VL-32B, a 32B open model under Apache 2.0, claiming better math reasoning and alignment with human preferences than earlier 2.5 models
Qwen2.5-VL-32B: Smarter and Lighter. The second big open weight LLM release from China today - the first being DeepSeek v3-0324.
Source: Ilya Sutskever's startup Safe Superintelligence is raising $1B+ at a $30B+ valuation led by Greenoaks Capital, which plans to invest $500M
… that outsmarts humans in a safe way, is heading for a $30 billion-plus valuation
The Information : Ilya Sutskever's Startup in Talks to Raise Financing at $30 Billion Valuation
Meyt...
OpenAI launches o3-mini, its latest reasoning model that the company says is largely on par with o1 and o1-mini in capabilities, but runs faster and costs less
OpenAI on Friday launched a new AI “reasoning” model, o3-mini, the newest in the company's o family of reasoning models.
Alibaba releases Qwen 2.5-Max, an AI model that the company's cloud unit claims “outperforms” GPT-4o, DeepSeek-V3, and Llama-3.1-405B “almost across the board”
Chinese tech company Alibaba (9988.HK) on Wednesday released a new version of its Qwen 2.5 artificial intelligence model …
Alibaba releases QVQ-72B-Preview, an experimental research model focused on “enhancing visual reasoning capabilities”, built on Qwen2-VL-72B
QVQ-72B-Preview is an experimental research model developed by the Qwen team …
QwenLM on GitHub : Qwen2-VL: Introduction. After a year's relentless efforts, today we are thrilled...
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp?
X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...
Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel 3D depth map in 0.3 seconds on a standard GPU, using a single image
Michael Nuñez / VentureBeat :
Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel depth map in 0.3 seconds on a standard GPU, using a single image
Apple's AI research team has developed a new model that could significantly advance how machines perceive depth …
Meta announces Movie Gen, a suite of AI models for generating realistic video and audio clips; Movie Gen Video has 30B parameters and Movie Gen Audio has 13B
The next frontier in generative AI is video—and with Movie Gen, Meta has now staked its claim.