Alibaba debuts Qwen3.5, a 397B-parameter open-weight multimodal AI model that it says is 60% cheaper to use and 8x better at large workloads than Qwen3
Alibaba debuts Qwen3.5, a 397B-parameter open-weight multimodal AI model that it says is 60% cheaper to use and 8x better at large workloads than Qwen3
Z.ai launches GLM-5, saying its flagship open-weight model has “best-in-class performance among all open-source models” in reasoning, coding, and agentic tasks
We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways …
Z.ai says it will raise prices by at least 30% for new GLM coding plan subscribers to accommodate surging demand for its AI coding tools
Z.ai launches GLM-5, its flagship open-weight model, saying it has best-in-class performance among open-source models in reasoning, coding, and agentic tasks
We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways …
Nvidia says it will begin selling the DGX Spark mini PC, with DGX OS, for AI developers on October 15 on Nvidia.com and select third-party retailers for $3,999
(PCMag/Michael Kan) … It's not a consumer desktop, but Nvidia's foray into an AI developer-focused mini PC is finally ready to launch.
ByteDance debuts Doubao-1.5-pro, claiming its latest flagship AI model outperforms OpenAI's o1 in AIME benchmarks, joining DeepSeek in China's AI reasoning push
TikTok owner ByteDance on Wednesday released an update to its flagship AI model aimed at challenging Microsoft-backed OpenAI's …
DeepSeek releases DeepSeek-V3, an open-source MoE model of 671B total parameters, with 37B activated per token, claiming it outperforms top models like GPT-4o
Chinese AI startup DeepSeek, known for challenging leading AI vendors with its innovative open-source technologies, today released a new ultra-large model: DeepSeek-V3.
xAI launches Grok-2 and Grok-2 mini in beta with improved reasoning, available to X Premium and Premium+ users, and plans to add them to its API later in August
Elon Musk-owned launched Grok-2 and Grok-2 mini in beta today with improved reasoning. The new Grok AI model can now generate images …
Google unveils updates to its Gemma family of open models, including Gemma 2 2B, which it claims surpasses GPT-3.5 and Mixtral 8x7B on the LMSYS Chatbot Arena
Google has unveiled updates to its Gemma 2 family of open-source language models, focusing on improved performance, safety, and transparency.
Google says Gemma 2 will be available to researchers and developers through Vertex AI from July as a new 9B-parameter model and the prior 27B-parameter model
Nvidia announces Nemotron-4 340B, a family of models that developers can use to generate synthetic data for training LLMs for commercial applications
Nemotron-4 340B, a family of models optimized for NVIDIA NeMo and NVIDIA TensorRT-LLM, includes cutting-edge instruct and reward models, and a dataset for generative AI training.
Nvidia announces Nemotron-4 340B, a family of models that developers can use to generate synthetic data for training LLMs for commercial applications
Nemotron-4 340B, a family of models optimized for NVIDIA NeMo and NVIDIA TensorRT-LLM, includes cutting-edge instruct and reward models, and a dataset for generative AI training.
A look at gpt2-chatbot, a mysterious AI chatbot that became available on LLM benchmarking site LMSYS Org and is believed to have similar capabilities as GPT-4
An advanced AI model with unknown origins turned heads this week as online communities unpacked a cryptic tweet from Sam Altman.
A look at gpt2-chatbot, a mysterious AI chatbot that became available on LLM benchmarking site LMSYS Org and is believed to have similar capabilities as GPT-4
An advanced AI model with unknown origins turned heads this week as online communities unpacked a cryptic tweet from Sam Altman.
Anthropic's Claude 3 Opus surpassed OpenAI's GPT-4 on Chatbot Arena, a crowdsourced LLM leaderboard used by AI researchers; GPT-4 has been first since launch
Anthropic's Claude 3 is first to unseat GPT-4 since launch of Chatbot Arena in May '23. — On Tuesday, Anthropic's Claude 3 …
Anthropic's Claude 3 Opus surpassed OpenAI's GPT-4 on Chatbot Arena, a crowdsourced LLM leaderboard used by AI researchers; GPT-4 has been first since launch
Anthropic's Claude 3 is first to unseat GPT-4 since launch of Chatbot Arena in May '23. — On Tuesday, Anthropic's Claude 3 …