Meta agrees to acquire up to 6GW of AMD Instinct GPUs in a deal valued at $100B+ that could see Meta own up to 10% of AMD; Meta plans to deploy 1GW in 2026
Meta agrees to acquire up to 6GW of AMD Instinct GPUs in a deal valued at $100B+ that could see Meta own up to 10% of AMD; Meta plans to deploy 1GW in 2026
The deal could result in Meta owning as much as 10% of AMD's stock as the chip maker seeks to challenge Nvidia
Moonshot says Kimi K2.5 builds on K2 with “pretraining over ~15T mixed visual and text tokens” and “can self-direct an agent swarm with up to 100 sub-agents”
Today, we are introducing Kimi K2.5, the most powerful open-source model to date.
Qwen releases Qwen3-Max-Thinking, its flagship reasoning model that it says demonstrates performance comparable to models such as GPT-5.2 Thinking and Opus 4.5
· QwenTeam丨Translations:.体中文 — Introduction# — We present Qwen3-Max-Thinking, our latest flagship reasoning model.
Chinese startup Moonshot releases Kimi K2.5, saying the model can process text, images, and videos simultaneously and beats its open-source peers in some tests
Alibaba Group Holding Ltd.-backed Moonshot AI released an upgrade of its flagship model, heating up a domestic arms race ahead …
Qwen releases Qwen3-Max-Thinking, its flagship reasoning model that it says demonstrates performance comparable to models such as GPT-5.2 Thinking and Opus 4.5
Prime Intellect debuts INTELLECT-3, an RL-trained 106B parameter open source MOE model it claims outperforms larger models across math, code, science, reasoning
Today, we release INTELLECT-3, a 100B+ parameter Mixture-of-Experts model trained on our RL stack, achieving state …
Chinese startup Moonshot releases Kimi K2 Thinking, an open-weight model it claims beats GPT-5 in agentic capabilities; source: the model cost $4.6M to train
Chinese startup Moonshot on Thursday released its latest generative artificial intelligence model which claims to beat OpenAI's ChatGPT in …
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8....
Z.ai, formerly known as Zhipu and that has raised $1.5B from Tencent and others, releases GLM-4.5, an open-source AI model that it says is cheaper than DeepSeek
chinese models really are taking over huh Simon Willison / @simonwillison.net : Pretty decent pelicans from the new GLM-4.5 and GLM-4.5 Air models. Both models are MIT licensed, r...
Z.ai, formerly known as Zhipu and that has raised $1.5B from Tencent and others, releases GLM-4.5, an open-source AI model that it says is cheaper than DeepSeek
chinese models really are taking over huh Simon Willison / @simonwillison.net : Pretty decent pelicans from the new GLM-4.5 and GLM-4.5 Air models. Both models are MIT licensed, r...
Microsoft plans to shut down its Bing Search APIs on August 11; a source says the largest customers will retain access, and DuckDuckGo says it won't be affected
Microsoft is limiting access to tools that boosted its rivals, but larger customers like DuckDuckGo say they won't be affected.
Microsoft plans to shut down its Bing Search APIs on August 11; a source says the largest customers will retain access, and DuckDuckGo says it won't be affected
Microsoft is limiting access to tools that boosted its rivals, but larger customers like DuckDuckGo say they won't be affected.
Microsoft plans to shut down its Bing Search APIs on August 11; a source says the largest customers will retain access, and DuckDuckGo says it won't be affected
Microsoft is limiting access to tools that boosted its rivals, but larger customers like DuckDuckGo say they won't be affected.
LMArena says it is updating its leaderboard policies after a Llama 4 Maverick version, which Meta said in fine print is not public, secured the number two spot
With Llama 4, Meta fudged benchmarks to appear as though its new AI model is better than the competition.
Source: Anthropic's annualized revenue grew from $1B at the end of 2024 to $1.4B in early March; Manus uses tools including Claude 3.7 Sonnet to power its agent
The Information :
Sam Altman says OpenAI was forced to stagger GPT-4.5's rollout because it is “out of GPUs”; the model is wildly expensive, costing $75 per million input tokens
Hopefully the output is worth it? 🤔 — Oh... 😥 — www.theverge.com/news/620021/ ... [embedded post] X: Ed Zitron / @edzitron : Also, $1.30 per hour per GPU is the Microsoft dis...
OpenAI launches GPT-4.5, its “most knowledgeable model yet” in research preview, initially warning it's not a frontier model and may perform below o1 or o3-mini
OpenAI's newest and largest model is being released as a research preview.
Anthropic releases Claude 3.7 Sonnet, a hybrid model that can produce fast responses or extended, step-by-step thinking, and Claude Code, an agentic coding tool
and it could be a game changer Ghacks : Anthropic Unveils Claude 3.7: First Hybrid Reasoning AI Model Rowan Cheung / The Rundown AI : Claude enters the reasoning era Siddharth Jind...
Claude 3.7 and Grok-3 are the first “Gen3” models with big gains in handling complex tasks, using 10x more compute than GPT-4-class models, and better reasoning
Note: After publishing this piece, I was contacted by Anthropic who told me that Sonnet 3.7 would not be considered …