Google says Gemini 3 Pro scores 1,501 on LMArena, above 2.5 Pro, and demonstrates PhD-level reasoning with top scores on Humanity's Last Exam and GPQA Diamond
Google today announced Gemini 3 with the goal of bringing “any idea to life.” The first model available in this family …
Google says the median Gemini app text prompt consumes 0.24Wh of energy, about the same as running a microwave for a second, and emits 0.03g of CO2 equivalent
an official report confirms that Gemini consumes per query: — 0.24 Wh of energy (~9 seconds of TV) — 0.03 g of CO2 equivalent — 0.26 ml of water (about 5 drops) — blog: cloud.google.com/blog/prod...
A profile of UCB CS professor Ion Stoica, a billionaire who has helped launch four startups out of his privately-funded research lab, including Databricks
championed by colleagues like @trevordarrell, @pabbeel, @istoica05, and others—is leading the nation, unleashing the creativity of top students and research labs through industry-sponsored open-source...
Google releases an upgraded preview of Gemini 2.5 Pro, saying its Elo score jumped by 24 points on LMArena and it leads in coding benchmarks like Aider Polyglot
Abner Li / 9to5Google :
AI model ranking project LMArena spins off from UC Berkeley and raises a $100M seed led by a16z and UC Investments, sources say at a $600M valuation
An effort that began at UC Berkeley as a way to rank AI models has raised $100 million from investors including Andreessen Horowitz.
LMArena says it's starting a company, whose corporate name will be Arena Intelligence, with plans to raise money, and releases a new beta version of its website
fixing errors/bugs, improving our UI layout, and more. To keep supporting the development and continual improvement of this platform, we're also forming a company. Future improvements will continue ...
Meta VP of Generative AI Ahmad Al-Dahle denies a rumor that the company trained Llama 4 Maverick and Scout on test sets, saying that Meta “would never do that”
but the EU doesn't get everything Pascale Davies / Euronews : From a political shift to a more powerful AI: Everything to know about Meta's Llama 4 models Jay Bonggolto / Android Central : Meta is com...
LMArena says it is updating its leaderboard policies after a Llama 4 Maverick version, which Meta said in fine print is not public, secured the number two spot
With Llama 4, Meta fudged benchmarks to appear as though its new AI model is better than the competition.
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Flash Thinking, an exp...