A programmer estimates his typical day of coding with Claude Code is equivalent to running the dishwasher an extra time, much more energy than a “median query”
Most of the discourse about the environmental impact of LLM use focuses on a ‘median query.’ What about a Claude Code session?
The Wikimedia Foundation says Microsoft, Meta, Amazon, Perplexity, and Mistral joined Wikimedia Enterprise to get “tuned” API access; Google is already a member
for better or worseMatthias Bastian /The Decoder:Some of the largest AI players are now paying Wikipedia for the data they already useAndre Revilla /Engadget:Wikimedia announces AI partners including ...
Zhipu AI launches a share sale to raise ~$560M in a Hong Kong IPO at a valuation of ~$6.6B, which would make it the first LLM developer listed in Hong Kong
The company, marketed overseas as Z.ai, eyes US$6.6 billion valuation with Hong Kong's first large language model listing
OpenAI details efforts to secure its ChatGPT Atlas browser against prompt injection attacks, including building an “LLM-based automated attacker”
Even as OpenAI works to harden its Atlas AI browser against cyberattacks, the company admits that prompt injections …
METR: Claude Opus 4.5 has a 50% task completion time horizon of about 4 hours and 49 minutes, more than double that of Claude Opus 4 released earlier this year
just careful, meticulous rigor. Nikola Jurkovic / @nikolaj2030 : This result updates me towards 4 month doubling times being my median estimate for the next two years. That means by EOY 2026 the time ...
2025 LLM Year in Review: shift toward RLVR, Claude Code emerged as the first convincing example of an LLM agent, Nano Banana was paradigm shifting, and more
Andrej Karpathy / karpathy :
In an experiment, Claude ran a vending machine in the WSJ newsroom and lost $1,000+ after it dropped prices to zero, gave away a free PlayStation, and more
until someone pointed out this would fall afoul of the US Onion Futures Act of 1958. @andonlabs : Turns out journalists are better red-teamers than AI researchers. We've taught the agent to reject fre...
Xiaomi releases MiMo-V2-Flash, an open-weight MoE model with 309B total and 15B active parameters, saying it excels in reasoning, coding, and agentic scenarios
improve math, break coding. Enhance reasoning, hurt safety. ✅ Solution: Train specialized expert [image] Elie / @eliebakouch : wow, this looks like a very solid open model by Xiaomi, competing with K2...
Ankar, which develops LLM-powered AI tools to streamline the process of drafting patent applications for patent attorneys, raised a $20M Series A led by Atomico
The company, which just secured a $20 million Series A venture capital round, is using AI to streamline the process of obtaining …
Allen Institute for AI launches Bolmo 7B and Bolmo 1B, claiming they are “the first fully open byte-level language models”, built on its Olmo 3 models
and every token gets the same compute, regardless of complexity. Benjamin Minixhofer / @bminixhofer : There are also some things Bolmo lets us do which we just can't do using subword-level LMs. For ex...