A look at the state of AI agents, the evolution of thinking models, the staggering need for inference compute in the coming years, automated research, and more
— Dr. Vannevar Bush, As We May Think, 1945 — If we consider life to be a sort of open-ended MMO, the game server has just received a major update.
Mira Murati's Thinking Machines Lab makes Tinker, its API for fine-tuning language models, generally available, adds support for Kimi K2 Thinking, and more
Tinker is a dream for multi-agent setups, Nathan Lambert / @natolambert : Please add olmo3 @johnschulman2 et al. The goal is to make it the foundational research infrastructure for...
Andrej Karpathy unveils nanochat, a full-stack training and inference implementation of an LLM in a single, dependency-minimal codebase, deployable in 4 hours
It provides a full ChatGPT-style LLM, including training, inference and a web Ui … X: Clem / @clementdelangue : Am I wrong in sensing a paradigm shift in AI? Feels like we're movin...
Mira Murati's Thinking Machines Lab raised a $2B seed led by a16z at a $12B valuation; Nvidia, Accel, ServiceNow, Cisco, AMD, and Jane Street also invested
bsky.app/profile/wire... [embedded post] @akhilrao : i feel like i've known Murati is at a startup called Thinking Machine Labs for months. maybe idk what “stealth” means [embedde...
Alibaba debuts its Qwen3 family of open-weight “hybrid” AI reasoning models, including Qwen3-235B-A22B, with 235B total parameters and 22B activated parameters
Chinese tech company Alibaba on Monday released Qwen3, a family of AI models the company claims matches …
Dealroom: AI coding assistant startups such as Anysphere and Augment have raised $433M so far in 2024 alone, bringing the total since January 2023 to $906M
Software engineering attracts investors but making money from generative artificial intelligence still eludes many
OpenAI co-founder John Schulman departs to join Anthropic and focus on AI alignment, and says “I'm not leaving due to lack of support for alignment research”
I shared the following note with my OpenAI colleagues today: I've made the difficult decision to leave OpenAI. This choice stems from my desire to deepen my focus on AI alignment, ...
Mark Zuckerberg argues that “open source AI” is the path forward, closed models are vulnerable to vendor lock-in and state-backed espionage, and more
RE: https://www.threads.net/... Dare Obasanjo / @carnage4life : You can find @zuck's full post here https://www.facebook.com/... Dare Obasanjo / @carnage4life : Mark Zuckerberg has...
Meta debuts Llama 3.1 405B, the “first frontier-level open source AI model”, as well as new Llama 3.1 70B and 8B models, and says it's working on Llama 4
Some developers are releasing versions of Llama 3, which has a context window of 8K+ tokens, with longer context windows, thanks to Meta's open-source approach
Meta releases Llama 2, its open-source LLM with double the context length, for free for research and commercial use, and expands its Microsoft partnership
Recent breakthroughs in AI, and generative AI in particular, have captured the public's imagination and demonstrated what those developing …
Meta releases Llama 2, its open-source LLM with double the context length, for free for research and commercial use, and expands its Microsoft partnership
Recent breakthroughs in AI, and generative AI in particular, have captured the public's imagination and demonstrated what those developing …
Google's I/O 2023 suggests that AI is a sustaining innovation for Big Tech; the true fight will be between the major players' centralized models and open source
Google's AI-heavy I/O suggests AI is a sustaining innovation for Big Tech; the true fight will be between major players' centralized models and open source
Some things in tech are shocking, but not surprising — think of a CEO of a struggling company losing their job.
Meta releases its Segment Anything Model and Segment Anything 1-Billion mask dataset, hoping to help researchers with computer vision and object identification
and Meta is sharing the code Katie Paul / Reuters : Meta releases AI model that can identify items within images GitHub : Segment Anything — Meta AI Research, FAIR — [Paper] [P...
OpenAI debuts GPT-4, claiming the model “surpasses ChatGPT in its advanced reasoning capabilities”, available in ChatGPT Plus and as an API that has a waitlist
Following the research path from GPT, GPT-2, and GPT-3, our deep learning approach leverages more data and more computation …