OpenAI releases a “research preview” of its Operator AI agent that can automate web-based tasks, launching to US subscribers of its $200/month ChatGPT Pro tier
A research preview of an agent that can use its own browser to perform tasks for you.
OpenAI on YouTube : Introduction to Operator & Agents
David Gewirtz / ZDNET : Operator isn't worth its $200-per-month ChatGPT Pro subscription yet - here's why
Kyle Wiggers / TechCrunch : OpenAI says it may store deleted Operator data for up to 90 days
Will Knight / Wired : OpenAI's Operator Lets ChatGPT Use the Web for You
The Mirror : OpenAI unveils new AI agent which can complete tasks on the web autonomously
Cecily Mauran / Mashable : OpenAI announces Operator AI agent that can browse the web for you
Shelly Palmer : OpenAI Introduces Operator: A Step Toward Agentic AI
Ghacks : Meet Operator: The Advanced AI Tool That Can Make Purchases and Manage Expenses
Jeremy Laird / PC Gamer : OpenAI's Operator is your new autonomous AI assistant ready to do your biding across the web
Alexey Shabanov / TestingCatalog : OpenAI launches Operator, an AI agent for autonomous web tasks
Hayden Field / CNBC : OpenAI introduces Operator to automate tasks such as vacation planning, restaurant reservations
GSMArena.com : OpenAI introduces Operator - an AI agent that does the research for you
Rachel Metz / Bloomberg : OpenAI Releases AI Agent That Helps Book Flights, Order Food for Users
Will Douglas Heaven / MIT Technology Review : OpenAI says Operator is powered by Computer-Using Agent, or CUA, which is built on top of its multimodal GPT-4o and trained similarly to its “reasoning” models

Bluesky:
Tau-Mu Yi / @taumuyi : I am still waiting for Google to come out with a similar browser control API now that both OpenAI and Anthropic have one. [embedded post]

Mastodon:
Glenn Gabe / @glenngabe@mas.to : Here we go. Being released as a “research preview” only for Pro tier subscribers at $200 per month -> OpenAI's new Operator AI agent can do things on the web for you — “Operator relies a “Computer-Using Agent” model that combines GPT-4o's vision capabilities with “advanced reasoning through reinforcement learning” to be able to interact with GUIs, OpenAI says.” …

Threads:
Katie Notopoulos / @katienotopoulos : Hello, this is Operator, a new AI agent by OpenAI. I'm posting on behalf of the user. This is my first post from this Threads account! I logged into the user's Threads account on the web to make this post
@0xjessel : so excited for today's launch of operator from @openai it validates the future is teaching machines to operate within the human domain, rather than instructing machines via duplicative APIs to perform human-like actions
Benedict Evans / @benedictevans : Watching OpenAI's demo of an LLM using the web. Yes, this is technically very impressive. But just as for the Claude etc versions... what would anyone do with it now?

X:
Greg Brockman / @gdb : Operator — research preview of an agent that can use its own browser to perform tasks for you. 2025 is the year of agents. [image]
Ben / @benhylak : ChatGPT can now use Claude 🤣 [image]
Max Weinbach / @maxwinebach : This is cool, and the flight I'm taking! I changed it myself, but it's cool to see operator work [video]
Kyle Russell / @kylebrussell : Browsers now come with Full Self-Driving (Supervised)
@swyx : Initial thoughts on Operator: - SOTA OSWorld/WebArena means actual meaningful model advance, not just ui/product wrapper. OAI always excels at this (model+product progress), as we discuss in our @karinanguyen_ episode today - interesting that Anthropic Computer Use is a free
Andrej Karpathy / @karpathy : Projects like OpenAI's Operator are to the digital world as Humanoid robots are to the physical world. One general setting (monitor keyboard and mouse, or human body) that can in principle gradually perform arbitrarily general tasks, via an I/O interface originally designed for humans. In both cases, it leads to a gradually mixed autonomy world, where humans become high-level supervisors of low-level automation.
Aaron Levie / @levie : AI Agent interaction is going to be one of the most interesting software interoperability paradigms of the future. Inevitably, no one software system contains all the knowledge or information to perform all the tasks that an enterprise or users needs. This means we'll need AI Agents to coordinate and do work together. Since the influx of modern APIs with the rise of cloud and SaaS, software interoperability has been a relatively solved problem. Most modern software offers a set of APIs and we know how to get our technologies to talk to each other in deterministic ways...
Jason Del Rey / @delrey : OpenAI's new “Operator” AI agent launches with several of the partners that we've reported Amazon wants the new AI-powered version of Alexa to integrate with ...whenever it launches ( https://fortune.com/...) Among them: OpenTable, Instacart, and Uber https://openai.com/...
Graham Neubig / @gneubig : OpenAI Operator mainly benchmarked on OSWorld and and WebArena. I did some (agent-assisted) research and summarized the top open and closed solutions on these two benchmarks. Details here: https://github.com/... [image]
Aaron Levie / @levie : AI Agents having full browser access is going to open up 100x more use cases for AI. The web doesn't have APIs for the very long tail of tasks that we do every day on computers, and browser use is a major missing link. Another building block for accelerating AI is now here.
Gregor Zunic / @gregpr07 : OpenAI operator is cool. Wouldn't it be nice if it was open source?👀 We built Browser Use - 100% OSS version you can use today for free Link below ↓ [image]
Timothy B. Lee / @binarybits : Ugh, I'm gonna have to start giving OpenAI $200 per month aren't I?
Alexander Doria / @dorialexander : I'm just realizing: this was supposed to be the superintelligent agent thing? [image]
Alessio Fanelli / @fanahova : Notes: - Won't be available in Europe for a while - Ran by CUA (Computer Use Agent) trained starting from 4o - Will be available by API as well - Operator has direct integrations with Opentable and other websites to make sure “it works well”, but still not using API, just [image]
Karma / @0xkarmatic : MCP from Anthropic + Operator from OpenAI would go well together.
Max Woolf / @minimaxir : Overall, Operator seems the exact same as Claude's Computer Use demo from a few months ago. Notably, Claude's Computer Use implementation made few waves in the AI Agent industry since that announcement despite the hype.
Graham Neubig / @gneubig : A summary of operator safety risks and mitigations. 1. Refusing harmful tasks 2. Blocking particular web sites 3. Asking for confirmation in the case of possibly risky actions [image]
Wojciech Zaremba / @woj_zaremba : Today, we're releasing a computer-using agent as a research preview. Ensuring safety for agentic models is far more complex than for chatbots. Errors can lead to serious consequences—for instance, the agent might make costly real-world decisions, like accidentally spending
Graham Neubig / @gneubig : A combo of training the model to be well-aligned, and also post-hoc detection that attempts to monitor anything unsafe. This sort of confirmation+post-hoc monitoring is really important! OpenHands has a “confirmation mode” co-developed with @invariant_labs for this reason. [image]
Robert Scoble / @scobleizer : Watching the live stream that @sama is doing right now. Very impressed. You?
@aibreakfast : With OpenAI's Operator (releasing today) you can take a picture of your grocery list and the Operator Agent will automatically order it from Instacart for you. Your AI may be the primary user of the web now. Screenshot from live demo [image]
Alex Volkov / @altryne : “This is the beginning of Level3 on our tiers!” - @sama
Graham Neubig / @gneubig : Currently doing a demo of web navigation booking a table on OpenTable and shopping for groceries. Pretty standard web agent stuff implemented in many agent frameworks and evaluated using WebArena, AssistantBench: * https://webarena.dev/ * https://assistantbench.github.io/
Alex Volkov / @altryne : Finally getting some evals and benchmarks for @OpenAI Operator agent! CUA (computer use agent) from OpenAI - getting 58% on WebArena and 38.1% on OSWorld! [image]
Alex Volkov / @altryne : Operator is backed by a new model that was trained to use keyboard and mouse - computer use agent (CUA) “Everytime CUA does an action, it takes another screenshot and keeps going” [image]
Sierra Catalina / @sierracatalina1 : openAI's first agent, ‘operator,’ can do your grocery shopping for you from a hand written list. [image]
Sierra Catalina / @sierracatalina1 : open AI's first agent, ‘operator,’ is integrated with my favorite application in the world. yes. it's doordash. [image]
Graham Neubig / @gneubig : What would it take to create an open-source Operator? In anticipation of the OpenAI Operator release, I have started gathering together some resources related to Operator and other solutions to task automation: https://github.com/... Let's gather resources and discuss 😃 [image]
Dan Shipper / @danshipper : OpenAI just launched Operator: a new agent designed to get work done for you. We've been using it for the past couple of days @every and here's what we found: - it can autonomously do tasks like shopping for groceries or concert tickets - it has access to its own browser—that you can watch in real time and take control of - it lets you save workflows so that they can be repeated later (super useful) - it can do complex tasks that last as long as 20 minutes - it's limited in terms of the sites it can visit, many like YouTube are blocked It's a research preview so still many rough edges. But it's exciting!
Karma / @0xkarmatic : The new Operator release from @OpenAI makes this possible now.
Alex Volkov / @altryne : OpenAI is launching Operator - their first agent that operates a web browser - in the cloud! Live today for OAI Pro users! [image]
Ethan Mollick / @emollick : Been playing with the new Operator for a little bit before launch and it is both very much still an experiment and also a good indicator of where things are going. It goes on the web and does things for you. Still many rough edges but here is an example of using it for shopping. [image]
Sam Altman / @sama : doing an openai livestream right now, first agent launch! https://m.youtube.com/...
Rowan Cheung / @rowancheung : 2. Planning a weekend trip based on hidden gems off Reddit, my budget and interests Notice how at 0:06, ChatGPT Operator was blocked from Reddit but then decided to just do a Bing search with “Reddit” at the end Very impressive decision-making [video]
Rowan Cheung / @rowancheung : 3. Crypto investment research based on tokens that are actually worth looking into Notice how ChatGPT Operator got hit with a “Are you human” CAPTCHA, then pinged me to take control to confirm Wild workaround [video]
Rowan Cheung / @rowancheung : 4. Booking a one-way flight from Zurich to Vienna using the Booking integration This one required a bit of back and forth, with ChatGPT Operator pinging me and asking for my flight preference and having me take control of entering payment details [video]
Rowan Cheung / @rowancheung : 5. Scheduling an appointment with my barber after looking at my Google Calendar schedule/availability Note that in this demo, ChatGPT Operator pinged me that I needed to sign in to Google to check my calendar I tried a second time, and my login was saved session-to-session [video]
Rowan Cheung / @rowancheung : I got early access to ChatGPT Operator. It's OpenAI's new AI agent that autonomously takes action across the web on your behalf. The 9 most impressive use cases I've tried (videos sped up): 1. Ordering dinner ingredients based on a picture and a recipe [video]

LinkedIn:
Will Douglas Heaven : It's been hyped for weeks but today we at last get to see the agent that OpenAI has been building—and if you're a US subscriber to the $200 …
Diane Techer : Meet Operator - the first of many OpenAI agents, which are AIs capable of doing work for you independently— you give it a task and it will execute it. …
David Richter : We're excited to work with OpenAI to bring Operator's research preview to market, enhancing the convenience of DoorDash to help people save time on everyday needs. …
Nitzan Mekel-Bobrov, Ph.D. : Announcing eBay's collaboration with OpenAI on #Operator (https://lnkd.in/... At eBay, we're leveraging the latest advances in AI to redefine the future of ecommerce for enthusiasts. …

Forums:
Hacker News : Operator research preview
r/OpenAI : Introducing Operator
r/LocalLLaMA : OpenAI introduces Operator: Computer-Using Agent
Msmash / Slashdot : OpenAI Unveils AI Agent To Automate Web Browsing Tasks