OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost

OpenAI eyes January exit from “code red” John Werner / Forbes : The Wonder And The Promise Of GPT 5.2 Is Here Benj Edwards / Ars Technica : OpenAI releases GPT-5.2 after “code red” Google threat alert Shelly Palmer : OpenAI's GPT-5.2 is a Maintenance Release with Strategic Implications Thomas Smith / Fast Company : OpenAI just released its new GPT-5.2 model. Here's what you need to know Ronil Thakkar / KnowTechie : OpenAI fires back at Google with GPT-5.2 The Week : OpenAI's flagship GPT-5.2 for ChatGPT is here: All you need to know GameRevolution : ‘ChatGPT 5.2 vs 5.1’ Trends as AI App Gets an Upgrade Alexey Shabanov / TestingCatalog : OpenAI launches GPT-5.2 that scores 54% on ARC-AGI-2 Tyler Lee / Phandroid : OpenAI Fires Back at Google with GPT-5.2 ChatGPT Upgrade Alex McFarland / Unite.AI : OpenAI Releases GPT-5.2 After Internal ‘Code Red’ Over Google's Gemini 3 Britney Nguyen / MarketWatch : OpenAI strikes back in the chatbot race against Google with new ChatGPT model Jason Nelson / Decrypt : OpenAI Launches GPT-5.2 Amid Expanded Major Contracts Ashley Capoot / CNBC : Sam Altman expects OpenAI to exit ‘code red’ by January after launch of GPT-5.2 model Mashable : GPT-5.2 vs Gemini 3 — How the two heavyweight models compare on benchmarks, price, and feature set Minh Le / Tech in Asia : OpenAI rolls out new GPT-5.2 model Blake Stimac / CNET : ChatGPT-5.2 Arrives: Here's What It Means for Everyday and Work Users PYMNTS.com : OpenAI Says New AI Model GPT-5.2 Unlocks ‘Even More Economic Value’ Juli Clover / MacRumors : OpenAI Launches GPT-5.2 for ChatGPT Users a Week After Declaring ‘Code Red’ Emily Forlini / PCMag : OpenAI's New GPT-5.2 Model Wants to Help You Automate Your Job OpenAI : Advancing science and math with GPT-5.2 OpenAI : Update to GPT-5 System Card: GPT-5.2 X: @openai : GPT-5.2 is now rolling out to everyone. https://openai.com/... Sam Altman / @sama : It is a very smart model, and we have come a long way since GPT-5.1: [image] Rohan Paul / @rohanpaul_ai : GPT-5.2 Thinking's score on GDPVal is just so massive. Unlike classic benchmarks that are just text in and text out, GDPval tasks come with reference files and context, and the expected outputs are full deliverables like documents, slides, spreadsheets, diagrams, audio, or video [image] Christian Hendriksen / @chehendriksen : Note that 5.2 Thinking gets a lot of ties to get above the 50 % mark - but 5.2 Pro has a 10 % point lead on pure wins vs 5.2 Thinking, even if the total win&tie-rate ends up being “only” 3,2 % higher. Clearly 5.2 Pro delivers more robust economically valuable quality. [image] Ethan Mollick / @emollick : @StefanFSchubert Agree, but GDPval does not really suggest autonomous knowledge work. Stefan Schubert / @stefanfschubert : It's key to read the qualification here - “well-specified knowledge work tasks”. That's not equivalent to those knowledge jobs in their entirety. @signulll : it's not the size of the model, it's how you use it. @scaling01 : GPT-5.2 Pro looks terrible here GPT-5.2 is on par with Gemini 3 Pro in terms of efficiency [image] Rohan Paul / @rohanpaul_ai : 🎯 Wow. The AI's “supersonic tsunami” is in full motion. ARC Prize just verified GPT-5.2 Pro (X-High) at 90.5% on ARC-AGI-1 for $11.64 per task Which beats prior records and shows a ~390X efficiency jump in just a year. And GPT-5.2 Pro (High) hits 54.2% on ARC-AGI-2 for [image] Prinz / @deredleritt3r : GDPval is one of the few remaining benchmarks that actually matters. - The benchmark spans tasks that request real work product in 44 occupations across 9 industries: presentations, spreadsheets, urgent care schedules, manufacturing diagrams, etc. The model's work is compared [image] Lech Mazur / @lechmazur : The high-reasoning version of GPT-5.2 improves on GPT-5.1 on the Extended NYT Connections benchmark: 69.9 → 77.9. The medium-reasoning version also improves: 62.7 → 72.1. The no-reasoning version also improves: 22.1 → 27.5. MiniMax-M2 scores 27.6. [image] Vik / @vikhyatk : only gpt-5.2-extra-high can save me now @zephyr_z9 : A new attention mechanism [image] @teortaxestex : I don't think Google will ever let other labs touch actually visually nontrivial frontend. Well, maybe for a week or so. If they blow their shot on a Hail Mary. Blind agents can be surprisingly crafty after all. Grace Li / @grx_xce : GPT-5.2 (xHigh) debuts in #3 place overall on Design Arena, 1st on Game Dev Arena, and a top 3 finish in Website Arena and Data Viz Arena Well-deserved congratulations to the team at @OpenAI - we saw a ridiculous amount of care go into this one, a meaningful step forward in [image] @scaling01 : Gemini 3 Pro and Opus 4.5 still have the lead in frontend development [image] @zephyr_z9 : OpenAI cooked with this one @zephyr_z9 : Very impressive [image] Ben Pouladian / @benitoz : GPT-5.2 is the clearest signal yet that pre-training scaling isn't slowing down Bigger corpuses, longer contexts, hotter training run Every jump like this means one thing: NVIDIA's curve is nowhere near flattening. We're still early in the compute supercycle. $nvda $orcl [image] Matt Dratch / @dratchcap : And not even a Blackwell model... Matt Shumer / @mattshumer_ : I've been using GPT-5.2 Pro for two weeks now. It's the best model in the world. It thinks for over an hour on hard problems. And it nails tasks no other model can touch. I can't live without it. Here's my GPT-5.2 Pro deep dive: https://shumer.dev/... @scaling01 : GPT-5.2 Thinking benchmarks looks very solid compared to Opus 4.5 and Gemini 3 Pro, especially on knowledge heavy tasks like GPQA-Dimaond or HLE but comes at a higher price than GPT-5.1 these two facts together suggest that GPT-5.2 is a larger model than GPT-5.1 and likely also in the same parameter range as Gemini 3 Pro Greg Brockman / @gdb : GPT-5.2 Pro is SOTA on ARC-AGI, with two orders of magnitude efficiency improvement over the past year: Alexander Doria / @dorialexander : The one interesting part of frontier release while waiting for actual user feedback is checking which evals haven't rotten yet. Go FrontierMath. Artem Russakovskii / @artemr : GPT-5.2 is a new sota programming model. Exciting. @openaidevs : [Thread] GPT-5.2 is now available in the API, priced at $1.75/1M input and $14/1M output tokens; GPT-5.2 Pro is priced at $21/1M input and $168/1M output tokens LinkedIn: Emil Protalinski : 5 things you should know about OpenAI's GPT-5.2: — 1. OpenAI has launched GPT-5.2, available in Instant, Thinking, and Pro variants … Mac Huffman : Excited to introduce GPT-5.2! OpenAI's new state-of-the-art model for real work. — Built for reliability, depth, and production-grade performance across complex tasks. … Christopher Berry : Today we are sharing GPT-5.2 with the world. This is the first model release that I have been a part of at OpenAI, so it feels particularly special to me. … Adam Hede : OpenAI GPT-5.2 was sort of inevitable... Gemini 3.0 and Opus 4.5 had seriously eaten OpenAIs lunch for a few days now, so OpenAI had to respond, and tonight we have it: GPT-5.2 … Xuedong D. Huang : We're continually pushing the boundaries of what Zoom AI Companion can deliver, and today I'm excited to share another meaningful step forward. … Olivier Godement : We're all-in on building models that have tangible, real-world impact on economically valuable tasks. Be it writing a memo deck, writing code, responding to a sales query. … Marc Manara : GPT-5.2 launched and it is a killer model. It's showing spikes on agentic coding, code review, data analysis tasks, document analysis, and many other areas. … Jeff Wang : It's potentially the most competitive last 30 days in AI Coding history. — GPT-5.2 represents the biggest leap for GPT models … Bluesky: @kashhill : Translation of this paragraph from the system card: — OpenAI employees were able to sext with this version of ChatGPT. openai.com/index/introd... [image] Cameron / @cameron.pfiffer.org : OpenAI announces GPT 5.2 — openai.com/index/intro... Highlights: — “Most advanced model for professional work and long-running agents” — 70.9% on GDPval - wins/ties vs human experts across 44 occupations — SWE-Bench Pro: 55.6% (new SOTA) Forums: Hacker News : GPT-5.2 r/OpenAI : Introducing GPT-5.2 r/NVDA_Stock : Best Model trained on Nvidia r/GithubCopilot : GPT 5.2 released - waiting for it to show up in vs code! Msmash / Slashdot : GPT-5.2 Arrives as OpenAI Scrambles To Respond To Gemini 3's Gains MacRumors Forums : OpenAI Launches GPT-5.2 for ChatGPT Users a Week After Declaring ‘Code Red’

OpenAI 2025-12-12

Chronicles

OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost