2025-10-03
Huge shoutout to my incredible colleagues, who have worked extremely hard on APEX this year - super fascinating work evaluating models on consulting, legal, financial, and medical tasks And so fun to see that my former HLS Professor, @CassSunstein, collaborated with @mercor_ai
Mercor
Mercor launches the AI Productivity Index (APEX), which evaluates AI models' ability to perform “economically valuable knowledge work”; GPT-5 leads at 64.2%
still not production-ready Nikita Ostrovsky / Time : AI Is Learning to Do the Jobs of Doctors, Lawyers, and Consultants arXiv.org : The AI Productivity Index (APEX) Agnee Ghosh / B...