2025-10-03
Evaluations like APEX are HUGELY important for guiding the direction of AI research and measuring the impact it will have on the economy. Mercor is leading the charge here in a big way. Huge congrats to @BrendanFoody and the entire team.
Mercor
Mercor launches the AI Productivity Index (APEX), which evaluates AI models' ability to perform “economically valuable knowledge work”; GPT-5 leads at 64.2%
still not production-ready
Nikita Ostrovsky / Time: AI Is Learning to Do the Jobs of Doctors, Lawyers, and Consultants
arXiv.org: The AI Productivity Index (APEX)
Agnee Ghosh / B...