2025-10-03
Exciting to see coverage of APEX and the larger shifts happening in the human data industry in TIME as well! https://time.com/...
Mercor
Mercor launches the AI Productivity Index (APEX), which evaluates AI models' ability to perform “economically valuable knowledge work”; GPT-5 leads at 64.2%
still not production-ready Nikita Ostrovsky / Time : AI Is Learning to Do the Jobs of Doctors, Lawyers, and Consultants arXiv.org : The AI Productivity Index (APEX) Agnee Ghosh / B...
Excited to share Mercor's first benchmark! The team is already hard at work expanding the richness of this eval for the next iteration and including even more valuable job categories such as peptide dealer and chief of staff. See the full paper here: https://arxiv.org/...
Mercor
Mercor launches the AI Productivity Index (APEX), which evaluates AI models' ability to perform “economically valuable knowledge work”; GPT-5 leads at 64.2%
still not production-ready Nikita Ostrovsky / Time : AI Is Learning to Do the Jobs of Doctors, Lawyers, and Consultants arXiv.org : The AI Productivity Index (APEX) Agnee Ghosh / B...