Models like o3 and Gemini 2.5 Pro feel like “Jagged AGI”: unreliable, even at some mundane tasks, but still offering superhuman capabilities in many areas.
Amid today's AI boom, it's disconcerting that we still don't know how to measure how smart, creative, or empathetic these systems are.