2024-12-25
Time
A look at the more challenging AI evaluations emerging in response to the rapid progress of models, including FrontierMath, Humanity's Last Exam, and RE-Bench
Despite their expertise, AI developers don't always know what their most advanced systems are capable of—at least, not at first. X: @tharin_p and @tharin_p X: @tharin_p : My latest piece for @TIME con...
2024-04-16
Artificial Intelligence Index
17 related
Stanford's AI Index report: training top AI models is way more expensive, AI still trails humans on complex tasks, people are more nervous about AI, and more
customer support, customer acquisition, and personalization all between 22-26%. [image] @stanfordhai : The #AIIndex2024 tracks the rise of multimodal models, major cash investments into generative AI,...
Loading articles...