2024-11-10
Financial Times
1 related
Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests
Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
Loading articles...