The benchmarks AI companies brag about are obsolete: Here’s what’s replacing them
Artificial Analysis has overhauled how the AI industry measures intelligence, replacing traditional benchmarks with tests that measure whether AI can complete actual work tasks. Michael Nuñez reports for VentureBeat. The independent benchmarking organization removed three widely cited tests from its Intelligence Index, including MMLU-Pro and AIME 2025. The new version 4.0 introduces 10 evaluations focused …