OpenAI’s o3 model scores 25% on advanced mathematics test
OpenAI’s new language model o3 has achieved a 25% success rate on FrontierMath, a challenging mathematics dataset. The announcement, discussed in a blog post by the Xena Project, represents both progress and limitations in AI’s mathematical capabilities. The test consists of hundreds of complex mathematical problems requiring numerical answers that can be automatically verified. According …