MathEst. 2021

MATH

The MATH benchmark consists of 12,500 challenging competition mathematics problems from AMC, AIME, and Olympiad competitions. Problems span seven subjects: Prealgebra, Algebra, Number Theory, Counting and Probability, Geometry, Intermediate Algebra, and Precalculus.

Metrics

Accuracy (%) on competition math problems

Created By

Dan Hendrycks et al.

Top Model Scores

RankModelScoreDate
1GPT-5.289.6%2026-03
2Claude Opus 4.688.9%2026-02
3Gemini 3 Ultra87.3%2026-01
4DeepSeek Math V285.1%2026-01
5Grok 484.7%2026-02