CodeEst. 2021

MBPP (Mostly Basic Python Problems)

MBPP consists of around 1,000 crowd-sourced Python programming problems designed to be solvable by entry-level programmers. Each problem includes a task description, code solution, and three automated test cases.

Metrics

Pass@1 (%) on ~1,000 Python problems

Created By

Google Research

Top Model Scores

RankModelScoreDate
1Claude Opus 4.693.8%2026-02
2GPT-5.293.2%2026-03
3Gemini 3 Ultra91.7%2026-01
4DeepSeek Coder V390.4%2026-01
5Grok 489.6%2026-02