VisionEst. 2020

DocVQA

Document Visual Question Answering evaluates the ability of models to understand and answer questions about document images including forms, invoices, scientific papers, and handwritten notes.

Metrics

ANLS score on document questions

Created By

CVC Barcelona

Top Model Scores

RankModelScoreDate
1Gemini 3 Ultra95.2%2026-01
2GPT-5.294.7%2026-03
3Claude Opus 4.693.8%2026-02
4InternVL 391.4%2026-01
5Qwen2-VL 72B89.6%2025-12