Evaluation

Exact Match

Quick Answer

An evaluation metric where answers are marked correct only if they exactly match the reference.

Exact match (EM) is strict evaluation: only perfect matches are correct. EM is used for QA and math problems. EM is strict but objective. Partial credit (word overlap, semantic similarity) would be more lenient. EM might penalize reasonable variations. For reproducibility, EM is good. EM is standard in many benchmarks.

Last verified: 2026-04-08

Compare models

See how different LLMs compare on benchmarks, pricing, and speed.

Browse all models →