AI Model Evaluation Benchmarks

A list of mainstream AI model evaluation benchmarks. View benchmark descriptions and model rankings.

About Evaluation Benchmarks

Each evaluation benchmark is designed to measure specific capabilities of AI models. You can evaluate model performance from diverse angles including mathematical reasoning, coding, and comprehensive understanding. Data is updated regularly to reflect the latest model scores.