AI Model Leaderboard
Compare AI models across agentic, reasoning, coding, and tool-use benchmarks.
Benchmarks
agentic
reasoning
coding
tool-use
computer-use
LiveBench
Contamination-free benchmark updated monthly with fresh questions across reasoning, math, coding, data analysis, and instruction following. Avoids leakage by using new problems from recent sources.
Reasoning31 models · % accuracy