AI Model Leaderboard
Compare AI models across agentic, reasoning, coding, and tool-use benchmarks.
Benchmarks
agentic
reasoning
coding
tool-use
computer-use
HAL
Holistic Agent Leaderboard (Princeton, ICLR 2026). Meta-leaderboard aggregating GAIA, SWE-bench, TAU-bench, CORE-Bench, USACO, and more with cost-performance Pareto analysis. Paused new model updates as of 2026; focusing on reliability.
Agentic0 models · % accuracy
No scores yet
No models have been scored on this benchmark yet.