MCP App Store

AI Model Leaderboard

Compare AI models across agentic, reasoning, coding, and tool-use benchmarks.

Agentic

GAIA · WebArena · WebArena-Infinity · τ³-bench · TheAgentCompany · VisualWebArena · HAL · METR Time Horizon · BrowseComp · BrowseComp (w/ Context Manage) · MCP-Atlas (Public Set) · Vending Bench 2

HAL

Holistic Agent Leaderboard (Princeton, ICLR 2026). A meta-leaderboard that aggregates GAIA, SWE-bench, TAU-bench, CORE-Bench, USACO, and other benchmarks, with cost-performance Pareto analysis. New model updates have been paused as of 2026 while the project focuses on reliability.

Agentic · 0 models · % accuracy

No scores yet

No models have been scored on this benchmark yet.
