MCPBench
by modelscope
Comprehensive evaluation benchmark for MCP servers focusing on task accuracy, latency, and token usage.
What it does
MCPBench is an evaluation framework designed to rigorously test and compare the performance of Model Context Protocol (MCP) servers. It provides a standardized way to measure how different servers handle specific task categories, helping developers optimize for accuracy and efficiency.
Tools
As a benchmarking framework, MCPBench evaluates the following capabilities of target servers:
- Web Search Evaluation: Tests accuracy and latency for web-based retrieval tasks.
- Database Query Evaluation: Benchmarks the ability to interact with and query databases.
- GAIA Evaluation: Tests general AI assistant capabilities in complex real-world scenarios.
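For each of these categories, the benchmark boils per-task results down to the headline metrics named above (task accuracy, latency, and token usage). As a minimal sketch of that aggregation step, the snippet below averages a list of per-task records; the record fields (`correct`, `latency_s`, `tokens`) are illustrative assumptions, not MCPBench's actual result schema.

```python
# Hypothetical aggregation of per-task benchmark records into summary
# metrics (accuracy, average latency, average token usage).
# Field names are assumptions for illustration only.
from statistics import mean

def aggregate(results):
    """results: list of dicts with 'correct', 'latency_s', 'tokens' keys."""
    return {
        "accuracy": mean(1.0 if r["correct"] else 0.0 for r in results),
        "avg_latency_s": mean(r["latency_s"] for r in results),
        "avg_tokens": mean(r["tokens"] for r in results),
    }

results = [
    {"correct": True, "latency_s": 1.2, "tokens": 480},
    {"correct": False, "latency_s": 0.8, "tokens": 350},
]
print(aggregate(results))
```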
Installation
To run the benchmark, clone the repository and set up the environment:
conda create -n mcpbench python=3.11 -y
conda activate mcpbench
pip install -r requirements.txt
Configure your servers in the configs folder and run the evaluation scripts (e.g., sh evaluation_websearch.sh your_config.json).
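The exact config schema isn't shown here; a plausible sketch of a server entry is below, but every field name is an assumption for illustration — check the repository's configs folder for the real format.

```json
{
  "servers": [
    {
      "name": "example-websearch-server",
      "command": "npx example-search-server",
      "args": ["--port", "8080"]
    }
  ]
}
```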
Supported hosts
- claude
Quick install
pip install -r requirements.txt
Information
- Pricing: free
- Published: 4/15/2026