
# Doc Scraper

by Sriram-PR
Convert technical documentation sites into clean Markdown for LLM ingestion and RAG pipelines.
## What it does

Doc Scraper is a high-performance Go-based web crawler designed to transform complex documentation websites into structured Markdown files. It eliminates web clutter, preserves site hierarchy, and optimizes content for Large Language Models (LLMs), making it a useful tool for building RAG (Retrieval-Augmented Generation) systems.

## Tools

- `list_sites`: Lists all configured sites from the config file.
- `get_page`: Fetches a single URL and returns its content as Markdown.
- `crawl_site`: Starts a background crawl for a specific site.
- `get_job_status`: Checks the progress of a background crawl job.
- `search_crawled`: Searches previously crawled content within JSONL files.

## Installation

Add the following to your `claude_desktop_config.json`:

```json
{
  "mcpServers": {
    "doc-scraper": {
      "command": "/path/to/doc-scraper",
      "args": ["mcp-server", "-config", "/path/to/config.yaml"]
    }
  }
}
```

## Supported hosts

Confirmed support for Claude Desktop, Cursor, and Claude Code.
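The installation snippet above passes a `config.yaml` to the server, but this page does not show that file's schema. As a purely hypothetical sketch of what a per-site crawl configuration might contain (every field name here is an assumption, not the tool's documented format):

```yaml
# Hypothetical config.yaml sketch -- field names are illustrative
# assumptions, not doc-scraper's documented schema.
sites:
  go-docs:
    start_url: https://go.dev/doc/        # where the crawl begins
    allowed_prefix: https://go.dev/doc/   # stay within this URL prefix
    output_dir: ./crawled/go-docs         # where Markdown/JSONL output lands
    max_depth: 3                          # limit crawl depth
```

Consult the project's own documentation for the real configuration keys.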
## Quick install

```shell
go install github.com/Sriram-PR/doc-scraper/cmd/doc-scraper@latest
```
## Information

- Pricing: free
- Published: 4/18/2026