What it does

WebClaw is a high-quality web extraction engine that fetches and cleans web pages (markdown, text, or JSON), handles bot protections automatically using cloud fallback, and provides endpoints for scrape, crawl, map, extract, summarize, diff, and watch. It's optimized for producing LLM-friendly outputs and structured data extraction.

When to use it

Use WebClaw when web_fetch fails (blocked by Cloudflare/DataDome or requires JS rendering), when you need structured extraction (pricing tables, product specs), when crawling sites at scale, or when you need monitoring/diffing of page content over time. Ideal for RAG pipelines and automated research.

What's included

Scripts: CLI and REST API examples; install options and cURL examples in the SKILL.md.
References: API contract details and endpoint examples included in the skill body.
Instructions: detailed endpoint usage for scrape, crawl, map, batch, extract, summarize, diff, brand, search, research, agent-scrape, and watch, plus tips and format guidance.

Compatible agents

Good for agents with network access and API-key support; integrates well with LLM workflows that need reliable web content, and with MCP server setups for self-hosting.

WebClaw

What it does

When to use it

What's included

Compatible agents

Tags

Not yet audited

Information

Related Skills