Web Scraping MCP Servers

Compare MCP servers for page extraction, crawling, structured data scraping, browser rendering, and agent workflows that need reliable web data.

Matching MCP servers

These results come from the existing MCP Servers directory; there is no separate topic-specific database.

AgentKey
Unified MCP server giving any AI agent access to 20+ social platforms, web search, scraping, and crypto data — one install, any MCP client.
CrawlForge MCP
CrawlForge MCP is a production-ready MCP server with 18 web scraping tools for AI agents. It gives Claude, Cursor, and any MCP-compatible client the ability to fetch URLs, extract structured data with CSS/XPath selectors, run deep multi-step research, bypass anti-bot detection with TLS fingerprint randomization, process documents, monitor page changes, and more. Credit-based pricing with a free tier (1,000 credits/month, no credit card required).
Opengraph.io
Open Graph data, web scraping, and screenshot features in a handy MCP tool.
Pylon
20+ pay-per-request APIs for AI agents — screenshots, web scraping, PDF, OCR, search, QR codes, translation & more. No API keys needed. Pay with USDC via x402. npm: @pylonapi/mcp
Open Crawler MCP Server
A web crawler and content extractor that supports multiple output formats like text, markdown, and JSON.
Open Crawler MCP Server
A web crawler and text extractor with robots.txt compliance, rate limiting, and page size protection.
Firecrawl MCP (Official)
Adds powerful web scraping and search capabilities to LLM clients like Cursor and Claude.
Apify (Official)
Official Apify MCP server for AI agents to run Actors, extract website data, and automate web scraping and crawling workflows.
Airtable MCP Server
Apify-hosted MCP server for Airtable with 15 tools. Full CRUD for records, tables, fields, search, and schema inspection. No local setup needed.
ClickUp MCP Server
Apify-hosted MCP server for ClickUp with 20 tools. Tasks, spaces, folders, lists, views, docs, and custom fields. No local setup needed.
Storybook MCP Server
Apify-hosted MCP server for Storybook. Browse components, inspect props, read stories, capture screenshots. Supports Storybook 6/7/8.
Webflow MCP Server
Apify-hosted MCP server for Webflow with 22+ tools. Sites, CMS collections, pages, content management, and publishing. No local setup needed.

When Web Scraping MCP is a good fit

Extract structured page data for research, lead enrichment, monitoring, and internal datasets.

Combine crawling, search, and browser rendering when agents need more than a single static page.

Route scraping through purpose-built APIs or controlled browser sessions instead of ad hoc scripts.
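
As a rough illustration of the routing idea above, the sketch below maps a few traits of a scraping job to a backend category. The `Task` fields, the category labels, and the page-count threshold are all hypothetical choices made for this example; they are not taken from any server listed on this page.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Traits of a scraping job (hypothetical fields for illustration)."""
    needs_javascript: bool = False   # content is rendered client-side
    needs_interaction: bool = False  # clicks, logins, or form input required
    page_count: int = 1              # rough number of pages to fetch
    anti_bot: bool = False           # site is known to block simple clients


def choose_backend(task: Task) -> str:
    """Return a backend category label; the threshold is an arbitrary assumption."""
    if task.needs_interaction:
        return "browser-automation"
    if task.anti_bot or task.page_count > 100:
        return "scraping-api"
    if task.needs_javascript:
        return "rendering-scraper"
    if task.page_count > 1:
        return "crawler"
    return "static-fetch"
```

Interaction is checked first because a job that needs clicks or a login cannot be served by a fetch-only API, whatever its size.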

Setup checklist

  1. Choose a scraping server based on whether you need static extraction, JavaScript rendering, crawling, or proxy-backed APIs.
  2. Configure API keys, proxy settings, browser permissions, and rate limits with the smallest useful scope.
  3. Add the server command or remote endpoint to your MCP client configuration.
  4. Test a small extraction task and confirm the output includes source URLs, limits, and the structured fields you expect.
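
Step 3 might look something like the sketch below, which builds a client configuration containing one local server command and one remote endpoint. It assumes a client that reads a JSON file with a top-level `mcpServers` object, a convention several MCP clients use; the exact file location and key names depend on your client, so check its documentation. The package name `example-scraper-mcp`, the variable `SCRAPER_API_KEY`, and the endpoint URL are placeholders, not real products.

```python
import json


def build_mcp_config(api_key: str) -> dict:
    """Build a client config with one local and one remote server entry."""
    return {
        "mcpServers": {
            # Local server that the client starts as a subprocess.
            "example-scraper": {
                "command": "npx",
                "args": ["-y", "example-scraper-mcp"],  # hypothetical package
                "env": {
                    # Hypothetical variable name; pass only the key this server needs.
                    "SCRAPER_API_KEY": api_key,
                },
            },
            # Remote server reached over HTTP; the key name varies by client.
            "example-remote": {
                "url": "https://mcp.example.com/mcp",  # placeholder endpoint
            },
        }
    }


if __name__ == "__main__":
    # Print the JSON you would paste into the client's config file.
    print(json.dumps(build_mcp_config("YOUR_API_KEY"), indent=2))
```

Keeping the key in the per-server `env` block, rather than in a global environment, is one way to follow the "smallest useful scope" advice in step 2.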

How to choose

  • Check support for JavaScript rendering, pagination, crawling depth, selectors, screenshots, retries, and timeouts.
  • Prefer servers that return source-aware structured data and make request limits visible.
  • Use dedicated scraping APIs for scale or anti-bot complexity, and browser automation for interactive or visual workflows.
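
One way to test whether a server returns source-aware data with visible limits is to run a small extraction and check the result programmatically. The sketch below assumes a result shaped as a dictionary with `source_url`, `limits`, and `data` keys; that shape is hypothetical, and real servers will name and nest these fields differently.

```python
from urllib.parse import urlparse


def check_extraction(result: dict, required_fields: list[str]) -> list[str]:
    """Return a list of problems found in an extraction result (empty if none)."""
    problems = []

    # The result should say which page the data came from.
    source = result.get("source_url")
    parsed = urlparse(source) if isinstance(source, str) else None
    if parsed is None or parsed.scheme not in ("http", "https") or not parsed.netloc:
        problems.append("missing or invalid source_url")

    # Request limits (quota, credits, rate) should be visible to the caller.
    if "limits" not in result:
        problems.append("no request-limit information")

    # Every expected structured field should be present and non-empty.
    data = result.get("data")
    if not isinstance(data, dict):
        data = {}
    for name in required_fields:
        if data.get(name) in (None, ""):
            problems.append(f"missing field: {name}")

    return problems
```

For example, `check_extraction(result, ["title", "price"])` returns an empty list only when the result carries a valid source URL, limit information, and both fields.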

Web Scraping MCP FAQ

What is Web Scraping MCP?

Web Scraping MCP connects an AI client to extraction, crawling, or browser-rendering tools so agents can fetch pages and turn web content into structured data through MCP.
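
To make "through MCP" concrete: MCP messages are JSON-RPC 2.0, and a client invokes a server tool with the `tools/call` method. The sketch below only builds such a request as a string. The tool name `scrape_url` and its `url` argument are hypothetical; each server defines its own tool names and argument schemas, which a client normally discovers by listing the server's tools first.

```python
import json
from itertools import count

# Simple counter so each request gets a distinct JSON-RPC id.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize a JSON-RPC 2.0 request that asks an MCP server to run a tool."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


if __name__ == "__main__":
    # Hypothetical tool name and argument, for illustration only.
    print(build_tool_call("scrape_url", {"url": "https://example.com"}))
```

In practice an MCP client library handles this framing and the transport for you; the sketch is only meant to show what travels between the AI client and the scraping server.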

How is Web Scraping MCP different from Browser Automation MCP?

Web scraping focuses on extraction, crawling, and data pipelines. Browser automation focuses on interaction, UI state, testing, and live page control. They overlap when a site needs JavaScript rendering or logged-in sessions.

Is web scraping with MCP safe?

It is safest when you rate-limit requests, respect site terms and robots.txt rules, avoid collecting sensitive data, and keep long-running jobs scoped to approved domains.
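
These safeguards can be sketched as a small guard that runs before each fetch. The class below is an illustration only: `PoliteFetchGuard` and its parameters are hypothetical names, it assumes the caller supplies one robots.txt body that applies to every approved domain (a real crawler would load one per host), and it performs no network requests itself. It uses the standard library's `urllib.robotparser` for the robots.txt check.

```python
import time
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser


class PoliteFetchGuard:
    """Domain allowlist, robots.txt check, and minimum delay between requests."""

    def __init__(self, allowed_domains, robots_txt,
                 user_agent="example-agent", min_interval=1.0):
        self.allowed_domains = set(allowed_domains)
        self.user_agent = user_agent      # hypothetical agent name
        self.min_interval = min_interval  # seconds between requests
        self._last_request = None
        self._robots = RobotFileParser()
        self._robots.parse(robots_txt.splitlines())

    def is_allowed(self, url: str) -> bool:
        """True only if the host is approved and robots.txt permits the path."""
        host = urlparse(url).hostname or ""
        if host not in self.allowed_domains:
            return False
        return self._robots.can_fetch(self.user_agent, url)

    def wait_turn(self) -> None:
        """Sleep just long enough to keep min_interval between requests."""
        if self._last_request is not None:
            remaining = self.min_interval - (time.monotonic() - self._last_request)
            if remaining > 0:
                time.sleep(remaining)
        self._last_request = time.monotonic()
```

A caller would check `is_allowed(url)` and then call `wait_turn()` before handing the URL to whichever scraping tool it uses. Site terms and sensitive-data rules still need human review; code like this covers only the mechanical checks.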