Scrapezy
chính thứcTurn websites into datasets with Scrapezy
@scrapezy/mcp MCP Server
A Model Context Protocol server for Scrapezy that enables AI models to extract structured data from websites.
Features
Tools
extract_structured_data- Extract structured data from a website- Takes URL and prompt as required parameters
- Returns structured data extracted from the website based on the prompt
- The prompt should clearly describe what data to extract from the website
Installation
Installing via Smithery
To install Scrapezy MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Scrapezy/mcp --client claude
Manual Installation
npm install -g @scrapezy/mcp
Usage
API Key Setup
There are two ways to provide your Scrapezy API key:
-
Environment Variable:
export SCRAPEZY_API_KEY=your_api_key npx @scrapezy/mcp -
Command-line Argument:
npx @scrapezy/mcp --api-key=your_api_key
To use with Claude Desktop, add the server config:
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"scrapezy": {
"command": "npx @scrapezy/mcp --api-key=your_api_key"
}
}
}
Example Usage in Claude
You can use this tool in Claude with prompts like:
Please extract product information from this page: https://example.com/product
Extract the product name, price, description, and available colors.
Claude will use the MCP server to extract the requested structured data from the website.
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
License
MIT
Máy chủ liên quan
Bright Data
nhà tài trợDiscover, extract, and interact with the web - one interface powering automated access across the public internet.
Airbnb MCP Server
Search for Airbnb listings and retrieve detailed information without an API key.
Read URL MCP
Extracts web content from a URL and converts it to clean Markdown format.
Skrapr
An intelligent web scraping tool using AI and browser automation to extract structured data from websites.
Lightpanda Go MCP server
A Go-based MCP server for interacting with the Lightpanda Browser using the Chrome DevTools Protocol (CDP).
Any Browser MCP
Attaches to existing browser sessions using the Chrome DevTools Protocol for automation and interaction.
MCP NPX Fetch
Fetch and transform web content into various formats like HTML, JSON, Markdown, or Plain Text.
Web Scraper Service
A Python-based MCP server for headless web scraping. It extracts the main text content from web pages and outputs it as Markdown, text, or HTML.
Simple MCP Tool Server
A simple MCP server that provides a tool for fetching website content using SSE transport.
Leapfrog
Multi-session browser MCP for AI agents — stealth mode, session pooling, humanization, 10x fewer tokens than Playwright
CrawlForge MCP
CrawlForge MCP is a production-ready MCP server with 18 web scraping tools for AI agents. It gives Claude, Cursor, and any MCP-compatible client the ability to fetch URLs, extract structured data with CSS/XPath selectors, run deep multi-step research, bypass anti-bot detection with TLS fingerprint randomization, process documents, monitor page changes, and more. Credit-based pricing with a free tier (1,000 credits/month, no credit card required).