Scrapezy
公式Turn websites into datasets with Scrapezy
@scrapezy/mcp MCP Server
A Model Context Protocol server for Scrapezy that enables AI models to extract structured data from websites.
Features
Tools
extract_structured_data- Extract structured data from a website- Takes URL and prompt as required parameters
- Returns structured data extracted from the website based on the prompt
- The prompt should clearly describe what data to extract from the website
Installation
Installing via Smithery
To install Scrapezy MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Scrapezy/mcp --client claude
Manual Installation
npm install -g @scrapezy/mcp
Usage
API Key Setup
There are two ways to provide your Scrapezy API key:
-
Environment Variable:
export SCRAPEZY_API_KEY=your_api_key npx @scrapezy/mcp -
Command-line Argument:
npx @scrapezy/mcp --api-key=your_api_key
To use with Claude Desktop, add the server config:
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"scrapezy": {
"command": "npx @scrapezy/mcp --api-key=your_api_key"
}
}
}
Example Usage in Claude
You can use this tool in Claude with prompts like:
Please extract product information from this page: https://example.com/product
Extract the product name, price, description, and available colors.
Claude will use the MCP server to extract the requested structured data from the website.
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
License
MIT
関連サーバー
Bright Data
スポンサーDiscover, extract, and interact with the web - one interface powering automated access across the public internet.
MCP Node Fetch
Fetch web content using the Node.js undici library.
MCP Chrome Server
A server for browser automation using Google Chrome, based on the MCP framework.
Web Scout
An MCP server for web search and content extraction using DuckDuckGo.
Website Snapshot
A MCP server that provides comprehensive website snapshot capabilities using Playwright. This server enables LLMs to capture and analyze web pages through structured accessibility snapshots, network monitoring, and console message collection.
Unchained Sky
Browser automation MCP server that connects AI agents to your real Chrome browser with structured page understanding in ~500 tokens
comet-mcp
Connect Claude Code to Perplexity Comet browser for agentic web browsing, deep research, and real-time task monitoring
MCP YouTube Transcript Server
Retrieves transcripts from YouTube videos for content analysis and processing.
HTML to Markdown MCP
Fetch web pages and convert HTML to clean, formatted Markdown. Handles large pages with automatic file saving to bypass token limits.
ScrAPI MCP Server
A server for scraping web pages using the ScrAPI API.
TheirStack
Search job postings, companies, and technology stacks across 10M+ companies.