Scrapezy
Extract structured data from websites using the Scrapezy API.
@scrapezy/mcp MCP Server
A Model Context Protocol server for Scrapezy that enables AI models to extract structured data from websites.
Features
Tools
extract_structured_data - Extract structured data from a website
- Takes URL and prompt as required parameters
- Returns structured data extracted from the website based on the prompt
- The prompt should clearly describe what data to extract from the website
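For readers wiring up their own MCP client, the call can be sketched as the JSON-RPC 2.0 `tools/call` request that MCP uses for tool invocation. This is a minimal sketch, not taken from the Scrapezy source: the helper name `buildExtractRequest` is hypothetical, and the argument keys `url` and `prompt` are assumed from the parameter description above.

```typescript
// JSON-RPC 2.0 request shape that MCP uses to invoke a tool.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: {
    name: string;
    arguments: Record<string, string>;
  };
}

// Hypothetical helper: builds the request for the extract_structured_data tool.
function buildExtractRequest(id: number, url: string, prompt: string): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: {
      name: "extract_structured_data",
      // Assumption: the argument keys are "url" and "prompt".
      arguments: { url, prompt },
    },
  };
}

const extractRequest = buildExtractRequest(
  1,
  "https://example.com/product",
  "Extract the product name, price, description, and available colors.",
);
console.log(JSON.stringify(extractRequest, null, 2));
```

A specific prompt that names the fields to extract, as in the example, follows the guidance that the prompt should clearly describe the wanted data.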
Installation
Installing via Smithery
To install Scrapezy MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Scrapezy/mcp --client claude
Manual Installation
npm install -g @scrapezy/mcp
Usage
API Key Setup
There are two ways to provide your Scrapezy API key:
- Environment Variable:
  export SCRAPEZY_API_KEY=your_api_key
  npx @scrapezy/mcp
- Command-line Argument:
  npx @scrapezy/mcp --api-key=your_api_key
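A wrapper script that wants to accept the key either way could look it up roughly as follows. This is an illustration only, not the server's actual code: the function name `resolveApiKey` is hypothetical, and the precedence (flag over environment variable) is an assumption, since the text above does not say which source wins when both are set.

```typescript
// Hypothetical helper; not part of @scrapezy/mcp.
function resolveApiKey(
  argv: string[],
  env: Record<string, string | undefined>,
): string | undefined {
  const prefix = "--api-key=";
  // Assumption: an explicit flag overrides the environment variable.
  const flag = argv.find((arg) => arg.startsWith(prefix));
  if (flag !== undefined && flag.length > prefix.length) {
    return flag.slice(prefix.length);
  }
  // Fall back to the environment variable named in the setup steps.
  const fromEnv = env["SCRAPEZY_API_KEY"];
  return fromEnv !== undefined && fromEnv !== "" ? fromEnv : undefined;
}
```

In a Node script this would be called as `resolveApiKey(process.argv.slice(2), process.env)`; an `undefined` result means no key was supplied by either route.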
To use with Claude Desktop, add the server config:
On macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "scrapezy": {
      "command": "npx",
      "args": ["-y", "@scrapezy/mcp", "--api-key=your_api_key"]
    }
  }
}
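If the config file already lists other servers, the Scrapezy entry has to be merged in rather than pasted over them. The sketch below shows one way to do that on the parsed JSON. The names `withScrapezyServer`, `McpServerEntry`, and `ClaudeDesktopConfig` are hypothetical, and it passes the key through the entry's `env` field instead of the `--api-key` flag, which assumes the server reads `SCRAPEZY_API_KEY` as described under API Key Setup.

```typescript
// Shape of one server entry in claude_desktop_config.json.
interface McpServerEntry {
  command: string;
  args?: string[];
  env?: Record<string, string>;
}

interface ClaudeDesktopConfig {
  mcpServers?: Record<string, McpServerEntry>;
  [key: string]: unknown;
}

// Hypothetical helper: returns a new config with a "scrapezy" entry added,
// leaving any other servers and top-level settings untouched.
function withScrapezyServer(config: ClaudeDesktopConfig, apiKey: string): ClaudeDesktopConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      scrapezy: {
        command: "npx",
        args: ["-y", "@scrapezy/mcp"],
        // Assumption: the key is supplied via the environment variable.
        env: { SCRAPEZY_API_KEY: apiKey },
      },
    },
  };
}

const existingConfig: ClaudeDesktopConfig = {
  mcpServers: { other: { command: "node", args: ["server.js"] } },
};
console.log(JSON.stringify(withScrapezyServer(existingConfig, "your_api_key"), null, 2));
```

The function does not mutate its input, so the original config can be kept as a backup before the result is written back to disk.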
Example Usage in Claude
You can use this tool in Claude with prompts like:
Please extract product information from this page: https://example.com/product
Extract the product name, price, description, and available colors.
Claude will use the MCP server to extract the requested structured data from the website.
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
License
MIT