Scrapezy
oficialTurn websites into datasets with Scrapezy
@scrapezy/mcp MCP Server
A Model Context Protocol server for Scrapezy that enables AI models to extract structured data from websites.
Features
Tools
extract_structured_data- Extract structured data from a website- Takes URL and prompt as required parameters
- Returns structured data extracted from the website based on the prompt
- The prompt should clearly describe what data to extract from the website
Installation
Installing via Smithery
To install Scrapezy MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Scrapezy/mcp --client claude
Manual Installation
npm install -g @scrapezy/mcp
Usage
API Key Setup
There are two ways to provide your Scrapezy API key:
-
Environment Variable:
export SCRAPEZY_API_KEY=your_api_key npx @scrapezy/mcp -
Command-line Argument:
npx @scrapezy/mcp --api-key=your_api_key
To use with Claude Desktop, add the server config:
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"scrapezy": {
"command": "npx @scrapezy/mcp --api-key=your_api_key"
}
}
}
Example Usage in Claude
You can use this tool in Claude with prompts like:
Please extract product information from this page: https://example.com/product
Extract the product name, price, description, and available colors.
Claude will use the MCP server to extract the requested structured data from the website.
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
License
MIT
Servidores relacionados
Bright Data
patrocinadorDiscover, extract, and interact with the web - one interface powering automated access across the public internet.
LinkedIn MCP
Scrape LinkedIn profiles and companies, get recommended jobs, and perform job searches.
Urlbox Full Page Screenshots
An MCP server for the Urlbox Screenshot API. It enables your client to take screenshots, generate PDFs, extract HTML/markdown, and more from websites.
Fetch MCP Server
Fetches web content from a URL and converts it from HTML to markdown for easier consumption by LLMs.
Intercept
Give your AI the ability to read the web. Fetches URLs as clean markdown with 9 fallback strategies. Handles tweets, YouTube, arXiv, PDFs, and regular pages.
Reddit MCP
A server to browse, search, and read content on Reddit using the Reddit API.
Website to Markdown MCP Server
Fetches and converts website content to Markdown with AI-powered cleanup, OpenAPI support, and stealth browsing.
HasData
HasData APIs - Google SERP, Amazon, Zillow, Indeed, Maps, and more
transcriptor-mcp
An MCP server (stdio + HTTP/SSE) that fetches video transcripts/subtitles via yt-dlp, with pagination for large responses. Supports YouTube, Twitter/X, Instagram, TikTok, Twitch, Vimeo, Facebook, Bilibili, VK, Dailymotion. Whisper fallback — transcribes audio when subtitles are unavailable (local or OpenAI API). Works with Cursor and other MCP host
Fetch
Fetch web content in various formats like HTML, JSON, plain text, and Markdown.
MCP Video Download URL Parser
Download watermark-free videos from platforms like Douyin and TikTok.