Turn websites into datasets with Scrapezy
A Model Context Protocol server for Scrapezy that enables AI models to extract structured data from websites.
extract_structured_data
- Extract structured data from a website
To install Scrapezy MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Scrapezy/mcp --client claude
npm install -g @scrapezy/mcp
There are two ways to provide your Scrapezy API key:
Environment Variable:
export SCRAPEZY_API_KEY=your_api_key
npx @scrapezy/mcp
Command-line Argument:
npx @scrapezy/mcp --api-key=your_api_key
To use with Claude Desktop, add the server config:
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"scrapezy": {
"command": "npx @scrapezy/mcp --api-key=your_api_key"
}
}
}
You can use this tool in Claude with prompts like:
Please extract product information from this page: https://example.com/product
Extract the product name, price, description, and available colors.
Claude will use the MCP server to extract the requested structured data from the website.
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
MIT
Use 3,000+ pre-built cloud tools to extract data from websites, e-commerce, social media, search engines, maps, and more
Fast, token-efficient web content extraction that converts websites to clean Markdown. Features Mozilla Readability, smart caching, polite crawling with robots.txt support, and concurrent fetching with minimal dependencies.
An automated tool to search notes, retrieve content, and post comments on Xiaohongshu (RedBook) using Playwright.
MCP Server to let Claude / your AI control the browser
Fetch Bilibili video comments in bulk, including nested replies. Requires a Bilibili cookie for authentication.
Easy web data access. Simplified retrieval of information from websites and online sources.
Access YouTube video transcripts and translations using the YouTube Translate API.
Discover, extract, and interact with the web - one interface powering automated access across the public internet.
Interact with WebScraping.AI for web data extraction and scraping.
Download webpages as markdown files using the r.jina.ai service, with configurable directories and persistent settings.