AI-powered web scraping using the ScrapeGraph AI API. Requires an API key.
A production-ready Model Context Protocol (MCP) server that provides seamless integration with the ScapeGraph AI API. This server enables language models to leverage advanced AI-powered web scraping capabilities with enterprise-grade reliability.
The server provides the following enterprise-ready tools:
markdownify(website_url: str)
: Transform any webpage into clean, structured markdown formatsmartscraper(user_prompt: str, website_url: str)
: Leverage AI to extract structured data from any webpagesearchscraper(user_prompt: str)
: Execute AI-powered web searches with structured, actionable resultsTo utilize this server, you'll need a ScapeGraph API key. Follow these steps to obtain one:
For automated installation of the ScrapeGraph API Integration Server using Smithery:
npx -y @smithery/cli install @ScrapeGraphAI/scrapegraph-mcp --client claude
Update your Claude Desktop configuration file with the following settings (located on the top rigth of the Cursor page):
(remember to add your API key inside the config)
{
"mcpServers": {
"@ScrapeGraphAI-scrapegraph-mcp": {
"command": "npx",
"args": [
"-y",
"@smithery/cli@latest",
"run",
"@ScrapeGraphAI/scrapegraph-mcp",
"--config",
"\"{\\\"scrapegraphApiKey\\\":\\\"YOUR-SGAI-API-KEY\\\"}\""
]
}
}
}
The configuration file is located at:
%APPDATA%/Claude/claude_desktop_config.json
~/Library/Application\ Support/Claude/claude_desktop_config.json
Add the ScrapeGraphAI MCP server on the settings:
The server enables sophisticated queries such as:
The server implements robust error handling with detailed, actionable error messages for:
When running on Windows systems, you may need to use the following command to connect to the MCP server:
C:\Windows\System32\cmd.exe /c npx -y @smithery/cli@latest run @ScrapeGraphAI/scrapegraph-mcp --config "{\"scrapegraphApiKey\":\"YOUR-SGAI-API-KEY\"}"
This ensures proper execution in the Windows environment.
This project is distributed under the MIT License. For detailed terms and conditions, please refer to the LICENSE file.
Special thanks to tomekkorbak for his implementation of oura-mcp-server, which served as starting point for this repo.
Made with ❤️ by ScrapeGraphAI Team
Fetch Bilibili video comments in bulk, including nested replies. Requires a Bilibili cookie for authentication.
A Java-based MCP server for interacting with the Crawl4ai web scraping API.
An MCP server for advanced web crawling, content extraction, and AI-powered analysis using the crawl4ai library.
Playwright MCP server
Fetches and converts website content to Markdown with AI-powered cleanup, OpenAPI support, and stealth browsing.
Fetch YouTube subtitles
An automated tool to search notes, retrieve content, and post comments on Xiaohongshu (RedBook) using Playwright.
A server for browser automation using Playwright, providing powerful tools for web scraping, testing, and automation.
Enable AI agents to get structured data from unstructured web with AgentQL.
Download webpages as markdown files using the r.jina.ai service, with configurable directories and persistent settings.