Read URL MCP
Extracts web content from a URL and converts it to clean Markdown format.
Read URL MCP
A Model Context Protocol (MCP) server that provides URL reading capabilities, extracting web content and converting it to clean Markdown format.
Features
- Fetch web content from HTTP/HTTPS URLs
- Extract main content using readability algorithm
- Convert HTML to clean Markdown format
- Configurable timeout and content size limits
Installation
uv install
MCP Setting
{
"mcpServers": {
"read_url_mcp": {
"command": "uv",
"args": [
"run",
"--directory",
"<directory>",
"read_url_mcp/mcp_server"
],
"env": {
"PYTHONPATH": "<directory>"
}
}
}
}
Usage
Run the MCP server:
uv run python read_url_mcp/mcp_server.py
Available Tools
readURLMarkdown(url: str)- Fetches URL content and returns it as Markdown
Development
Code Quality
uv run ruff check # Lint code
uv run ruff format # Format code
uv run ruff check --fix # Auto-fix issues
Dependencies
- mcp[cli] - MCP framework
- requests - HTTP client
- readability-lxml - Content extraction
- html2text - HTML to Markdown conversion
Configuration
Server settings in mcp_server.py:
- Timeout: 30 seconds
- Max content length: 1MB
- User agent: my-mcp-tools/1.0
Máy chủ liên quan
Bright Data
nhà tài trợDiscover, extract, and interact with the web - one interface powering automated access across the public internet.
YouTube Transcript Extractor
Extracts transcripts from public YouTube videos.
ScreenshotOne
Render website screenshots with ScreenshotOne
Github to MCP
Convert GitHub repositories to MCP servers automatically. Extract tools from OpenAPI, GraphQL & REST APIs for Claude Desktop, Cursor, Windsurf, Cline & VS Code. AI-powered code generation creates type-safe TypeScript/Python MCP servers. Zero config setup - just paste a repo URL. Built for AI assistants & LLM tool integration.
Mozilla Readability Parser
Extracts and transforms webpage content into clean, LLM-optimized Markdown using Mozilla's Readability algorithm.
Patchright Lite MCP Server
A server that wraps the Patchright SDK to provide stealth browser automation for AI models.
Outscraper
Access Outscraper's data extraction services for business intelligence, location data, reviews, and contact information from various online platforms.
GitPrism
GitPrism is a fast, token-efficient, stateless pipeline that converts public GitHub repositories into LLM-ready Markdown.
Website to Markdown MCP Server
Fetches and converts website content to Markdown with AI-powered cleanup, OpenAPI support, and stealth browsing.
Docs Fetch MCP Server
Fetch web page content with recursive exploration.
iReader MCP
Tools for reading and extracting content from the internet.