MCP Server Convert
Document conversion MCP server — PDF, DOCX, HTML, EPUB to Markdown with 6 tools and Docker support
mcp-server-convert
A lightweight Model Context Protocol (MCP) server that converts documents to Markdown. Supports PDF, DOCX, HTML, EPUB, CSV, JSON, and plain text files.
Perfect for AI agents that need to ingest and understand document content.
Features
- 📄 Multi-format support: PDF, DOCX, HTML, EPUB, CSV, JSON, images (via OCR), and plain text
- 🔧 6 MCP tools:
convert_file,convert_url,list_supported_formats,batch_convert,extract_metadata,convert_directory - 🐍 Zero external dependencies for core: Uses Python standard library +
markdownifyfor HTML - ⚡ Fast: In-memory processing, no temp files
- 🐳 Docker-ready: Single Dockerfile, one command deploy
Quick Start
Install & Run
# Clone
git clone https://github.com/demo112/mcp-server-convert.git
cd mcp-server-convert
# Install dependencies
pip install -r requirements.txt
# Run
python -m mcp_server_convert
Configure in Claude Code
Add to your MCP settings (~/.claude/settings.json):
{
"mcpServers": {
"convert": {
"command": "python",
"args": ["-m", "mcp_server_convert"],
"cwd": "/path/to/mcp-server-convert"
}
}
}
Docker
docker build -t mcp-server-convert .
docker run -i --rm mcp-server-convert
Configure with Docker
{
"mcpServers": {
"convert": {
"command": "docker",
"args": ["run", "-i", "--rm", "-v", "/path/to/files:/data", "mcp-server-convert"]
}
}
}
Tools
convert_file
Convert a local file to Markdown.
Parameters:
file_path(string, required): Absolute path to the filemax_length(int, optional): Maximum output length in chars (default: 50000)
convert_url
Fetch a URL and convert its content to Markdown.
Parameters:
url(string, required): URL to fetch and convertmax_length(int, optional): Maximum output length in chars (default: 50000)
batch_convert
Convert multiple files at once.
Parameters:
file_paths(array of strings, required): List of file pathsmax_length_per_file(int, optional): Max length per file (default: 50000)
convert_directory
Convert all supported files in a directory.
Parameters:
dir_path(string, required): Path to directoryrecursive(bool, optional): Include subdirectories (default: true)max_files(int, optional): Maximum files to convert (default: 20)
extract_metadata
Extract metadata from a file without full conversion.
Parameters:
file_path(string, required): Path to the file
list_supported_formats
List all supported file extensions and their conversion methods.
Supported Formats
| Format | Extension | Method |
|---|---|---|
.pdf | PyMuPDF (fitz) | |
| Word | .docx | python-docx |
| HTML | .html, .htm | markdownify |
| EPUB | .epub | ebooklib |
| CSV | .csv | pandas → markdown table |
| JSON | .json | Formatted markdown code block |
| XML | .xml | xmltodict → markdown |
| Excel | .xlsx | openpyxl → markdown table |
| PowerPoint | .pptx | python-pptx → markdown slides |
| Text | .txt, .md, .rst, .log | Direct passthrough |
| Images | .png, .jpg | pytesseract OCR (if available) |
Support
If this tool helps your workflow, consider supporting its development:
- GitHub Sponsors: Sponsor via Liberapay
- ETH:
0xddD9f45e14c92846f47C1c1A4431aC2b41D87273
License
MIT
関連サーバー
Filesystem MCP Server
Provides file system operations, analysis, and manipulation capabilities through a standardized tool interface.
Fast Filesystem MCP
A high-performance Model Context Protocol (MCP) server that provides secure filesystem access and AI-optimized code development tools for Claude and other AI assistants.
SharePoint MCP Server
Browse and interact with Microsoft SharePoint sites and documents.
Akyn AI
Knowledge bases for AI agents via MCP
Local Utilities
Provides essential utility tools for text processing, file operations, and system tasks.
Music Collection MCP Server
An MCP server for intelligent access to your local music collection, featuring metadata management, band discovery, and advanced querying.
Obsidian MCP Server - Enhanced
Provides comprehensive access to an Obsidian vault, allowing AI agents to read, write, search, and manage notes via the Local REST API plugin.
Filesystem MCP Server
A secure server for filesystem operations with controlled access to specified directories.
HDFS MCP Server
Access and manage files on HDFS clusters using the MCP protocol, supporting operations like upload, download, move, and copy.
Cortex
Ontology driven knowledge system with formal OWL-RL reasoning, SPARQL graph + SQLite dual store, and self-improving memory tiers. 22 MCP tools for capture, search, reasoning, graph operations, and diagnostics. Local-first.