Doc Reading and Converter
A server for reading and converting documents between PDF, DOCX, and Markdown formats using marker-pdf and pandoc.
Document Reading and Converter Tool
The goal is to have a tool that can read and edit files, using Cursor or Claude Desktop.
The filesystem MCP already allows reading, writing and editing documents, but it has two limitations:
- It cannot read PDF or DOCX files
- It reads the full document, which can cause context overflow
To address these issues, we create:
- A fresh doc-reading-mcp MCP service allowing for document conversions between PDF, DOCX and Markdown.
Features
- PDF to Markdown conversion using marker-pdf
- DOCX to Markdown conversion using pandoc
- Markdown to DOCX conversion using pandoc
- Markdown to PDF conversion using pandoc
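The pandoc-based conversions listed above can be sketched as a thin wrapper around the pandoc command line. This is a minimal illustration, not the server's actual implementation: the names `build_pandoc_command` and `convert` are hypothetical, and the PDF to Markdown path (handled by marker-pdf) is deliberately left out.

```python
import subprocess
from pathlib import Path

# Input extensions mapped to pandoc reader names (assumption: only the
# pandoc-based conversions from the feature list are covered here).
INPUT_FORMATS = {".docx": "docx", ".md": "markdown"}
# Output extensions this sketch accepts; pandoc infers the writer from "-o".
OUTPUT_EXTENSIONS = {".md", ".docx", ".pdf"}


def build_pandoc_command(src, dst):
    """Build the pandoc argument list for converting src to dst (hypothetical helper)."""
    src, dst = Path(src), Path(dst)
    reader = INPUT_FORMATS.get(src.suffix.lower())
    if reader is None:
        raise ValueError(f"unsupported input format: {src.suffix}")
    if dst.suffix.lower() not in OUTPUT_EXTENSIONS:
        raise ValueError(f"unsupported output format: {dst.suffix}")
    return ["pandoc", "-f", reader, str(src), "-o", str(dst)]


def convert(src, dst):
    """Run pandoc; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_pandoc_command(src, dst), check=True)
```

Calling `convert("report.docx", "report.md")` would run `pandoc -f docx report.docx -o report.md`. Note that PDF output from pandoc typically also requires a PDF engine such as a LaTeX installation.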
Prerequisites
- Python 3.10 or higher
- [pandoc](https://pandoc.org/installing.html) installed on your system
- [uv](https://docs.astral.sh/uv/) for Python package management
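As a quick sanity check before installing, the prerequisites above can be verified from Python. This is a small sketch using only the standard library; the function name `check_prerequisites` is hypothetical and not part of the package.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 10), tools=("pandoc", "uv")):
    """Return a list of problems found; an empty list means everything is in place."""
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted} or higher is required")
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH
        if shutil.which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(problem)
```

The script prints nothing when Python, pandoc and uv are all available.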
Installation
uv add doc-reading-mcp
Or install directly from the source:
uv add git+https://github.com/mffrydman/doc-reading-mcp.git
Usage
Run as a standalone server
uv run mcp install -m doc_reading_mcp
Or run directly:
uv run -m doc_reading_mcp
Use with MCP Inspector
npx @modelcontextprotocol/inspector uv run -m doc_reading_mcp
Configure in Claude Desktop, Cursor or Windsurf
Add this to your MCP Configuration:
{
  "mcpServers": {
    "doc-reading-mcp": {
      "command": "uvx",
      "args": [
        "doc-reading-mcp"
      ]
    }
  }
}
To run from a local checkout in Cursor, Windsurf or Claude, use the following entry under mcpServers instead:
"doc-reading-mcp": {
  "command": "uv",
  "args": [
    "--directory",
    "/absolute/path/to/mffrydman/doc-reading-mcp",
    "run",
    "-m",
    "doc_reading_mcp"
  ]
}
Replace /absolute/path/to/ with the actual path on your system.
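To avoid typos when filling in the path, the local-checkout entry can be generated with a short script. This is only a sketch: the helper name `local_server_entry` is hypothetical, and it assumes the script is given the directory where the repository was cloned.

```python
import json
from pathlib import Path


def local_server_entry(repo_dir):
    """Build the doc-reading-mcp entry with repo_dir resolved to an absolute path."""
    repo = Path(repo_dir).expanduser().resolve()
    return {
        "doc-reading-mcp": {
            "command": "uv",
            "args": ["--directory", str(repo), "run", "-m", "doc_reading_mcp"],
        }
    }


if __name__ == "__main__":
    # Prints a complete mcpServers block for the current directory
    print(json.dumps({"mcpServers": local_server_entry(".")}, indent=2))
```

Running it from inside the cloned repository prints valid JSON that can be pasted into the MCP configuration file.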