PDF MCP Server
Extract text and data from PDF files using pdfplumber by providing local file paths.
PDF MCP Server
A simple Model Context Protocol (MCP) server that provides PDF reading capabilities using pdfplumber.
Features
- extract_text: Extract all text content from PDF files
- get_pdf_info: Get basic PDF information (page count, metadata, dimensions)
- extract_tables: Extract tables from PDF files
Installation
Option 1: Without installation (using uvx)
You can run the server directly without installing it:
uvx --from git+https://github.com/gstiebler/pdf-mcp-server.git pdf-mcp-server
Option 2: Local installation
Clone the repository and install dependencies:
git clone https://github.com/gstiebler/pdf-mcp-server.git
cd pdf-mcp-server
uv sync
Usage
If installed locally:
python main.py
The server runs using stdio transport and can be integrated with any MCP client.
Available Tools
extract_text
Extracts all text content from a PDF file.
Parameters:
file_path(string): Path to the PDF file
Returns:
- Complete text content from all pages
get_pdf_info
Gets basic information about a PDF file.
Parameters:
file_path(string): Path to the PDF file
Returns:
- Dictionary containing:
page_count: Number of pagesmetadata: PDF metadata (if available)page_dimensions: Width and height of the first page
extract_tables
Extracts tables from a PDF file.
Parameters:
file_path(string): Path to the PDF filepage_number(integer, optional): Specific page number to extract from (1-indexed)
Returns:
- List of tables with:
page: Page number where the table was founddata: Table data as a list of rows (each row is a list of cell values)
Integration with Claude Desktop
To use this server with Claude Desktop, add it to your Claude Desktop configuration:
Option 1: Direct from GitHub (recommended)
{
"mcpServers": {
"pdf-reader": {
"command": "uvx",
"args": ["--from", "git+https://github.com/gstiebler/pdf-mcp-server.git", "pdf-mcp-server"]
}
}
}
Option 2: From local installation
{
"mcpServers": {
"pdf-reader": {
"command": "python",
"args": ["/path/to/pdf-mcp-server/main.py"]
}
}
}
Requirements
- Python 3.10+
- pdfplumber
- mcp
Máy chủ liên quan
ZIP MCP Server
Compress and decompress ZIP files, with support for multi-file packaging, password protection, and encryption.
MCP File Edit
Perform file system operations such as reading, writing, patching, and managing directories.
IDE MEMORY MCP
IDE Memory MCP gives AI coding agents a persistent memory layer that works across any IDE supporting the Model Context Protocol. Write project context once — the AI remembers it everywhere.
Music Collection MCP Server
An MCP server for intelligent access to your local music collection, featuring metadata management, band discovery, and advanced querying.
Excel/CSV MCP Server
Read, analyze, and manipulate data in Excel (XLSX, XLS) and CSV files with advanced filtering and analytics.
Basic Memory
Build a persistent, local knowledge base in Markdown files through conversations with LLMs.
YggTorrent
A server to programmatically interact with the YggTorrent file-sharing platform.
File System MCP Server
A server for comprehensive file and directory management on the local file system.
FilerMoverMcp
A file mover tool that stages and executes file moves safely. Works as both a CLI tool and an MCP server for AI agents.
MCP Start App
An MCP server for local file management and system operations.