MCP Mistral OCR
Perform OCR on local files and URLs (images, PDFs) using the Mistral AI API.
MCP Mistral OCR
An MCP server that provides OCR capabilities using Mistral AI's OCR API. This server can process both local files and URLs, supporting images and PDFs.
Features
- Process local files (images and PDFs) using Mistral's OCR
- Process files from URLs with explicit file type specification
- Support for multiple file formats (JPG, PNG, PDF, etc.)
- Results saved as JSON files with timestamps
- Docker containerization
- UV package management
Environment Variables
MISTRAL_API_KEY: Your Mistral AI API keyOCR_DIR: Directory path for local file processing. Inside the container, this is always mapped to/data/ocr
Installation
Installing via Smithery
To install Mistral OCR for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @everaldo/mcp/mistral-crosswalk --client claude
Using Docker
- Build the Docker image:
docker build -t mcp-mistral-ocr .
- Run the container:
docker run -e MISTRAL_API_KEY=your_api_key -e OCR_DIR=/data/ocr -v /path/to/local/files:/data/ocr mcp-mistral-ocr
Local Development
- Install UV package manager:
pip install uv
- Create and activate virtual environment:
uv venv
source .venv/bin/activate # On Unix
# or
.venv\Scripts\activate # On Windows
- Install dependencies:
uv pip install .
Claude Desktop Configuration
Add this configuration to your claude_desktop_config.json:
{
"mcpServers": {
"mistral-ocr": {
"command": "docker",
"args": [
"run",
"-i",
"--rm",
"-e",
"MISTRAL_API_KEY",
"-e",
"OCR_DIR",
"-v",
"C:/path/to/your/files:/data/ocr",
"mcp-mistral-ocr:latest"
],
"env": {
"MISTRAL_API_KEY": "<YOUR_MISTRAL_API_KEY>",
"OCR_DIR": "C:/path/to/your/files"
}
}
}
}
Available Tools
1. process_local_file
Process a file from the configured OCR_DIR directory.
{
"name": "process_local_file",
"arguments": {
"filename": "document.pdf"
}
}
2. process_url_file
Process a file from a URL. Requires explicit file type specification.
{
"name": "process_url_file",
"arguments": {
"url": "https://example.com/document",
"file_type": "image" // or "pdf"
}
}
Output
OCR results are saved in JSON format in the output directory inside OCR_DIR. Each result file is named using the following format:
- For local files:
{original_filename}_{timestamp}.json - For URLs:
{url_filename}_{timestamp}.jsonorurl_document_{timestamp}.jsonif no filename is found in the URL
The timestamp format is YYYYMMDD_HHMMSS.
Supported File Types
- Images: JPG, JPEG, PNG, GIF, WebP
- Documents: PDF and other document formats supported by Mistral OCR
Limitations
- Maximum file size: 50MB (enforced by Mistral API)
- Maximum document pages: 1000 (enforced by Mistral API)
Related Servers
Siri Shortcuts
List, open, and run shortcuts from the macOS Shortcuts app.
Yachtsy MCP Server
Search, compare, and track sailboats with real-time listings, market insights, specs, and price history.
Chronica
Persistent memory MCP server for Claude Desktop — remembers context, time, and topics across sessions
Google Sheets
A server that connects to the Google Sheets API, enabling AI-driven spreadsheet automation and data manipulation.
ClearPolicy
ClearPolicy is a document signing and compliance tracking tool for organizations. Once connected, your AI assistant can import documents, send signature requests, track who has and hasn't signed, and manage your contacts — all by prompt.
Things 3 Extended
A desktop extension for the Things 3 task manager, providing advanced features like task movement, editing, and backups.
Jira-pilot
About AI-powered Jira CLI and MCP server for humans and agents manage issues, sprints, boards with interactive wizards, multi-provider AI
PapersFlow
Turn any AI agent into an academic researcher that can search, read, cite, and write full literature reviews autonomously.
Pulsetic MCP Server
The Pulsetic MCP Server connects Pulsetic monitoring with AI agents and MCP-compatible tools, enabling direct access to uptime data, cron monitoring results, incident management workflows, and status page information through the Model Context Protocol (MCP). It allows teams to securely expose operational monitoring data in a structured format, making it easy to build AI-driven automation, monitoring assistants, and intelligent operational workflows without custom middleware.
mpesa-mcp
MCP server for M-Pesa (Safaricom Daraja) and Africa's Talking APIs. Gives AI coding assistants — Claude Code, Cursor, GitHub Copilot — direct access to East African payment and SMS infrastructure from a single server. What it does: STK Push payments via Safaricom Daraja (triggers M-Pesa prompt on user's phone) Transaction status queries SMS to 20+ African telecom networks via Africa's Talking Airtime top-up across East and West Africa Safety: All 5 tools are annotated per MCP 2025-03-26 spec — payment and SMS tools declare destructiveHint: true, so Claude Desktop and other clients show confirmation dialogs before executing. Query tools declare readOnlyHint: true for auto-approval. Install: pip install mpesa-mcp Who it's for: Developers building AI agents for East African markets. M-Pesa handles ~$50B/year in transactions and reaches 50M+ users. Africa's Talking reaches 300M+ phones across 20+ telecoms.
