unstructured-mcp
Provides unstructured document processing capabilities using the Unstructured API.
A Model Context Protocol server that provides unstructured document processing capabilities. This server enables LLMs to extract and use content from an unstructured document.
This repo is work in progress, proceed with caution :)
Supported file types:
{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
".zabw"}
Prerequisites: You'll need:
- Unstructured API key. Learn how to obtain one here
- Claude Desktop installed locally
Quick TLDR on how to add this MCP to your Claude Desktop:
- Clone the repo and set up the UV environment.
- Create a
.envfile in the root directory and add the following env variable:UNSTRUCTURED_API_KEY. - Run the MCP server:
uv run doc_processor.py - Go to
~/Library/Application Support/Claude/and create aclaude_desktop_config.json. In that file add:
{
"mcpServers": {
"unstructured_doc_processor": {
"command": "PATH/TO/YOUR/UV",
"args": [
"--directory",
"ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/",
"run",
"doc_processor.py"
],
"disabled": false
}
}
}
- Restart Claude Desktop. You should now be able to use the MCP.
Related Servers
Amazon
Interact with Amazon services for product search, cart management, and viewing order history.
Plane
The official Plane MCP server provides integration with Plane APIs, enabling full AI automation of Plane projects, work items, cycles and more.
Desktop Automation
Automate desktop actions and interact with your local environment using LLM applications.
Homelab MCP Server
Manage and monitor homelab systems via SSH.
Mermaid-MCP
Generat 22 types mermaid architecture diagrams with natural language description
Israeli Bank MCP
Manage Israeli bank accounts and transactions.
Overleaf
Access and analyze Overleaf projects and LaTeX files through Git integration.
PaperlessMCP
MCP server for Paperless-ngx document management. 43 tools for AI-powered document organization - full CRUD on documents, tags, correspondents, document types, storage paths, and custom fields.
DAISYS
Generate high-quality text-to-speech and text-to-voice outputs using the DAISYS platform.
Limitless MCP
Connect your Limitless Pendant data to Claude and other LLMs using the Limitless API.