unstructured-mcp
Provides unstructured document processing capabilities using the Unstructured API.
A Model Context Protocol server that provides unstructured document processing capabilities. This server enables LLMs to extract and use content from an unstructured document.
This repo is work in progress, proceed with caution :)
Supported file types:
{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
".zabw"}
Prerequisites: You'll need:
- Unstructured API key. Learn how to obtain one here
- Claude Desktop installed locally
Quick TLDR on how to add this MCP to your Claude Desktop:
- Clone the repo and set up the UV environment.
- Create a
.envfile in the root directory and add the following env variable:UNSTRUCTURED_API_KEY. - Run the MCP server:
uv run doc_processor.py - Go to
~/Library/Application Support/Claude/and create aclaude_desktop_config.json. In that file add:
{
"mcpServers": {
"unstructured_doc_processor": {
"command": "PATH/TO/YOUR/UV",
"args": [
"--directory",
"ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/",
"run",
"doc_processor.py"
],
"disabled": false
}
}
}
- Restart Claude Desktop. You should now be able to use the MCP.
Related Servers
itemit-mcp
An MCP server for asset tracking that connects to the itemit asset management API.
HireBase
Interact with the HireBase Job API to manage job listings and applications.
Fathom Analytics
Access and analyze Fathom Analytics data and reports
Feishu MCP Server
Access and manage Feishu documents for AI coding tools, enabling structured content retrieval, editing, and search.
Doc Reading and Converter
A server for reading and converting documents between PDF, DOCX, and Markdown formats using marker-pdf and pandoc.
Research MCP Server
A research server that retrieves and creates survey data pages in Notion.
Todo List
A server that provides a comprehensive API for managing todo items.
Bear MCP Server
Access and manage your Bear Notes from a local SQLite database.
Kit.com (formerly ConvertKit) MCP
Manage your email lists, subscribers, broadcasts, sequences, and more through natural language.
Redmine
An MCP server for interacting with the Redmine project management system.