unstructured-mcp
Provides unstructured document processing capabilities using the Unstructured API.
A Model Context Protocol server that provides unstructured document processing capabilities. This server enables LLMs to extract and use content from an unstructured document.
This repo is work in progress, proceed with caution :)
Supported file types:
{".abw", ".bmp", ".csv", ".cwk", ".dbf", ".dif", ".doc", ".docm", ".docx", ".dot",
".dotm", ".eml", ".epub", ".et", ".eth", ".fods", ".gif", ".heic", ".htm", ".html",
".hwp", ".jpeg", ".jpg", ".md", ".mcw", ".mw", ".odt", ".org", ".p7s", ".pages",
".pbd", ".pdf", ".png", ".pot", ".potm", ".ppt", ".pptm", ".pptx", ".prn", ".rst",
".rtf", ".sdp", ".sgl", ".svg", ".sxg", ".tiff", ".txt", ".tsv", ".uof", ".uos1",
".uos2", ".web", ".webp", ".wk2", ".xls", ".xlsb", ".xlsm", ".xlsx", ".xlw", ".xml",
".zabw"}
Prerequisites: You'll need:
- Unstructured API key. Learn how to obtain one here
- Claude Desktop installed locally
Quick TLDR on how to add this MCP to your Claude Desktop:
- Clone the repo and set up the UV environment.
- Create a
.envfile in the root directory and add the following env variable:UNSTRUCTURED_API_KEY. - Run the MCP server:
uv run doc_processor.py - Go to
~/Library/Application Support/Claude/and create aclaude_desktop_config.json. In that file add:
{
"mcpServers": {
"unstructured_doc_processor": {
"command": "PATH/TO/YOUR/UV",
"args": [
"--directory",
"ABSOLUTE/PATH/TO/YOUR/unstructured-mcp/",
"run",
"doc_processor.py"
],
"disabled": false
}
}
}
- Restart Claude Desktop. You should now be able to use the MCP.
相關伺服器
Browser MCP
Automate your local browser
Confluence MCP Server
Programmatically access and search Confluence spaces, pages, and content using its REST API.
WayStation
A universal remote MCP server that connects to popular productivity tools such as Notion, Monday, AirTable, and many more.
OneNote MCP
An MCP server for Microsoft OneNote that supports personal notebooks and caches credentials for authentication.
Google Spreadsheet MCP
Full Google Sheets integration - read, write, format cells, create charts, use formulas, and manage spreadsheets.
PaperlessMCP
MCP server for Paperless-ngx document management. 43 tools for AI-powered document organization - full CRUD on documents, tags, correspondents, document types, storage paths, and custom fields.
Asana
Interact with the Asana API to manage tasks, projects, and workspaces.
Google Sheets MCP
A server for interacting with Google Sheets, allowing you to read, write, and manage spreadsheet data.
Date-time Tools
A server for date-time manipulation and timezone conversion.
Logseq MCP Server
Interact with your Logseq knowledge base to create pages, manage blocks, and organize information programmatically.