Captures screenshots and performs OCR text recognition.
An MCP server that captures screenshots and performs OCR text recognition.
This server uses two OCR engines:
npx -y @kazuph/mcp-screenshot
Add the following configuration to your claude_desktop_config.json
:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Variable Name | Description | Default Value |
---|---|---|
OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Takes a screenshot and performs OCR.
Options:
region
: Screenshot area ('left'/'right'/'full', default: 'left')format
: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')MIT
kazuph
Transcribe YouTube videos in multiple languages.
Automatically tailors resumes for specific job applications using LibreOffice.
A universal interface to manage and interact with all your MCP servers from a single point, using external configuration files for mappings and credentials.
Interact with Amazon services for product search, cart management, and viewing order history.
Enables AI assistants to interact with Anki flashcard decks via the AnkiConnect plugin.
Integrate with Google Tasks to manage your to-do lists and tasks.
Dynamic and reflective problem-solving through thought sequences
A collection of MCP servers to automate virtual secretary tasks within the Microsoft Outlook suite, managing tools like Mail and Categories.
A Chrome extension that integrates MCP tools with AI platforms like ChatGPT and Gemini, allowing users to execute tools and insert results directly into conversations.
Integrates with the Strateegia API, allowing AI assistants to access and interact with Strateegia projects.