Captures screenshots and performs OCR text recognition.
An MCP server that captures screenshots and performs OCR text recognition.
This server uses two OCR engines:
npx -y @kazuph/mcp-screenshot
Add the following configuration to your claude_desktop_config.json
:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Variable Name | Description | Default Value |
---|---|---|
OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Takes a screenshot and performs OCR.
Options:
region
: Screenshot area ('left'/'right'/'full', default: 'left')format
: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')MIT
kazuph
A multi-agent mesh network designed for completing AI tasks in parallel.
Human-in-the-loop platform - Allow AI agents and automations to send requests for approval to your gotoHuman inbox.
Fetch and summarize YouTube videos by extracting their titles, descriptions, and transcripts.
Upload videos to YouTube using OAuth2 authentication. Requires a Google OAuth 2.0 client secret file.
Connect to the Taskade platform via MCP. Access tasks, projects, workflows, and AI agents in real-time through a unified workspace and API.
Interact with Ramp's Developer API to run analysis on your spend and gain insights leveraging LLMs
AI-powered task orchestration and workflow automation with specialized agent roles, intelligent task decomposition, and seamless integration across Claude Desktop, Cursor IDE, Windsurf, and VS Code.
Manage and interact with Google Calendar events using the Google Calendar API.
Connect with 10,000+ tools across HRIS, ATS, CRM, Accounting, Calendar, Meeting, Ticketing, and more categories.
Simple unofficial MCP server to track time via Toggl API