MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
Verwandte Server
Kone.vc
SponsorMonetize your AI agent with contextual product recommendations
Apple Shortcuts
An MCP Server Integration with Apple Shortcuts
MCP Kanban Memory
Manage complex AI agent workflows with a Kanban-based task management system.
Taiwan Holiday
Provides Taiwan national holidays and compensatory workday information. Data is fetched and cached automatically.
macOS Automator
Run AppleScript and JXA scripts to automate tasks on macOS.
TimeMCP
A server for time and timezone conversion tools.
NPX-MCP
A TypeScript server for automation and integration, featuring Microsoft OAuth, browser control, and basic utilities.
Obsidian Semantic MCP Server
An AI-optimized MCP server for Obsidian that consolidates over 21 tools into 5 intelligent operations with contextual workflow hints.
WooCommerce MCP Server
An MCP server for integrating with the WooCommerce e-commerce platform.
Feishu/Lark OpenAPI
Connects AI agents to the Feishu/Lark platform for automating tasks like document processing, conversation management, and calendar scheduling.
Guck MCP
Guck is a tiny, MCP-first telemetry store for agentic debugging