MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
เซิร์ฟเวอร์ที่เกี่ยวข้อง
Kone.vc
ผู้สนับสนุนMonetize your AI agent with contextual product recommendations
Google Tasks
Manage your Google Tasks with an AI assistant using natural language.
md-pdf-mcp
Converts Markdown to styled PDFs using VS Code's markdown styling and Python's ReportLab.
Linear
Query and search for issues in Linear, a project management tool.
Weather Service MCP Server
A simple note-saving system with a custom URI scheme, providing tools to add and summarize notes.
Apple Notes
Interact with Apple Notes using natural language on macOS.
TranscriptionTools
An MCP server for intelligent transcript processing, including formatting, repair, and summarization.
GitHub Knowledge Base MCP
MCP server to connect to and use any public GitHub repo as knowledge base (markdown/notebook docs)
EndNote MCP Service
Reads EndNote .enl libraries and exposes their contents through the MCP interface.
Kultur.dev
Cultural intelligence MCP server providing AI-powered cultural context analysis, localization guidance, and cross-cultural communication insights for 195 countries.
Quire MCP Server
Interact with Quire.io projects and tasks using the Quire API, enabling AI assistants to manage your workflow.