MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
Server Terkait
Kone.vc
sponsorMonetize your AI agent with contextual product recommendations
PM33 MCP Server
AI-native product management MCP server with 17 tools and 11 resources. WSJF backlog optimization, portfolio scheduling, Monte Carlo forecasting, velocity analytics, competitive intelligence, strategic alignment, PRD generation, sprint management. Integrates with Jira, Linear, and Asana.
Video Editor
Add, analyze, search, and edit videos using the Video Jungle API. Also supports local video search on macOS.
Google Sheets (Go)
A Go-based MCP server for integrating Google Sheets with Claude.
MCP Server for Bring! Shopping
Interact with the Bring! shopping list API via a local MCP server.
DaVinci Resolve MCP
An MCP server integration for the DaVinci Resolve video editing software.
Unmarkdown
The document publishing layer for AI tools: Create, style, and publish formatted documents from any MCP client.
Xwiki MCP
MCP server for XWiki REST API — read your wiki from Claude and other AI agents
Docmost
An open-source collaborative wiki and documentation software with AI integration via MCP.
RevenueCat to Adapty Migration
A server for migrating subscription businesses from RevenueCat to Adapty, requiring a RevenueCat API key.
Jotform
An MCP server which brings Jotform to your AI client or LLM