MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
相關伺服器
Kone.vc
贊助Monetize your AI agent with contextual product recommendations
SSE Calculator
A stateful calculator server using Server-Sent Events (SSE) for real-time communication.
Follow Plan
Track and manage AI implementation plans.
WunderTrading MCP Server
WunderTrading MCP connects AI agents to live crypto trading execution on 20+ supported exchanges through a single MCP integration. Use it to turn AI-driven signals, market analysis, sentiment, screenshots, and custom strategy logic into real trades on exchanges including Binance, Bybit, Coinbase, Bitget, OKX, KuCoin, Hyperliquid, and BingX. Supports MCP and REST API workflows for order execution, trade management, and AI-powered automation.
McpMux
McpMux — configure MCP servers once, connect every AI client through one local gateway.
PowerPoint MCP Server
Manipulate PowerPoint presentations using the python-pptx library.
DeepSRT
Summarize YouTube videos using the DeepSRT API.
Jira Thing
An example MCP server for interacting with Jira, deployable on Cloudflare Workers.
Obsidian
Manage Obsidian vaults with knowledge graph operations and AI-powered features.
CoverSavvy Term Life Rates
Get indicative term life insurance rates for a variety of ages, updated recently.
Handwriting OCR
Recognize and extract text from handwritten documents using the Handwriting OCR service.