MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
Related Servers
Harvest MCP Server
Manage time tracking, projects, clients, and tasks using the Harvest API.
Squad AI
Product‑discovery and strategy platform integration. Create, query and update opportunities, solutions, outcomes, requirements and feedback from any MCP‑aware LLM.
WP-MCP
Manage and publish WordPress content directly from your AI assistant — no PHP required. Supports both STDIO and Streamable HTTP for broad client compatibility.
icd10-cpt-crosswalk
MCP server for converting ICD-10 diagnosis codes to CPT procedure codes for medical billing
Xero
Interact with the Xero Accounting Software API.
Linear MCP Server
An MCP server for interacting with the Linear API, allowing AI agents to manage issues, projects, and teams.
MCP Educational Tutor
An intelligent tutoring server that uses GitHub documentation repositories to provide structured educational prompts and tools.
Tachikoma Router Management MCP
Manage various router types using natural language. Requires router credentials to be configured.
Claw2Immich
claw2immich is a Python MCP (Model Context Protocol) server that exposes selected Immich Picture App,
Itemit
A simple notes system for Itemit, built with TypeScript.