MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
관련 서버
sharepoint-mcp
The MCP Server that gives your AI agent a brain for Microsoft SharePoint
t-pane MCP Server
Manage tmux panes and execute commands within tmux sessions.
Ramp
Interact with Ramp's Developer API to run analysis on your spend and gain insights leveraging LLMs
Fastmail MCP Server
An open-source server that gives AI assistants full access to Fastmail email, calendars, and contacts over JMAP.
Backlog MCP Server
An MCP server for interacting with the Backlog API, a project management and collaboration tool.
Saber
Find buying signals for companies and contacts
Windows-MCP
A lightweight MCP server for interacting with the Windows Operating System.
Atlassian
Interact with Atlassian tools like Confluence and Jira.
Office 365 Calendar
Access Office 365 Calendar data, providing workday insights and productivity analytics.
JMX MCP Server
Provides JMX monitoring and management capabilities for AI assistants. Requires Java 17+.