MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
Похожие серверы
Sequential Story
An MCP server for problem-solving using Sequential Thinking and Sequential Story mnemonic techniques.
PromptX
An AI capability enhancement system providing professional roles, memory management, and knowledge systems for applications like Claude and Cursor.
TempMail
Create temporary emails and read messages directly from your AI agent.
Google Workspace
Integrates Google Workspace services like Calendar, Drive, and Gmail with AI assistants.
MCP Orchestrator
A universal interface to manage and interact with all your MCP servers from a single point, using external configuration files for mappings and credentials.
Puzld MCP
Expose your local AI agent CLIs through one MCP endpoint
System Resource Monitor MCP Server
Monitors system resources in real-time, including CPU, memory, disk, network, battery, and internet speed.
JIRA
Integrate Atlassian JIRA into any MCP-compatible application to manage issues and projects.
ServiceTitan MCP Server
An MCP server for integrating with the ServiceTitan platform.
Moneybird MCP Server
Connects AI assistants to Moneybird accounting software via its API.