MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
Related Servers
PM Copilot
Triangulates customer support tickets and feature requests to generate prioritized product plans with convergence scoring and PII scrubbing.
Screenshot MCP Server
Capture Windows screenshots from AI assistants in WSL, with options for specific monitors or windows.
Adspirer Ads Manager
Manage digital advertising campaigns with AI-powered insights from Adspirer.com.
Linear MCP Server
Manage projects, issues, and teams using the Linear API.
Maid-MCP
An MCP server that gives Claude Desktop a maid personality, complete with a Japanese-accented voice, visual avatar, and speech recognition.
Notion MCP Server
Official MCP server for the Notion API, enabling real-time interactions via Server-Sent Events (SSE).
Salesforce MCP
Interact with the Salesforce API using jsforce, requiring username and password for authentication.
LAPRAS MCP Server
An MCP server for lapras.com, providing access to career-related tools.
Sheet-Cello
A specialized Google Sheets integration server that allows the LLM to read, write, and manage spreadsheet data in real-time. This server supports cell-level manipulation, bulk range updates, and full worksheet retrieval, enabling the model to perform data analysis, logging, and automated reporting directly within Google Worksheets.If you have functions which take range value then first read the sheet and decide where user is asking to add data and define range by your own.Provides 46 tools for Gsheet
n8n MCP Server
Manage n8n workflows, executions, and credentials through the Model Context Protocol.