MCP Screenshot
Captures screenshots and performs OCR text recognition.
MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
-
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server
-
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
region: Screenshot area ('left'/'right'/'full', default: 'left')format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
License
MIT
Author
kazuph
संबंधित सर्वर
Stitch MCP
The Stitch MCP server enables AI assistants to interact with Stitch for vibe design: generating UI designs from text and images, and accessing project and screen details.
Inked
A memory management server for Claude apps with optional AI-powered search, using local SQLite storage.
Serpstat API MCP Server
A TypeScript server that integrates Serpstat SEO API with Anthropic's Model Context Protocol (MCP), enabling AI assistants like Claude to access comprehensive SEO data and analysis tools.
MCP Wait Timer Server
A simple tool to pause execution for a specified number of seconds.
Hyperweb
A server for interacting with the Hyperweb platform and its tools using AI agents.
文颜 MCP Server
A Markdown formatting tool to apply themes and publish articles to WeChat Official Accounts.
Feishu Project Management
An MCP server for interacting with the Feishu project management system, enabling AI assistants to manage projects.
Divide and Conquer
Breaks down complex tasks into manageable pieces and stores them in structured JSON.
Xwiki MCP
MCP server for XWiki REST API — read your wiki from Claude and other AI agents
Interactive Leetcode MCP
An MCP server enabling guided DSA learning with AI on leetcode.com