MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)
OCR Engines
This server uses two OCR engines:
- yomitoku (primary OCR engine)
  - High-accuracy Japanese text recognition
  - Runs as an API server
- Fallback OCR engine
  - Used when yomitoku is unavailable
  - Supports both Japanese and English recognition
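The primary/fallback arrangement can be sketched as follows. This is a minimal illustration, not the server's actual code: `OcrEngine`, `OcrResult`, and `recognizeWithFallback` are hypothetical names, and the sketch assumes that an unavailable primary engine shows up as a rejected promise.

```typescript
// Hypothetical engine signature: takes image bytes, resolves to recognized text.
type OcrEngine = (image: Uint8Array) => Promise<string>;

interface OcrResult {
  text: string;
  // Records which engine produced the text, useful for logging.
  engine: "primary" | "fallback";
}

// Try the primary engine first. If it throws (for example because its API
// server is not running), retry the same image with the fallback engine.
async function recognizeWithFallback(
  image: Uint8Array,
  primary: OcrEngine,
  fallback: OcrEngine,
): Promise<OcrResult> {
  try {
    return { text: await primary(image), engine: "primary" };
  } catch {
    // An error from the fallback itself is not caught here and propagates.
    return { text: await fallback(image), engine: "fallback" };
  }
}
```

Keeping the engines behind one function signature means the caller does not need to know which engine answered unless it inspects the `engine` field.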
Installation
npx -y @kazuph/mcp-screenshot
Claude Desktop Configuration
Add the following configuration to your claude_desktop_config.json:
{
  "mcpServers": {
    "screenshot": {
      "command": "npx",
      "args": ["-y", "@kazuph/mcp-screenshot"],
      "env": {
        "OCR_API_URL": "http://localhost:8000"
      }
    }
  }
}
Environment Variables
| Variable Name | Description | Default Value |
|---|---|---|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |
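As an illustration of how a variable with a default like this is commonly resolved, here is a small sketch. The function name `resolveOcrApiUrl` is hypothetical. The empty-value handling and trailing-slash stripping are assumptions for the example, not documented behavior of this server.

```typescript
// Default taken from the table above.
const DEFAULT_OCR_API_URL = "http://localhost:8000";

// `env` is passed in explicitly so the function is easy to test; in Node.js
// a caller would typically pass `process.env`.
function resolveOcrApiUrl(env: Record<string, string | undefined>): string {
  const raw = (env["OCR_API_URL"] ?? "").trim();
  // Assumption: an unset or blank value means "use the default".
  const base = raw.length > 0 ? raw : DEFAULT_OCR_API_URL;
  // Strip trailing slashes so paths can be appended consistently.
  return base.replace(/\/+$/, "");
}
```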
Usage Example
You can use it by instructing Claude like this:
Please take a screenshot of the left half of the screen and recognize the text in it.
Tool Specification
capture
Takes a screenshot and performs OCR.
Options:
- region: Screenshot area ('left'/'right'/'full', default: 'left')
- format: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')
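The option values and defaults listed above could be validated along these lines. This is a sketch only: `normalizeCaptureOptions`, `isOneOf`, and the error messages are hypothetical and not taken from the server's source. Only the allowed values and defaults come from the specification above.

```typescript
// Allowed values, as listed in the tool specification.
const REGIONS = ["left", "right", "full"] as const;
const FORMATS = ["json", "markdown", "vertical", "horizontal"] as const;

type Region = (typeof REGIONS)[number];
type Format = (typeof FORMATS)[number];

interface CaptureOptions {
  region: Region;
  format: Format;
}

// Type guard: true when `value` is one of the allowed string literals.
function isOneOf<T extends string>(allowed: readonly T[], value: unknown): value is T {
  return allowed.some((item) => item === value);
}

// Fill in the documented defaults and reject values outside the allowed sets.
function normalizeCaptureOptions(
  input: { region?: unknown; format?: unknown } = {},
): CaptureOptions {
  const region = input.region ?? "left";
  const format = input.format ?? "markdown";
  if (!isOneOf(REGIONS, region)) {
    throw new Error(`Invalid region: ${String(region)}`);
  }
  if (!isOneOf(FORMATS, format)) {
    throw new Error(`Invalid format: ${String(format)}`);
  }
  return { region, format };
}
```

With this sketch, calling `normalizeCaptureOptions({ region: "full" })` yields the `full` region with the default `markdown` format, while an unknown region such as `"top"` raises an error.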
License
MIT
Author
kazuph