MCP Read Images
Analyze images using OpenRouter's vision models. Requires an OpenRouter API key.
MCP Read Images
An MCP server for analyzing images using OpenRouter vision models. This server provides a simple interface to analyze images using various vision models like Claude-3.5-sonnet and Claude-3-opus through the OpenRouter API.
Installation
npm install @catalystneuro/mcp_read_images
Configuration
The server requires an OpenRouter API key. You can get one from OpenRouter.
Add the server to your MCP settings file (usually located at ~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json for VSCode):
{
"mcpServers": {
"read_images": {
"command": "read_images",
"env": {
"OPENROUTER_API_KEY": "your-api-key-here",
"OPENROUTER_MODEL": "anthropic/claude-3.5-sonnet" // optional, defaults to claude-3.5-sonnet
},
"disabled": false,
"autoApprove": []
}
}
}
Usage
The server provides a single tool analyze_image that can be used to analyze images:
// Basic usage with default model
use_mcp_tool({
server_name: "read_images",
tool_name: "analyze_image",
arguments: {
image_path: "/path/to/image.jpg",
question: "What do you see in this image?" // optional
}
});
// Using a specific model for this call
use_mcp_tool({
server_name: "read_images",
tool_name: "analyze_image",
arguments: {
image_path: "/path/to/image.jpg",
question: "What do you see in this image?",
model: "anthropic/claude-3-opus-20240229" // overrides default and settings
}
});
Model Selection
The model is selected in the following order of precedence:
- Model specified in the tool call (
modelargument) - Model specified in MCP settings (
OPENROUTER_MODELenvironment variable) - Default model (anthropic/claude-3.5-sonnet)
Supported Models
The following OpenRouter models have been tested:
- anthropic/claude-3.5-sonnet
- anthropic/claude-3-opus-20240229
Features
- Automatic image resizing and optimization
- Configurable model selection
- Support for custom questions about images
- Detailed error messages
- Automatic JPEG conversion and quality optimization
Error Handling
The server handles various error cases:
- Invalid image paths
- Missing API keys
- Network errors
- Invalid model selections
- Image processing errors
Each error will return a descriptive message to help diagnose the issue.
Development
To build from source:
git clone https://github.com/catalystneuro/mcp_read_images.git
cd mcp_read_images
npm install
npm run build
License
MIT License. See LICENSE for details.
관련 서버
Scout Monitoring MCP
스폰서Put performance and error data directly in the hands of your AI assistant.
Alpha Vantage MCP Server
스폰서Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more
Terraform MCP Server
Integrates with Terraform Registry APIs for Infrastructure as Code development, supporting provider and module discovery.
SMART-E2B
Integrates E2B for secure code execution in cloud sandboxes, designed for Claude AI Desktop.
mcp-ssh-sre
An MCP server providing read-only server monitoring tools to AI assistants. Runs predefined diagnostic commands over SSH and passes only the results to the LLM - your server credentials and shell are never exposed.
Adobe After Effects
Control Adobe After Effects through a standardized protocol, enabling AI assistants and other applications.
MCP-Insomnia
An MCP server for AI agents to create and manage API collections in Insomnia-compatible format.
MCP Tree-sitter Server
A server for code analysis using Tree-sitter, with context management capabilities.
FDEP MCP Server
A static code analysis server for enterprise-scale Haskell codebases, providing over 40 comprehensive analysis tools.
Credos
Share your team's Coding Best Practices with Cursor, VS Code, Claude code, Windsurf, JetBrains IDEs and other coding tools supporting remote MCP connection.
FileScopeMCP
Analyzes your codebase identifying important files based on dependency relationships. Generates diagrams and importance scores per file, helping AI assistants understand the codebase. Automatically parses popular programming languages, Python, Lua, C, C++, Rust, Zig.
Video Agent MCP Server
An MCP server for AI-powered video creation using the FAL AI API.