Analyze images using OpenRouter's vision models. Requires an OpenRouter API key.
An MCP server for analyzing images using OpenRouter vision models. This server provides a simple interface to analyze images using various vision models like Claude-3.5-sonnet and Claude-3-opus through the OpenRouter API.
npm install @catalystneuro/mcp_read_images
The server requires an OpenRouter API key. You can get one from OpenRouter.
Add the server to your MCP settings file (usually located at ~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json
for VSCode):
{
"mcpServers": {
"read_images": {
"command": "read_images",
"env": {
"OPENROUTER_API_KEY": "your-api-key-here",
"OPENROUTER_MODEL": "anthropic/claude-3.5-sonnet" // optional, defaults to claude-3.5-sonnet
},
"disabled": false,
"autoApprove": []
}
}
}
The server provides a single tool analyze_image
that can be used to analyze images:
// Basic usage with default model
use_mcp_tool({
server_name: "read_images",
tool_name: "analyze_image",
arguments: {
image_path: "/path/to/image.jpg",
question: "What do you see in this image?" // optional
}
});
// Using a specific model for this call
use_mcp_tool({
server_name: "read_images",
tool_name: "analyze_image",
arguments: {
image_path: "/path/to/image.jpg",
question: "What do you see in this image?",
model: "anthropic/claude-3-opus-20240229" // overrides default and settings
}
});
The model is selected in the following order of precedence:
model
argument)OPENROUTER_MODEL
environment variable)The following OpenRouter models have been tested:
The server handles various error cases:
Each error will return a descriptive message to help diagnose the issue.
To build from source:
git clone https://github.com/catalystneuro/mcp_read_images.git
cd mcp_read_images
npm install
npm run build
MIT License. See LICENSE for details.
Enable your code gen agents to create & run 0-config end-to-end tests against new code changes in remote browsers via the Debugg AI testing platform.
Beautiful HTML and PNG diff visualization using diff2html, designed for filesystem edit_file dry-run output with high-performance Bun runtime.
Work on dataset metadata with MLCommons Croissant validation and creation.
Enhances large language models with protein structure analysis capabilities, including active site analysis and disease-protein searches, by connecting to the RCSB Protein Data Bank.
Model Kontext Protocol Server for Kubernetes that allows LLM-powered applications to interact with Kubernetes clusters through native Go implementation with direct API integration and comprehensive resource management.
Interact with the Qase API for test management. Requires a QASE_API_TOKEN for authentication.
Open-source tool for collaborative editing, versioning, evaluating, and releasing prompts.
Access real-time and historical token, wallet, and trading data from the Solana ecosystem via the Solana Tracker API.
A service framework supporting the Model Context Protocol (MCP) to integrate enterprise systems and AI platforms via RESTful, gRPC, and Dubbo protocols.
A remote MCP server example deployable on Cloudflare Workers without authentication.