Gemini Image Analysis
Analyzes image and video content from URLs or local files using the Gemini 2.0 Flash model.
image-mcp-server-gemini
An MCP server that receives image/video URLs or local file paths and analyzes their content using the Gemini 2.0 Flash model.(forked from github.com/champierre/image-mcp-server)
Features
- Analyzes content from one or more image/video URLs or local file paths.
- Analyzes videos directly from YouTube URLs.
- Can analyze relationships between multiple images or videos provided together.
- Supports optional text prompts to guide the analysis.
- High-precision recognition and description using the Gemini 2.0 Flash model.
- URL validity checking and local file loading with Base64 encoding.
- Basic security checks for local file paths.
- Handles various image and video MIME types (see Usage section for details).
Installation
Installing via Smithery
To install Image Analysis Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @Rentapad/image-mcp-server --client claude
Manual Installation
# Clone the repository
git clone https://github.com/Rentapad/image-mcp-server-gemini.git
cd image-mcp-server-gemini
# Install dependencies
npm install
# Compile TypeScript
npm run build
Configuration
To use this server, you need a Gemini API key. Set the following environment variable:
GEMINI_API_KEY=your_gemini_api_key
MCP Server Configuration
To use with tools like Cline, add the following settings to your MCP server configuration file:
For Cline
Add the following to cline_mcp_settings.json:
{
"mcpServers": {
"image-video-analysis": { // Consider renaming for clarity
"command": "node",
"args": ["/path/to/image-mcp-server/dist/index.js"],
"env": {
"GEMINI_API_KEY": "your_gemini_api_key"
}
}
}
}
For Claude Desktop App
Add the following to claude_desktop_config.json:
{
"mcpServers": {
"image-video-analysis": { // Consider renaming for clarity
"command": "node",
"args": ["/path/to/image-mcp-server/dist/index.js"],
"env": {
"GEMINI_API_KEY": "your_gemini_api_key"
}
}
}
}
Usage
Once the MCP server is configured, the following tools become available:
analyze_image: Receives one or more image URLs and analyzes their content.- Arguments:
imageUrls(array of strings, required),prompt(string, optional).
- Arguments:
analyze_image_from_path: Receives one or more local image file paths and analyzes their content.- Arguments:
imagePaths(array of strings, required),prompt(string, optional).
- Arguments:
analyze_video: Receives one or more video URLs and analyzes their content. Best for smaller videos (see Video Notes).- Arguments:
videoUrls(array of strings, required),prompt(string, optional).
- Arguments:
analyze_video_from_path: Receives one or more local video file paths and analyzes their content. Best for smaller videos (see Video Notes).- Arguments:
videoPaths(array of strings, required),prompt(string, optional).
- Arguments:
analyze_youtube_video: Receives a single YouTube video URL and analyzes its content.- Arguments:
youtubeUrl(string, required),prompt(string, optional).
- Arguments:
Usage Examples
Analyzing a single image from URL:
Please analyze this image: https://example.com/image.jpg
Analyzing multiple images from local paths and comparing them:
Analyze these images: /path/to/your/image1.png, /path/to/your/image2.jpeg. Which one contains a cat?
(The client would call analyze_image_from_path with imagePaths: ["/path/to/your/image1.png", "/path/to/your/image2.jpeg"] and prompt: "Which one contains a cat?")
Analyzing a video from URL with a specific prompt:
Summarize the content of this video: https://example.com/video.mp4
(The client would call analyze_video with videoUrls: ["https://example.com/video.mp4"] and prompt: "Summarize the content of this video")
Analyzing a YouTube video:
What is the main topic of this YouTube video? https://www.youtube.com/watch?v=dQw4w9WgXcQ
(The client would call analyze_youtube_video with youtubeUrl: "https://www.youtube.com/watch?v=dQw4w9WgXcQ" and prompt: "What is the main topic of this YouTube video?")
Video Notes
- Size Limit: For videos provided via URL (
analyze_video) or path (analyze_video_from_path), Gemini currently has limitations on the size of video data that can be processed directly (typically around 20MB after Base64 encoding). Larger videos may fail. YouTube analysis does not have this same client-side download limit. - Supported MIME Types: The server attempts to map and use MIME types supported by Gemini for video. Officially supported types include:
video/mp4,video/mpeg,video/mov,video/avi,video/x-flv,video/mpg,video/webm,video/wmv,video/3gpp. Files with other MIME types might be skipped. YouTube videos are handled separately.
Note: Specifying Local File Paths
When using the ..._from_path tools, the AI assistant (client) must specify valid file paths in the environment where this server is running.
- If the server is running on WSL:
- If the AI assistant has a Windows path (e.g.,
C:\...), it needs to convert it to a WSL path (e.g.,/mnt/c/...) before passing it to the tool. - If the AI assistant has a WSL path, it can pass it as is.
- If the AI assistant has a Windows path (e.g.,
- If the server is running on Windows:
- If the AI assistant has a WSL path (e.g.,
/home/user/...), it needs to convert it to a UNC path (e.g.,\\wsl$\Distro\...) before passing it to the tool. - If the AI assistant has a Windows path, it can pass it as is.
- If the AI assistant has a WSL path (e.g.,
Path conversion is the responsibility of the AI assistant (or its execution environment). The server will try to interpret the received path as is, applying basic security checks.
Note: Type Errors During Build
When running npm run build, you may see an error (TS7016) about missing TypeScript type definitions for the mime-types module.
src/index.ts:16:23 - error TS7016: Could not find a declaration file for module 'mime-types'. ...
This is a type checking error, and since the JavaScript compilation itself succeeds, it does not affect the server's execution. If you want to resolve this error, install the type definition file as a development dependency.
npm install --save-dev @types/mime-types
# or
yarn add --dev @types/mime-types
Development
# Run in development mode
npm run dev
License
MIT
Verwandte Server
Weather Union
Provides weather data and air quality information using the Weather Union API.
1Panel
An MCP server for 1Panel, a modern Linux server operation and maintenance panel.
AWS MCP Servers
A suite of MCP servers providing AI applications with access to AWS documentation, contextual guidance, and best practices.
LlamaCloud
Connect to and manage data indexes on the LlamaCloud platform.
Remote MCP Server Authless Rickroll
A remote MCP server on Cloudflare Workers that generates podcast URLs and rickrolls without authentication, using Cloudflare AI and D1.
Terrakube MCP Server
Manage Terrakube workspaces, variables, modules, and organizations.
Render MCP Server
Interact with Render resources via LLMs using the Render API.
Remote MCP Server (Authless)
A remote MCP server deployable on Cloudflare Workers without authentication.
OpenMeteo MCP Server
Access global weather data and forecasts using the OpenMeteo API, including current conditions, historical data, and location search.
GCP MCP Server
Manage and interact with Google Cloud Platform (GCP) resources through an AI assistant.