Universal Image MCP

Universal MCP server for AI image generation supporting AWS Bedrock (Nova Canvas), OpenAI (GPT Image, DALL-E), and Google Gemini (Imagen 4). Generate, transform, and edit images using multiple AI models through a single Model Context Protocol interface.

GitHub

Universal Image MCP - Multi-Provider AI Image Generation Server for Claude Desktop & MCP Clients

Universal MCP server for AI image generation supporting AWS Bedrock (Nova Canvas), OpenAI (GPT Image, DALL-E), and Google Gemini (Imagen 4). Generate, transform, and edit images using multiple AI models through a single Model Context Protocol interface.

What is Universal Image MCP?

Universal Image MCP is a Model Context Protocol (MCP) server that provides unified access to multiple AI image generation providers. Whether you're using Claude Desktop, Kiro IDE, or any MCP-compatible client, this server lets you generate and transform images using: n

AWS Bedrock - Amazon Nova Canvas for enterprise-grade image generation
OpenAI - GPT Image 1.5, ChatGPT Image, DALL-E models
Google Gemini - Gemini 2.5 Flash Image, Imagen 4, Imagen 4 Ultra

Perfect for developers building AI applications, content creators, and anyone needing programmatic access to multiple image generation APIs through a single interface.

Example Outputs

Comparison of architecture diagrams generated by different models using this MCP server:

View Prompt Used

Technical Diagram Prompt:

Technical architecture diagram of a Universal Image MCP Server system. The diagram shows:

Top layer: MCP Client (Claude Desktop, Kiro IDE) connecting via Model Context Protocol

Middle layer: Universal Image MCP Server (FastMCP) with three main components:
1. Server Module (server.py) - handles list_models, generate_image, transform_image, prompt_guide tools
2. Provider Module (providers.py) - manages lazy initialization and provider abstraction
3. Configuration - environment variables for ENABLE_AWS, ENABLE_OPENAI, ENABLE_GEMINI

Bottom layer: Three provider boxes side by side:
- AWS Bedrock (boto3) - Amazon Nova Canvas, with AWS credentials and region config
- OpenAI API - GPT Image 1.5, ChatGPT Image Latest, with API key
- Google Gemini API - Gemini 2.5 Flash, Imagen 4, with API key

Data flow arrows showing:
- Client sends tool requests to Server
- Server routes to appropriate Provider based on model_id
- Providers make API calls to their respective services
- Image data flows back through the chain

Clean, professional software architecture diagram style with boxes, arrows, and labels. Use blue and gray color scheme. Modern technical documentation aesthetic. Isometric or layered view showing clear separation of concerns.

3D Clay Art Prompt:

Same technical architecture content as above, but rendered in:

3D clay art style with soft rounded shapes, pastel colors, cute minimalist aesthetic, soft studio lighting, clean composition with depth and shadows.

Note: 3D Clay Art versions used s3tablearch.png as a reference image for style guidance.

Key Features

🔄 Multi-Provider Support - Switch between AWS Bedrock, OpenAI, and Google Gemini seamlessly
🚀 Dynamic Model Discovery - Automatically fetches latest available models from each provider API
⚡ Lazy Initialization - Provider clients load only when needed for optimal performance
🎨 Reference Image Support - Generate new images based on existing image styles
📐 Configurable Dimensions - Custom width/height for supported AI models
📚 Built-in Prompt Guide - Best practices for writing effective image generation prompts
🔌 MCP Protocol - Works with Claude Desktop, Kiro IDE, and all MCP-compatible clients
🐍 Python 3.11+ - Modern Python with type hints and async support

Quick Start Installation

Install via pip:

pip install universal-image-mcp

Or use with uvx (recommended for MCP servers):

uvx universal-image-mcp@latest

MCP Server Configuration

For Claude Desktop

Add to your Claude Desktop MCP configuration file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "universal-image-mcp": {
      "command": "uvx",
      "args": ["universal-image-mcp@latest"],
      "env": {
        "ENABLE_AWS": "true",
        "AWS_PROFILE": "default",
        "AWS_REGION": "us-east-1",
        
        "ENABLE_OPENAI": "true",
        "OPENAI_API_KEY": "sk-...",
        
        "ENABLE_GEMINI": "true",
        "GEMINI_API_KEY": "..."
      }
    }
  }
}

For Kiro IDE

Add to ~/.kiro/settings/mcp.json:

{
  "mcpServers": {
    "universal-image-mcp": {
      "command": "uvx",
      "args": ["universal-image-mcp@latest"],
      "env": {
        "ENABLE_AWS": "true",
        "ENABLE_OPENAI": "true",
        "OPENAI_API_KEY": "sk-...",
        "ENABLE_GEMINI": "true",
        "GEMINI_API_KEY": "..."
      }
    }
  }
}

Getting API Keys and Credentials

Before using this MCP server, you'll need to obtain credentials for the providers you want to use.

AWS Bedrock Setup

AWS Bedrock uses your local AWS credentials. You have several options:

AWS CLI Configuration (Recommended)
- Install the AWS CLI
- Run aws configure and provide your access key, secret key, and region
- Official guide: AWS CLI Configuration
AWS Credentials File
- Create ~/.aws/credentials with your access keys
- Create ~/.aws/config with your region settings
- Official guide: Shared Config and Credentials Files
Environment Variables
- Set AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_REGION
- Official guide: AWS CLI Authentication

Getting AWS Access Keys:

Sign in to the AWS Console
Navigate to IAM → Users → Your User → Security Credentials
Create a new access key under "Access keys"
Ensure your IAM user has permissions for Bedrock (e.g., AmazonBedrockFullAccess policy)

OpenAI API Key

Create an OpenAI Account
- Visit OpenAI Platform
- Sign up or log in to your account
Generate API Key
- Go to API Keys page
- Click "Create new secret key"
- Give it a descriptive name (optional)
- Copy the key immediately (you won't be able to see it again)
Add Billing Information
- OpenAI requires payment information to use the API
- Navigate to Billing to add payment details

Official Documentation: OpenAI Quickstart Guide

Google Gemini API Key

Get a Gemini API Key
- Visit Google AI Studio
- Sign in with your Google account
- Click "Get API Key" or "Create API Key"
- Create a new project or select an existing one
- Copy your API key
Alternative: Google Cloud API Key
- For production use, you can use Vertex AI on Google Cloud
- This provides more enterprise features and billing controls

Official Documentation: Gemini API Quickstart

Environment Variables

API Reference - MCP Tools

list_models()

List all available AI image generation models from enabled providers. Models are fetched dynamically from each provider's API with deprecated models automatically filtered.

Returns: Formatted list of model IDs compatible with generate_image() and transform_image()

Example models:

AWS: amazon.nova-canvas-v1:0
OpenAI: gpt-image-1.5, chatgpt-image-latest
Gemini: models/gemini-2.5-flash-image, models/imagen-4.0-generate-001

generate_image(prompt, model_id, output_path, reference_image?, width?, height?)

Generate AI images from text prompts using any supported model.

transform_image(image_path, prompt, model_id, output_path)

Transform and edit existing images using AI-powered modifications based on text prompts.

Use cases: Image editing, style transfer, AI-powered photo manipulation, artistic transformations

prompt_guide()

Get AI prompt engineering best practices for image generation. Returns comprehensive guidelines covering:

Prompt structure (Subject + Details + Style + Lighting + Mood + Composition)
Specific vs generic descriptions
Style, lighting, and mood keywords
Example prompts for different use cases

Supported AI Image Models

All models are discovered dynamically. Use list_models() to see current options.

AWS Bedrock Models

Amazon Nova Canvas (amazon.nova-canvas-v1:0) - Enterprise-grade image generation with text and image input support

OpenAI Models

GPT Image 1.5 (gpt-image-1.5) - Latest OpenAI image generation model
ChatGPT Image Latest (chatgpt-image-latest) - ChatGPT-integrated image generation

Google Gemini Models

Gemini 2.5 Flash Image (models/gemini-2.5-flash-image) - Fast, efficient image generation
Gemini 3 Pro Image (models/gemini-3-pro-image-preview) - Advanced image generation capabilities
Imagen 4 (models/imagen-4.0-generate-001) - Google's state-of-the-art image model
Imagen 4 Ultra (models/imagen-4.0-ultra-generate-001) - Highest quality Imagen model
Imagen 4 Fast (models/imagen-4.0-fast-generate-001) - Optimized for speed

Use Cases

AI Application Development - Integrate multiple image generation providers into your apps
Content Creation - Generate marketing materials, social media content, illustrations
Prototyping & Design - Quickly visualize concepts and design ideas
Image Editing Automation - Batch process and transform images with AI
Research & Experimentation - Compare outputs across different AI models
Claude Desktop Workflows - Enhance Claude conversations with image generation
Developer Tools - Build MCP-compatible tools and extensions

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Related Projects

Model Context Protocol - Official MCP documentation
Claude Desktop - AI assistant with MCP support
FastMCP - Python framework for building MCP servers

Keywords

mcp-server image-generation ai-images aws-bedrock openai google-gemini claude-desktop imagen nova-canvas python fastmcp model-context-protocol ai-art text-to-image image-transformation

License

MIT

Related Servers

Immanuel MCP Server

An MCP server for astrological chart calculations using the immanuel-python library.

GuessMarket MCP

Prediction market trading server for GuessMarket. Browse markets, buy/sell shares, add/remove liquidity, check portfolios, and build on-chain transactions. 20+ tools.

FRITZ!Box MCP Server

Control AVM FRITZ!Box routers - manage devices, WiFi, network settings, parental controls, and schedule time-delayed actions

trainedby.ai

Connect wearables to ChatGPT, so your AI coach knows you without typing a word

Time MCP Server

Provides current time and timezone conversion capabilities using IANA timezone names, with automatic system timezone detection.

NVD CVE MCP Server

The NVD CVE MCP Server is a powerful security research tool

N.I.N.A. Advanced API

Control the N.I.N.A. (Nighttime Imaging 'N' Astronomy) software through its Advanced API.

Image

Fetch and process images from URLs, local file paths, and numpy arrays, returning them as base64-encoded strings.

Ecovacs Robot Control

Control and query the status of Ecovacs cleaning robots using the MCP protocol.

BWA (Burrows-Wheeler Aligner)

An MCP server for the BWA sequence alignment tool.