Gemini Imagen 3.0
Generate high-quality images using Google's Imagen 3.0 model via the Gemini API.
Gemini Imagen 3.0 MCP Server
A Model Context Protocol (MCP) server that exposes Google's Imagen 3.0 model through the Gemini API for image generation. It is written in TypeScript and designed to run under Claude Desktop and other MCP-compatible hosts.
🌟 Features
- Leverage Google's state-of-the-art Imagen 3.0 model via Gemini API
- Generate up to 4 high-quality images per request
- Automatic file management with intelligent naming
- HTML preview generation with file:// protocol support
- Built on the Model Context Protocol (MCP) for AI agent compatibility
- TypeScript implementation with robust error handling
🚀 Quick Start
Prerequisites
- Node.js 18 or higher
- Google Gemini API key
- Claude Desktop or another MCP-compatible host
Installation
- Clone the repository:
git clone https://github.com/yourusername/gemini-imagen-mcp-server.git
cd gemini-imagen-mcp-server
- Install dependencies:
npm install
- Build the TypeScript code:
npm run build
⚙️ Configuration
- Configure Claude Desktop by adding the following to claude_desktop_config.json:
{
  "mcpServers": {
    "gemini-image-gen": {
      "command": "node",
      "args": ["./build/index.js"],
      "cwd": "<path-to-project-directory>",
      "env": {
        "GEMINI_API_KEY": "your-gemini-api-key"
      }
    }
  }
}
- Replace placeholders:
  - <path-to-project-directory>: Your project path
  - your-gemini-api-key: Your Gemini API key
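The configuration above passes the key to the server process through the GEMINI_API_KEY environment variable. A minimal sketch of how a server might read and validate it at startup follows; the helper name requireApiKey and the error text are hypothetical and not taken from this project's source.

```typescript
// Hypothetical helper (not from the project source): reads the API key
// from an environment-like object and fails early if it is missing.
type Env = Record<string, string | undefined>;

function requireApiKey(env: Env): string {
  const key = env["GEMINI_API_KEY"]?.trim();
  if (!key) {
    throw new Error(
      'GEMINI_API_KEY is not set; add it to the "env" block of the MCP server entry.'
    );
  }
  return key;
}

// In a Node.js entry point this would typically be called as:
//   const apiKey = requireApiKey(process.env);
```

Failing at startup, rather than on the first tool call, tends to make a missing or empty key easier to spot in the host's logs.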
🛠️ Available Tools
1. generate_images
Generates images using Google's Imagen 3.0 model.
Parameters:
- prompt (required): Text description of the image to generate
- numberOfImages (optional): Number of images (1-4, default: 1)
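The parameter rules above (a required prompt, and an optional numberOfImages between 1 and 4 that defaults to 1) could be enforced with a small validator. This is a sketch under those stated rules only; the names GenerateImagesArgs and parseGenerateImagesArgs are hypothetical, and the actual server may validate differently.

```typescript
// Hypothetical argument shape for generate_images, based on the documented parameters.
interface GenerateImagesArgs {
  prompt: string;
  numberOfImages: number;
}

// Validates raw tool arguments and applies the documented default of 1 image.
function parseGenerateImagesArgs(raw: unknown): GenerateImagesArgs {
  if (typeof raw !== "object" || raw === null) {
    throw new Error("arguments must be an object");
  }
  const { prompt, numberOfImages = 1 } = raw as {
    prompt?: unknown;
    numberOfImages?: unknown;
  };
  if (typeof prompt !== "string" || prompt.trim() === "") {
    throw new Error("prompt is required and must be a non-empty string");
  }
  if (
    typeof numberOfImages !== "number" ||
    !Number.isInteger(numberOfImages) ||
    numberOfImages < 1 ||
    numberOfImages > 4
  ) {
    throw new Error("numberOfImages must be an integer between 1 and 4");
  }
  return { prompt, numberOfImages };
}
```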
File Management:
- Images are automatically saved in G:\image-gen3-google-mcp-server\images
- Filenames follow the pattern: {sanitized-prompt}-{timestamp}-{index}.png
- Timestamps ensure unique filenames
- Prompts are sanitized for safe filesystem usage
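To illustrate the naming pattern above, here is one way a {sanitized-prompt}-{timestamp}-{index}.png filename could be built. The sanitization rules (lowercasing, replacing non-alphanumeric runs with hyphens, a 50-character cap), the millisecond timestamp, and the zero-based index are all assumptions for this sketch; the project's actual rules are not documented here.

```typescript
// Hypothetical sanitizer: keeps only lowercase letters and digits,
// joined by single hyphens, capped at maxLength characters.
function sanitizePrompt(prompt: string, maxLength = 50): string {
  const slug = prompt
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // collapse anything unsafe into one hyphen
    .replace(/^-+|-+$/g, "")     // trim leading and trailing hyphens
    .slice(0, maxLength)
    .replace(/-+$/, "");         // trim a hyphen left behind by the cut
  return slug || "image";        // fall back if nothing usable remains
}

// Builds a filename following {sanitized-prompt}-{timestamp}-{index}.png.
function buildImageFilename(
  prompt: string,
  index: number,
  timestamp: number = Date.now()
): string {
  return `${sanitizePrompt(prompt)}-${timestamp}-${index}.png`;
}

// Example with a fixed timestamp:
// buildImageFilename("A futuristic city, at night!", 0, 1700000000000)
//   returns "a-futuristic-city-at-night-1700000000000-0.png"
```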
Example:
Generate an image of a futuristic city at night
2. create_image_html
Creates HTML preview tags for generated images.
Parameters:
- imagePaths (required): Array of image file paths
- width (optional): Image width in pixels (default: 512)
- height (optional): Image height in pixels (default: 512)
Returns HTML tags with absolute file:// URLs for local viewing.
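As a rough illustration of the behavior described above, the sketch below turns image paths into img tags with absolute file:// URLs, using Node.js's pathToFileURL. The function name createImageHtml, the alt text, and the exact tag layout are assumptions; the server's real output format may differ.

```typescript
import { pathToFileURL } from "node:url";

// Hypothetical sketch: builds one <img> tag per path, using the documented
// defaults of 512 pixels for width and height.
function createImageHtml(
  imagePaths: string[],
  width = 512,
  height = 512
): string {
  return imagePaths
    .map((imagePath) => {
      // pathToFileURL resolves relative paths against the current working
      // directory and percent-encodes characters such as spaces.
      const src = pathToFileURL(imagePath).href.replace(/&/g, "&amp;");
      return `<img src="${src}" width="${width}" height="${height}" alt="Generated image">`;
    })
    .join("\n");
}
```

Going through pathToFileURL rather than string concatenation avoids malformed URLs when a path contains spaces or, on Windows, a drive letter and backslashes.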
Example:
Create HTML tags for the generated images with width=400
🔧 Development
# Install dependencies
npm install
# Build TypeScript
npm run build
# Run tests (when available)
npm test
🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request. For major changes:
- Fork the repository
- Create your feature branch (git checkout -b feature/AmazingFeature)
- Commit your changes (git commit -m 'Add some AmazingFeature')
- Push to the branch (git push origin feature/AmazingFeature)
- Open a Pull Request
📝 Error Handling
The server implements two main error codes:
- tool_not_found (1): When the requested tool is not available
- execution_error (2): When image generation or HTML creation fails
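The two error codes above suggest a dispatch step that distinguishes an unknown tool from a tool that failed while running. The sketch below shows that idea only; the ToolResult shape, the callTool function, and the synchronous handlers are simplifying assumptions (real handlers would likely be asynchronous), and none of it is taken from the project's source.

```typescript
// Error codes as documented above.
const ERROR_CODES = { tool_not_found: 1, execution_error: 2 } as const;
type ErrorName = keyof typeof ERROR_CODES;

// Hypothetical handler and result shapes for this sketch.
type ToolHandler = (args: unknown) => string;
type ToolResult =
  | { ok: true; output: string }
  | { ok: false; code: number; error: ErrorName; message: string };

function callTool(
  tools: Map<string, ToolHandler>,
  toolName: string,
  args: unknown
): ToolResult {
  const handler = tools.get(toolName);
  if (!handler) {
    // The requested tool is not registered.
    return {
      ok: false,
      code: ERROR_CODES.tool_not_found,
      error: "tool_not_found",
      message: `Unknown tool: ${toolName}`,
    };
  }
  try {
    return { ok: true, output: handler(args) };
  } catch (err) {
    // The tool exists but failed while running.
    const message = err instanceof Error ? err.message : String(err);
    return {
      ok: false,
      code: ERROR_CODES.execution_error,
      error: "execution_error",
      message,
    };
  }
}
```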
📄 License
MIT License - see the LICENSE file for details.
✨ Author
Falah G. Salieh
- Copyright © 2025
- GitHub: @yourgithubhandle
- Email: [email protected]
🙏 Acknowledgments
- Google Gemini API and Imagen 3.0 model
- Model Context Protocol (MCP) by Anthropic
- Claude Desktop team for MCP host implementation
📌 Tags
#MCP #Gemini #Imagen3 #AI #ImageGeneration #TypeScript #NodeJS #GoogleAI #ClaudeDesktop
Made with ❤️ by Falah G. Salieh