Ollama MCP Bridge

A bridge API service connecting Ollama with Model Context Protocol (MCP) servers.

GitHub

Ollama MCP Bridge

Features
Requirements
Installation
How It Works
Configuration
- MCP Servers Configuration
- ✨NEW Tool Filtering
- Variable Expansion
- CORS Configuration
- Environment Variables
Usage
Development
- Key Dependencies
- Testing
Contributing
Related Projects
Inspiration and Credits

Features

🚀 Pre-loaded Servers: All MCP servers are connected at startup from JSON configuration
📝 JSON Configuration: Configure multiple servers with complex commands and environments
🌐 Multiple Transport Types: Connect to MCP servers via stdio (local processes), HTTP (StreamableHTTP), or SSE
🎯 Tool Filtering: Filter tools per server with include/exclude modes for fine-grained control
🧩 Config Variable Expansion: Supports ${env:VAR_NAME} and ${workspaceFolder} in config strings
🔗 Tool Integration: Automatic tool call processing and response integration
🔄 Multi-Round Tool Execution: Automatically loops through multiple rounds of tool calls until completion
🛡️ Configurable Tool Limits: Set maximum tool execution rounds to prevent excessive tool calls
🛠️ All Tools Available: Ollama can use any tool from any connected server simultaneously
🔌 Complete API Compatibility: /api/chat adds tools while all other Ollama API endpoints are transparently proxied
🔧 Configurable Ollama: Specify custom Ollama server URL via CLI (supports local and cloud models)
☁️ Cloud Model Support: Works with Ollama cloud models
🔄 Version Check: Automatic check for newer versions with upgrade instructions
🌊 Streaming Responses: Supports incremental streaming of responses to clients
🤔 Thinking Mode: Proxies intermediate "thinking" messages from Ollama and MCP tools
⚡️ FastAPI Backend: Modern async API with automatic documentation
🏗️ Modular Architecture: Clean separation into CLI, API, and MCP management modules
💻 Typer CLI: Clean command-line interface with configurable options
📊 Structured Logging: Uses loguru for comprehensive logging
📦 PyPI Package: Easily installable via pip or uv from PyPI
🗣️ System Prompt Configuration: Allows setting a system prompt for the assistant's behavior

Requirements

Python >= 3.10.15
Ollama server running (local or remote)
MCP server configuration file with at least one MCP server defined (see below for example)

Installation

You can install ollama-mcp-bridge in several ways, depending on your preference:

Quick Start

Install instantly with uvx:

uvx ollama-mcp-bridge

Or, install from PyPI with pip

pip install --upgrade ollama-mcp-bridge

Or, run with Docker Compose

docker-compose up

This uses the included docker-compose.yml file which:

Builds the bridge from source using this Dockerfile Dockerfile
Connects to Ollama running on the host machine (host.docker.internal:11434)
Maps the configuration file from ./mcp-config.json (includes mock weather server for demo)
Allows all CORS origins (configurable via CORS_ORIGINS environment variable)
Supports configurable Ollama request timeouts via OLLAMA_PROXY_TIMEOUT

Or, install from source

# Clone the repository
git clone https://github.com/jonigl/ollama-mcp-bridge.git
cd ollama-mcp-bridge

# Install dependencies using uv
uv sync

# Start Ollama (if not already running)
ollama serve

# Run the bridge (preferred)
ollama-mcp-bridge

If you want to install the project in editable mode (for development):

# Install the project in editable mode
uv tool install --editable .
# Run it like this:
ollama-mcp-bridge

How It Works

Startup: All MCP servers defined in the configuration are loaded and connected
Version Check: At startup, the bridge checks for newer versions and notifies if an update is available
Tool Collection: Tools from all servers are collected and made available to Ollama
Chat Completion Request (/api/chat endpoint only): When a chat completion request is received on /api/chat:
- The request is forwarded to Ollama (local or cloud) along with the list of all available tools
- If Ollama chooses to invoke any tools, those tool calls are executed through the corresponding MCP servers
- Tool responses are fed back to Ollama
- The process repeats in a loop until no more tool calls are needed
- Responses stream to the client in real-time throughout the entire process
- The final response (with all tool results integrated) is returned to the client
- This is the only endpoint where MCP server tools are integrated.
Other Endpoints: All other endpoints (except /api/chat, /health, and /version) are fully proxied to the underlying Ollama server with no modification.
Logging: All operations are logged using loguru for debugging and monitoring

Configuration

MCP Servers Configuration

You can configure MCP servers in three ways:

Local process (stdio): {"command": "...", "args": [...], "env": {...}}
Remote endpoint (StreamableHTTP): {"url": "https://..."} - Uses StreamableHTTP by default
Remote endpoint (SSE): {"url": "https://.../sse"} - If the URL ends with /sse, the bridge connects via Server-Sent Events

Create an MCP configuration file at mcp-config.json with your servers:

{
  "mcpServers": {
    "weather": {
      "command": "uv",
      "args": [
        "--directory",
        "./mock-weather-mcp-server",
        "run",
        "main.py"
      ],
      "env": {
        "MCP_LOG_LEVEL": "ERROR"
      },
      "toolFilter": {
        "mode": "include",
        "tools": ["get_current_temperature", "get_forecast"]
      }
    },
    "remote_streamable_http": {
      "url": "https://example.com/mcp"
    },
    "remote_sse": {
      "url": "https://example.com/sse"
    },
    "filesystem": {
      "command": "npx",
      "args": [
        "-y",
        "@modelcontextprotocol/server-filesystem",
        "/tmp"
      ],
      "toolFilter": {
        "mode": "exclude",
        "tools": ["delete_file", "write_file"]
      }
    }
  }
}

Tool Filtering

You can filter which tools from an MCP server are made available to Ollama using the optional toolFilter configuration:

toolFilter (optional): Object with filtering options
- mode: Either "include" (allow-list) or "exclude" (deny-list). Defaults to "include" if not specified.
- tools: Array of exact tool names to include or exclude

Behavior:

If toolFilter is not set or tools array is empty, all tools from the server are loaded (default behavior)
Include mode (allow-list): Only the tools listed in the tools array are made available. If a listed tool is not found on the server, a warning is logged but the server connection continues.
Exclude mode (deny-list): All tools except those listed in the tools array are made available. Listed tools are filtered out.
Tool names must match exactly (case-sensitive)
Invalid mode values cause the application to exit with an error message

Example with include mode (default):

{
  "mcpServers": {
    "weather": {
      "command": "uv",
      "args": ["--directory", "./mock-weather-mcp-server", "run", "main.py"],
      "toolFilter": {
        "tools": ["get_current_temperature", "get_forecast"]
      }
    }
  }
}

Example with explicit modes:

{
  "mcpServers": {
    "weather": {
      "command": "uv",
      "args": ["--directory", "./mock-weather-mcp-server", "run", "main.py"],
      "toolFilter": {
        "mode": "include",
        "tools": ["get_current_temperature"]
      }
    },
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
      "toolFilter": {
        "mode": "exclude",
        "tools": ["delete_file", "write_file"]
      }
    }
  }
}

Variable Expansion

The config also supports simple expansion in any string value:

${workspaceFolder} resolves to the directory containing the config file
${env:VAR_NAME} resolves to the corresponding environment variable

Example:

{
  "mcpServers": {
    "filesystem": {
      "command": "npx",
      "args": [
        "-y",
        "@modelcontextprotocol/server-filesystem",
        "${workspaceFolder}/data"
      ]
    },
    "remote_with_headers": {
      "url": "https://example.com/mcp",
      "headers": {
        "X-Client-Name": "ollama-mcp-bridge",
        "X-Request-Tag": "${env:MCP_REQUEST_TAG}"
      }
    }
  }
}

[!WARNING] Docker Command Limitations: When running in Docker, MCP servers should use commands available in the container:

✅ npx for Node.js-based MCP servers

✅ uvx for Python-based MCP servers

✅ Direct executables in the container

❌ docker commands (unless Docker-in-Docker is configured)

❌ Local file paths from your host machine

CORS Configuration

Configure Cross-Origin Resource Sharing (CORS) to allow requests from your frontend applications:

# Allow all origins (default, not recommended for production)
ollama-mcp-bridge

# Allow specific origins
CORS_ORIGINS="http://localhost:3000,https://myapp.com" ollama-mcp-bridge

# Allow multiple origins with different ports
CORS_ORIGINS="http://localhost:3000,http://localhost:8080,https://app.example.com" ollama-mcp-bridge

CORS Logging:

The bridge logs CORS configuration at startup
Shows warning when using * (all origins)
Shows allowed origins when properly configured

[!WARNING] Using CORS_ORIGINS="*" allows all origins and is not recommended for production. Always specify exact origins for security.

Environment Variables:

CORS_ORIGINS: Comma-separated list of allowed origins (default: *)
- * allows all origins (shows warning in logs)
- Example: CORS_ORIGINS="http://localhost:3000,https://myapp.com" ollama-mcp-bridge
MAX_TOOL_ROUNDS: Maximum number of tool execution rounds (default: unlimited)
- Can be overridden with --max-tool-rounds CLI parameter (CLI takes precedence)
- Example: MAX_TOOL_ROUNDS=5 ollama-mcp-bridge
OLLAMA_URL: URL of the Ollama server (default: http://localhost:11434)
- Can be overridden with --ollama-url CLI parameter
- Useful for Docker deployments and configuration management
- Example: OLLAMA_URL=http://192.168.1.100:11434 ollama-mcp-bridge
OLLAMA_PROXY_TIMEOUT: Timeout for HTTP requests sent to Ollama, in milliseconds (default: unset)
- When unset, the bridge keeps its existing behavior (some requests use library defaults; /api/chat is not timed out)
- When set to a value > 0, the timeout is applied to Ollama-bound HTTP requests
- When set to 0, timeouts are disabled for Ollama HTTP requests (the bridge logs a warning)
- Streaming chat responses always use no timeout, even when this variable is set
- Example (10 minutes): OLLAMA_PROXY_TIMEOUT=600000 ollama-mcp-bridge
SYSTEM_PROMPT: Optional system prompt to prepend to all forwarded /api/chat requests
- Can be set via the SYSTEM_PROMPT environment variable or --system-prompt CLI flag
- If provided, the bridge will prepend a system message (role: system) to the beginning of the messages array for /api/chat requests unless the request already starts with a system message.
- Example: SYSTEM_PROMPT="You are a concise assistant." ollama-mcp-bridge

Usage

[!NOTE] An example MCP server script is provided at mock-weather-mcp-server/main.py.

Start the Server

# Start with default settings (config: ./mcp-config.json, host: 0.0.0.0, port: 8000)
ollama-mcp-bridge

# Start with custom configuration file
ollama-mcp-bridge --config /path/to/custom-config.json

# Custom host and port
ollama-mcp-bridge --host 0.0.0.0 --port 8080

# Custom Ollama server URL (local or cloud)
ollama-mcp-bridge --ollama-url http://192.168.1.100:11434

# Limit tool execution rounds (prevents excessive tool calls)
ollama-mcp-bridge --max-tool-rounds 5

# Set a system prompt to prepend to all /api/chat requests
ollama-mcp-bridge --system-prompt "You are a concise assistant."

# Combine options
ollama-mcp-bridge --config custom.json --host 0.0.0.0 --port 8080 --ollama-url http://remote-ollama:11434 --max-tool-rounds 10

# Check version and available updates
ollama-mcp-bridge --version

[!TIP] If using uvx to run the bridge, you have to specify the command as uvx ollama-mcp-bridge instead of just ollama-mcp-bridge.

[!NOTE] This bridge supports both streaming responses and thinking mode. You receive incremental responses as they are generated, with tool calls and intermediate thinking messages automatically proxied between Ollama and all connected MCP tools.

CLI Options

--config: Path to MCP configuration file (default: mcp-config.json)
--host: Host to bind the server (default: 0.0.0.0)
--port: Port to bind the server (default: 8000)
--ollama-url: Ollama server URL (default: http://localhost:11434)
--max-tool-rounds: Maximum tool execution rounds (default: unlimited)
--reload: Enable auto-reload during development
--version: Show version information, check for updates and exit
--system-prompt: Optional system prompt to prepend to /api/chat requests (default: none)

API Usage

The API is available at http://localhost:8000.

Swagger UI docs: http://localhost:8000/docs
Ollama-compatible endpoints:
- POST /api/chat — Chat endpoint (same as Ollama API, but with MCP tool support)
  - This is the only endpoint where MCP server tools are integrated. All tool calls are handled and responses are merged transparently for the client.
- All other endpoints (except /api/chat, /health, and /version) are fully proxied to the underlying Ollama server with no modification. You can use your existing Ollama clients and libraries as usual.
Bridge-specific endpoints:
- GET /health — Health check endpoint (not proxied)
- GET /version — Version information and update check

[!IMPORTANT] /api/chat is the only endpoint with MCP tool integration. All other endpoints are transparently proxied to Ollama. /health and /version are specific to the bridge.

This bridge acts as a drop-in proxy for the Ollama API, but with all MCP tools from all connected servers available to every /api/chat request. The bridge automatically handles multiple rounds of tool execution until completion, streaming responses in real-time. You can use your existing Ollama clients and libraries with both local and cloud Ollama models, just point them to this bridge instead of your Ollama server.

Example: Chat

curl -N -X POST http://localhost:8000/api/chat \
  -H "accept: application/json" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen3:0.6b",
    "messages": [
      {
        "role": "system",
        "content": "You are a weather assistant."
      },
      {
        "role": "user",
        "content": "What is the weather like in Paris today?"
      }
    ],
    "think": true,
    "stream": true,
    "options": {
      "temperature": 0.7,
      "top_p": 0.9
    }
  }'

[!TIP] Use /docs for interactive API exploration and testing.

Development

Key Dependencies

FastAPI: Modern web framework for the API
Typer: CLI framework for command-line interface
loguru: Structured logging throughout the application
ollama: Python client for Ollama communication
mcp: Model Context Protocol client library
pytest: Testing framework for API validation

Testing

The project has two types of tests:

Unit Tests (GitHub Actions compatible)

# Install test dependencies
uv sync --extra test

# Run unit tests (no server required)
uv run pytest tests/test_unit.py -v

These tests check:

Configuration file loading
Module imports and initialization
Project structure
Tool definition formats

Integration Tests (require running services)

# First, start the server in one terminal
ollama-mcp-bridge

# Then in another terminal, run the integration tests
uv run pytest tests/test_api.py -v

These tests check:

API endpoints with real HTTP requests
End-to-end functionality with Ollama
Tool calling and response integration

Manual Testing

# Quick manual test with curl (server must be running)
curl -X GET "http://localhost:8000/health"

# Check version information and update status
curl -X GET "http://localhost:8000/version"

curl -X POST "http://localhost:8000/api/chat" \
  -H "Content-Type: application/json" \
  -d '{"model": "qwen3:0.6b", "messages": [{"role": "user", "content": "What tools are available?"}]}'

[!NOTE] Tests require the server to be running on localhost:8000. Make sure to start the server before running pytest.

Contributing

We welcome contributions! Please see CONTRIBUTING.md for:

Development setup instructions
Code formatting guidelines (Black)
Testing procedures
Commit conventions

Related Projects

MCP Client for Ollama - A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include multi-server support, dynamic model switching, streaming responses, tool management, human-in-the-loop capabilities, thinking mode, full model parameters configuration, custom system prompt and saved preferences. Built for developers working with local LLMs.
simple-ollama-chat – A simple, user-friendly chat client for Ollama that works seamlessly with the Ollama MCP Bridge. Lets you interact with models and use all MCP server tools integrated via the bridge, with a clean UI and easy setup. Ideal for quickly testing, chatting, and exploring tool-augmented LLMs through the bridge.

Inspiration and Credits

This project is based on the basic MCP client from my Medium article: Build an MCP Client in Minutes: Local AI Agents Just Got Real.

The inspiration to create this simple bridge came from this GitHub issue: jonigl/mcp-client-for-ollama#22, suggested by @nyomen.

Made with ❤️ by jonigl

Related Servers

Scout Monitoring MCP

sponsor

Put performance and error data directly in the hands of your AI assistant.

Alpha Vantage MCP Server

sponsor

Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more

Juniper Junos MCP Server

An MCP server for interacting with Juniper Junos network devices using LLMs.

Project Zomboid MCP Server

An AI-powered MCP server for Project Zomboid mod development, offering script validation, generation, and contextual assistance.

Remote MCP Server on Cloudflare

An example of a remote MCP server deployable on Cloudflare Workers, featuring customizable tools and no authentication.

MCP Messenger

Like n8n for developers

Code Assistant

A server for code modification and generation using Large Language Models.

Kai

Kai provides a bridge between large language models (LLMs) and your Kubernetes clusters, enabling natural language interaction with Kubernetes resources. The server exposes a comprehensive set of tools for managing clusters, namespaces, pods, deployments, services, and other Kubernetes resources

Cloudflare MCP Server

An example MCP server designed for easy deployment on Cloudflare Workers, operating without authentication.

MCP Server Manager for Claude

Install and manage Model Context Protocol (MCP) servers for Claude Desktop.

UIAutomator2 MCP Server

Automate and control Android devices using the UIAutomator2 framework.

MCP Sequence Simulation Server

Simulate DNA and amino acid sequences using evolutionary models and algorithms.

Ollama MCP Bridge

Ollama MCP Bridge

Table of Contents

Features

Requirements

Installation

Quick Start

Or, install from PyPI with pip

Or, run with Docker Compose

Or, install from source

How It Works

Configuration

MCP Servers Configuration

Tool Filtering

Variable Expansion

CORS Configuration

Environment Variables:

Usage

Start the Server

CLI Options

API Usage

Example: Chat

Development

Key Dependencies

Testing

Unit Tests (GitHub Actions compatible)

Integration Tests (require running services)

Manual Testing

Contributing

Related Projects

Inspiration and Credits

Related Servers

Scout Monitoring MCP

Alpha Vantage MCP Server

Juniper Junos MCP Server

Project Zomboid MCP Server

Remote MCP Server on Cloudflare

MCP Messenger

Code Assistant

Kai

Cloudflare MCP Server

MCP Server Manager for Claude

UIAutomator2 MCP Server

MCP Sequence Simulation Server