Ollama MCP Server

A bridge to use local LLMs from Ollama within the Model Context Protocol.

Ollama MCP Server

🚀 A powerful bridge between Ollama and the Model Context Protocol (MCP), enabling seamless integration of Ollama's local LLM capabilities into your MCP-powered applications.

🌟 Features

Complete Ollama Integration

Full API Coverage: Access all essential Ollama functionality through a clean MCP interface
OpenAI-Compatible Chat: Drop-in replacement for OpenAI's chat completion API
Local LLM Power: Run AI models locally with full control and privacy

Core Capabilities

🔄 Model Management
- Pull models from registries
- Push models to registries
- List available models
- Create custom models from Modelfiles
- Copy and remove models
🤖 Model Execution
- Run models with customizable prompts
- Chat completion API with system/user/assistant roles
- Configurable parameters (temperature, timeout)
- Raw mode support for direct responses
🛠 Server Control
- Start and manage Ollama server
- View detailed model information
- Error handling and timeout management

🚀 Getting Started

Prerequisites

Ollama installed on your system
Node.js and npm/pnpm

Installation

Install dependencies:

pnpm install

Build the server:

pnpm run build

Configuration

Add the server to your MCP configuration:

For Claude Desktop:

MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "ollama": {
      "command": "node",
      "args": ["/path/to/ollama-server/build/index.js"],
      "env": {
        "OLLAMA_HOST": "http://127.0.0.1:11434"  // Optional: customize Ollama API endpoint
      }
    }
  }
}

🛠 Usage Examples

Pull and Run a Model

// Pull a model
await mcp.use_mcp_tool({
  server_name: "ollama",
  tool_name: "pull",
  arguments: {
    name: "llama2"
  }
});

// Run the model
await mcp.use_mcp_tool({
  server_name: "ollama",
  tool_name: "run",
  arguments: {
    name: "llama2",
    prompt: "Explain quantum computing in simple terms"
  }
});

Chat Completion (OpenAI-compatible)

await mcp.use_mcp_tool({
  server_name: "ollama",
  tool_name: "chat_completion",
  arguments: {
    model: "llama2",
    messages: [
      {
        role: "system",
        content: "You are a helpful assistant."
      },
      {
        role: "user",
        content: "What is the meaning of life?"
      }
    ],
    temperature: 0.7
  }
});

Create Custom Model

await mcp.use_mcp_tool({
  server_name: "ollama",
  tool_name: "create",
  arguments: {
    name: "custom-model",
    modelfile: "./path/to/Modelfile"
  }
});

🔧 Advanced Configuration

OLLAMA_HOST: Configure custom Ollama API endpoint (default: http://127.0.0.1:11434)
Timeout settings for model execution (default: 60 seconds)
Temperature control for response randomness (0-2 range)

🤝 Contributing

Contributions are welcome! Feel free to:

Report bugs
Suggest new features
Submit pull requests

📝 License

MIT License - feel free to use in your own projects!

Built with ❤️ for the MCP ecosystem

Related Servers

Scout Monitoring MCP

sponsor

Put performance and error data directly in the hands of your AI assistant.

Alpha Vantage MCP Server

sponsor

Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more

MCP Simple OpenAI Assistant

A simple server for interacting with OpenAI assistants using an API key.

OpenAPI Schema Explorer

Token-efficient access to OpenAPI/Swagger specs via MCP Resources

Remote MCP Server (Authless)

An example of a remote MCP server deployable on Cloudflare Workers, without authentication.

Deepseek Thinker

Provides Deepseek's reasoning capabilities to AI clients, supporting both the Deepseek API and local Ollama server modes.

Onyx MCP Server

Search and query Onyx programming language documentation and GitHub code examples.

Vibe Stack MCP

Helps developers choose the right tech stack for their projects with personalized recommendations.

Prefect

Interact with the Prefect API for workflow orchestration and management.

Bitrix24 MCP-DEV

The MCP server for Bitrix24 provides AI assistants with structured access to the Bitrix24 API. It delivers up-to-date method descriptions, parameters, and valid values, allowing assistants to work with precise data instead of guesswork. This reduces code errors and accelerates Bitrix24 integration development.

Sui MCP Tools

A toolkit for interacting with the Sui blockchain and integrating MCP SDK features, with support for multiple network environments.

go-mcp実験場

A Go-based MCP server example demonstrating correct usage of go.mod and build/run commands.