Greptile MCP Server

An MCP (Model Context Protocol) server implementation that integrates with the Greptile API to provide code search and querying capabilities to AI agents.

smithery badge

Features

The server provides four essential Greptile tools that enable AI agents to interact with codebases:

index_repository: Index a repository for code search and querying.
- Process a repository to make it searchable
- Update existing indexes when repositories change
- Configure notification preferences
query_repository: Query repositories to get answers with code references.
- Ask natural language questions about the codebase
- Get detailed answers that reference specific code locations
- Support for conversation history with session IDs
search_repository: Search repositories for relevant files without generating a full answer.
- Find files related to specific concepts or features
- Get contextual matches ranked by relevance
- Faster than full queries when only file locations are needed
get_repository_info: Get information about an indexed repository.
- Check indexing status and progress
- Verify which repositories are available for querying
- Get metadata about indexed repositories

Smithery Deployment

The Greptile MCP server supports deployment via Smithery. A smithery.yaml configuration file is included in the project root.

Smithery Configuration

The Smithery configuration is defined in smithery.yaml and supports the following options:

build: dockerfile: Dockerfile

startCommand: type: stdio configSchema: type: object required: - greptileApiKey - githubToken properties: greptileApiKey: type: string description: "API key for accessing the Greptile API" githubToken: type: string description: "GitHub Personal Access Token for repository access" host: type: string description: "Host to bind to when using SSE transport" default: "0.0.0.0" port: type: string description: "Port to listen on when using SSE transport" default: "8050"

Using with Smithery

To deploy using Smithery:

Install Smithery: npm install -g smithery
Deploy the server: smithery deploy
Configure your Smithery client with the required API keys

Prerequisites

Python 3.12+
Greptile API Key (from https://app.greptile.com/settings/api)
GitHub or GitLab Personal Access Token (PAT) with repo (or equivalent read) permissions for the repositories you intend to index
Docker (recommended for deployment)

Required Python Packages

fastmcp - MCP server implementation
httpx - Async HTTP client
python-dotenv - Environment variable management
uvicorn - ASGI server for SSE transport

Installation

Using pip

Clone this repository:
git clone https://github.com/sosacrazy126/greptile-mcp.git
cd greptile-mcp
Create a virtual environment:
python -m venv .venv
source .venv/bin/activate # On Windows use .venv\Scripts\activate
Install dependencies:
pip install -r requirements.txt
Set your environment variables:
export GREPTILE_API_KEY=your_api_key_here
export GITHUB_TOKEN=your_github_token_here

Using Docker

Clone the repository:
git clone https://github.com/sosacrazy126/greptile-mcp.git
cd greptile-mcp
Build the Docker image:
docker build -t greptile-mcp .

Running the Server

The Greptile MCP server supports two modes of operation:

1. MCP Mode (Default)

Traditional MCP server for direct MCP client integration.

Using pip

python -m src.main

Using Docker

docker run --rm -e GREPTILE_API_KEY=your_key -e GITHUB_TOKEN=your_token -p 8050:8050 greptile-mcp

2. HTTP/JSON-RPC Mode (New)

HTTP server providing JSON-RPC 2.0 interface for web applications and REST clients.

Using pip

python -m src.main_http

Using Docker (HTTP Mode)

docker run --rm -e GREPTILE_API_KEY=your_key -e GITHUB_TOKEN=your_token -p 8080:8080 greptile-mcp python -m src.main_http

Development Mode (with auto-reload)

python -m src.main_http --dev

HTTP Mode Features

JSON-RPC 2.0 compliant API at /json-rpc
Interactive documentation at /docs (Swagger UI)
Alternative documentation at /redoc
Health check endpoint at /health
Method documentation at /api/methods
Rate limiting (100 requests/hour per IP)
CORS support for web applications

HTTP Usage Examples

Using curl

Index a repository

curl -X POST http://localhost:8080/json-rpc
-H "Content-Type: application/json"
-d '{ "jsonrpc": "2.0", "method": "index_repository", "params": { "remote": "github", "repository": "facebook/react", "branch": "main" }, "id": "1" }'

Query a repository

curl -X POST http://localhost:8080/json-rpc
-H "Content-Type: application/json"
-d '{ "jsonrpc": "2.0", "method": "query_repository", "params": { "query": "How does useState work?", "repositories": [ { "remote": "github", "repository": "facebook/react", "branch": "main" } ] }, "id": "2" }'

Using JavaScript/fetch

// Index a repository const indexResponse = await fetch('http://localhost:8080/json-rpc', { method: 'POST', headers: { 'Content-Type': 'application/json', }, body: JSON.stringify({ jsonrpc: '2.0', method: 'index_repository', params: { remote: 'github', repository: 'facebook/react', branch: 'main' }, id: '1' }) });

const indexResult = await indexResponse.json(); console.log('Index result:', indexResult);

// Query the repository const queryResponse = await fetch('http://localhost:8080/json-rpc', { method: 'POST', headers: { 'Content-Type': 'application/json', }, body: JSON.stringify({ jsonrpc: '2.0', method: 'query_repository', params: { query: 'How does the component lifecycle work?', repositories: [ { remote: 'github', repository: 'facebook/react', branch: 'main' } ] }, id: '2' }) });

const queryResult = await queryResponse.json(); console.log('Query result:', queryResult);

Using Python requests

import requests import json

Configuration

base_url = "http://localhost:8080/json-rpc" headers = {"Content-Type": "application/json"}

Index a repository

index_payload = { "jsonrpc": "2.0", "method": "index_repository", "params": { "remote": "github", "repository": "facebook/react", "branch": "main" }, "id": "1" }

response = requests.post(base_url, headers=headers, json=index_payload) print("Index result:", response.json())

Query the repository

query_payload = { "jsonrpc": "2.0", "method": "query_repository", "params": { "query": "Explain React hooks", "repositories": [ { "remote": "github", "repository": "facebook/react", "branch": "main" } ] }, "id": "2" }

response = requests.post(base_url, headers=headers, json=query_payload) print("Query result:", response.json())

Integration with MCP Clients

Configure your MCP client to connect to the server:

{ "mcpServers": { "greptile": { "transport": "sse", "url": "http://localhost:8050/sse" } } }

Detailed Usage Guide

Workflow for Codebase Analysis

Index repositories you want to analyze using index_repository
Verify indexing status with get_repository_info to ensure processing is complete
Query the repositories using natural language with query_repository
Find specific files related to features or concepts using search_repository

Session Management for Conversation Context

When interacting with the Greptile MCP server through any client (including Smithery), proper session management is crucial for maintaining conversation context:

Generate a unique session ID at the beginning of a conversation
Reuse the same session ID for all related follow-up queries
Create a new session ID when starting a new conversation

Example session ID management:

Generate a unique session ID

import uuid session_id = str(uuid.uuid4())

Initial query

initial_response = query_repository( query="How is authentication implemented?", repositories=[{"remote": "github", "repository": "owner/repo", "branch": "main"}], session_id=session_id # Include the session ID )

Follow-up query using the SAME session ID

followup_response = query_repository( query="Can you provide more details about the JWT verification?", repositories=[{"remote": "github", "repository": "owner/repo", "branch": "main"}], session_id=session_id # Reuse the same session ID )

Important for Smithery Integration: Agents connecting via Smithery must generate and maintain their own session IDs. The Greptile MCP server does NOT automatically generate session IDs. The session ID should be part of the agent's conversation state.

Best Practices

Indexing Performance: Smaller repositories index faster. For large monorepos, consider indexing specific branches or tags.
Query Optimization: Be specific in your queries. Include relevant technical terms for better results.
Repository Selection: When querying multiple repositories, list them in order of relevance to get the best results.
Session Management: Use session IDs for follow-up questions to maintain context across queries.

API Reference

1. Index Repository

Indexes a repository to make it searchable in future queries.

Parameters:

remote (string): The repository host, either "github" or "gitlab"
repository (string): The repository in owner/repo format (e.g., "greptileai/greptile")
branch (string): The branch to index (e.g., "main")
reload (boolean, optional): Whether to force reprocessing of a previously indexed repository
notify (boolean, optional): Whether to send an email notification when indexing is complete

Example:

// Tool Call: index_repository { "remote": "github", "repository": "greptileai/greptile", "branch": "main", "reload": false, "notify": false }

Response:

{ "message": "Indexing Job Submitted for: greptileai/greptile", "statusEndpoint": "https://api.greptile.com/v2/repositories/github:main:greptileai%2Fgreptile" }

2. Query Repository

Queries repositories with natural language to get answers with code references.

Parameters:

query (string): The natural language query about the codebase
repositories (array): List of repositories to query, each with format:
{
"remote": "github",
"repository": "owner/repo",
"branch": "main"
}
session_id (string, optional): Session ID for continuing a conversation
stream (boolean, optional): Whether to stream the response
genius (boolean, optional): Whether to use enhanced query capabilities

Example:

// Tool Call: query_repository { "query": "How is authentication handled in this codebase?", "repositories": [ { "remote": "github", "repository": "greptileai/greptile", "branch": "main" } ], "session_id": null, "stream": false, "genius": true }

Response:

{ "message": "Authentication in this codebase is handled using JWT tokens...", "sources": [ { "repository": "greptileai/greptile", "remote": "github", "branch": "main", "filepath": "/src/auth/jwt.js", "linestart": 14, "lineend": 35, "summary": "JWT token validation middleware" } ] }

3. Search Repository

Searches repositories to find relevant files without generating a full answer.

Parameters:

query (string): The search query about the codebase
repositories (array): List of repositories to search
session_id (string, optional): Session ID for continuing a conversation
genius (boolean, optional): Whether to use enhanced search capabilities

4. Get Repository Info

Gets information about a specific repository that has been indexed.

Parameters:

remote (string): The repository host, either "github" or "gitlab"
repository (string): The repository in owner/repo format
branch (string): The branch that was indexed

Environment Variables

Variable	Description	Default
GREPTILE_API_KEY	Your Greptile API key	(required)
GITHUB_TOKEN	GitHub/GitLab personal access token	(required)
HOST	Host to bind to	0.0.0.0
PORT	Port to listen on	8050

License

This project is licensed under the MIT License.

Built by @sosacrazy126

Greptile

Greptile MCP Server

Features

Smithery Deployment

Smithery Configuration

Using with Smithery

Prerequisites

Required Python Packages

Installation

Using pip

Using Docker

Running the Server

1. MCP Mode (Default)

Using pip

Using Docker

2. HTTP/JSON-RPC Mode (New)

Using pip

Using Docker (HTTP Mode)

Development Mode (with auto-reload)

HTTP Mode Features

HTTP Usage Examples

Using curl

Index a repository

Query a repository

Using JavaScript/fetch

Using Python requests

Configuration

Index a repository

Query the repository

Integration with MCP Clients

Detailed Usage Guide

Workflow for Codebase Analysis

Session Management for Conversation Context

Generate a unique session ID

Initial query

Follow-up query using the SAME session ID

Best Practices

API Reference

1. Index Repository

2. Query Repository

3. Search Repository

4. Get Repository Info

Environment Variables

License

Related Servers

招投标大数据服务

Minima

Movies Search Tool

Semantic Scholar

Gemini Web Search

Brave Search

Tavily Search

Pearch

企业基础信息服务

Spryker Search Tool