Chroma MCP Server
An MCP server for the Chroma embedding database, providing persistent, searchable working memory for AI-assisted development with features like automated context recall and codebase indexing.
A Model Context Protocol (MCP) server integration for Chroma, the open-source embedding database.
Overview
Chroma MCP Server creates a persistent, searchable "working memory" for AI-assisted development:
- Automated Context Recall: AI assistants can query relevant information from past sessions
- Developer-Managed Persistence: Store key decisions and insights in ChromaDB via MCP
- Second Brain Integration: Plugs into IDE workflows to create a unified knowledge hub
Key features:
- Automated Codebase Indexing: Track and index code changes
- Automated Chat Logging: Log AI interactions with enhanced context capture (code diffs, tool sequences)
- Bidirectional Linking: Connect discussions to code changes for tracing feature evolution
- Semantic Code Chunking: Preserve logical code structures for more meaningful context retrieval
- Working Memory Tools: MCP commands for capturing and retrieving development context
- Validation System: Evidence-based validation for code changes and learning promotions
- Automated Test-Driven Learning: Fully automated workflow from test failure to verified fix and learning promotion. See the Pytest Plugin Usage Guide to integrate this into your projects.
See the Getting Started with your Second Brain guide for more details.
Quick Start
Installation
# Basic installation
pip install chroma-mcp-server
# Full installation with all embedding models
pip install "chroma-mcp-server[full]"
Running
# With in-memory storage (data lost on restart)
chroma-mcp-server --client-type ephemeral
# With persistent storage
chroma-mcp-server --client-type persistent --data-dir ./my_data
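If you launch the server from a script rather than a shell, the two invocations above can be assembled programmatically. The following is a minimal sketch, not part of the package: `build_server_command` is a hypothetical helper, it only covers the two client types shown above, and its rule that persistent storage must be given a data directory is an assumption of this sketch.

```python
from typing import List, Optional

# Only the two client types shown in this README are handled here.
KNOWN_CLIENT_TYPES = ("ephemeral", "persistent")


def build_server_command(client_type: str = "ephemeral",
                         data_dir: Optional[str] = None) -> List[str]:
    """Return the argv list for starting chroma-mcp-server (hypothetical helper)."""
    if client_type not in KNOWN_CLIENT_TYPES:
        raise ValueError(f"unsupported client type in this sketch: {client_type!r}")
    command = ["chroma-mcp-server", "--client-type", client_type]
    if client_type == "persistent":
        # Assumption of this sketch: refuse to build a persistent command
        # without an explicit data directory.
        if data_dir is None:
            raise ValueError("persistent storage needs a data directory")
        command += ["--data-dir", data_dir]
    return command
```

The returned list can be passed to `subprocess.run` or `subprocess.Popen` as-is, which avoids shell quoting issues with paths that contain spaces.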
Cursor Integration
Add or modify .cursor/mcp.json in your project root:
{
  "mcpServers": {
    "chroma": {
      "command": "uvx",
      "args": [
        "chroma-mcp-server"
      ],
      "env": {
        "CHROMA_CLIENT_TYPE": "persistent",
        "CHROMA_DATA_DIR": "/path/to/your/data",
        "CHROMA_LOG_DIR": "/path/to/your/logs",
        "LOG_LEVEL": "INFO",
        "MCP_LOG_LEVEL": "INFO",
        "MCP_SERVER_LOG_LEVEL": "INFO"
      }
    }
  }
}
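If the project already has a .cursor/mcp.json with other servers in it, the "chroma" entry can be merged in rather than pasted by hand. The sketch below uses only the Python standard library and the keys shown above; `build_cursor_config` and `write_cursor_config` are hypothetical helper names, and applying one log level to all three logging variables is a simplification made here.

```python
import json
from pathlib import Path


def build_cursor_config(data_dir: str, log_dir: str, log_level: str = "INFO") -> dict:
    """Build the "chroma" server entry using the keys from the example above."""
    return {
        "mcpServers": {
            "chroma": {
                "command": "uvx",
                "args": ["chroma-mcp-server"],
                "env": {
                    "CHROMA_CLIENT_TYPE": "persistent",
                    "CHROMA_DATA_DIR": data_dir,
                    "CHROMA_LOG_DIR": log_dir,
                    # Simplification: the same level for all three variables.
                    "LOG_LEVEL": log_level,
                    "MCP_LOG_LEVEL": log_level,
                    "MCP_SERVER_LOG_LEVEL": log_level,
                },
            }
        }
    }


def write_cursor_config(project_root: str, config: dict) -> Path:
    """Merge the entry into <project_root>/.cursor/mcp.json, keeping other servers."""
    path = Path(project_root) / ".cursor" / "mcp.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    existing = {}
    if path.exists():
        existing = json.loads(path.read_text(encoding="utf-8"))
    # Add or overwrite only the "chroma" entry; other entries are left alone.
    existing.setdefault("mcpServers", {}).update(config["mcpServers"])
    path.write_text(json.dumps(existing, indent=2) + "\n", encoding="utf-8")
    return path
```

For example, `write_cursor_config(".", build_cursor_config("/path/to/your/data", "/path/to/your/logs"))` writes the file for the current project.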
Recent Improvements
- Enhanced Context Capture: Automatically extracts code diffs, tool sequences, and assigns confidence scores
- Bidirectional Linking: Creates navigable connections between chat discussions and code changes
- Semantic Code Chunking: Uses logical boundaries (functions, classes) instead of fixed-size chunks
- Server-Side Timestamp Enforcement: Ensures consistent timestamps across all collections
- Automatic Collection Creation: Essential collections (e.g., chat_history_v1, codebase_v1) are automatically created on server startup if they don't exist.
- Enhanced Logging System: Per-execution log files prevent contamination of JSON communication in stdio mode
- Embedding Function Management: Tools to update collection metadata when changing embedding functions
- Collection Setup Command: Simplifies creation of multiple collections with consistent configuration
- Auto-Promote Workflow: Streamlined derived learning promotion with automatic handling of high-confidence entries
- Smart Defaults: Interactive promotion with intelligent defaults for all fields based on context
- Low Confidence Warnings: Visual indicators for entries that may need more careful review
- Automated Test Workflow: Fully automated capture of test failures, monitoring for fixes, and validated learning promotion
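To illustrate what chunking on logical boundaries means in the Semantic Code Chunking item above, here is a minimal sketch that splits a Python file into one chunk per top-level function or class using the standard `ast` module. It is an illustration of the general idea only and is not taken from this project's implementation; `chunk_by_definitions` is a hypothetical name, and the sketch ignores module-level statements and non-Python files.

```python
import ast
from typing import List, Tuple


def chunk_by_definitions(source: str) -> List[Tuple[str, str]]:
    """Return (name, source_text) for each top-level function or class."""
    tree = ast.parse(source)
    lines = source.splitlines()
    chunks = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            start = node.lineno
            # Include decorators, which sit on lines above the def/class line.
            if node.decorator_list:
                start = min(d.lineno for d in node.decorator_list)
            # end_lineno is available on Python 3.8 and later.
            text = "\n".join(lines[start - 1:node.end_lineno])
            chunks.append((node.name, text))
    return chunks
```

Compared with fixed-size windows, each chunk here is a complete definition, so a retrieved chunk never begins or ends in the middle of a function body.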
Documentation
Comprehensive documentation is available in the docs directory:
- Main Documentation - Complete guide to installation, configuration, and usage
- Getting Started - Detailed setup instructions
- Developer Guide - For contributors and developers
- IDE & Tool Integration Guides - Guides for integrating with IDEs and other tools.
- Automated Chat Logging - Enriched chat history with bidirectional linking
- Usage Guides - Detailed guides on how to use specific features and workflows.
- Enhanced Context Capture - Details on code diff extraction and tool sequencing
- Semantic Code Chunking - Logic-preserving code chunking for meaningful retrieval
- Automated Test Workflow (Pytest Plugin Usage) - Test-driven learning with automatic validation
- Thinking Tools & Utilities - Documentation for structured thinking and memory tools.
- Client and Developer Scripts - Guides for CLI tools and developer scripts.
- Logging Documentation - Overview of logging features and configuration.
- Server Logging - Details on the improved logging system
- Automation Documentation - Guides on automating development tasks.
- Project Rules & Guidelines - Development rules, guidelines, and best practices.
- Refactoring Plans - Documentation on various refactoring efforts and architectural plans.
- API Reference - Available MCP tools and parameters
License
Chroma MCP Server is licensed under the MIT License with Commons Clause. In practice, this means:
✅ Allowed:
- Use Chroma MCP Server for any purpose (personal, commercial, academic)
- Modify the code
- Distribute copies
- Create and sell products built using Chroma MCP Server
❌ Not Allowed:
- Sell Chroma MCP Server itself
- Offer Chroma MCP Server as a hosted service
- Create competing products based on Chroma MCP Server
See the LICENSE.md file for the complete license text.