Chroma MCP Server
An MCP server for the Chroma embedding database, providing persistent, searchable working memory for AI-assisted development with features like automated context recall and codebase indexing.
Chroma MCP Server
A Model Context Protocol (MCP) server integration for Chroma, the open-source embedding database.
Overview
Chroma MCP Server creates a persistent, searchable "working memory" for AI-assisted development:
- Automated Context Recall: AI assistants can query relevant information from past sessions
- Developer-Managed Persistence: Store key decisions and insights in ChromaDB via MCP
- Second Brain Integration: Integrates with IDE workflows to create a unified knowledge hub
Key features:
- Automated Codebase Indexing: Track and index code changes
- Automated Chat Logging: Log AI interactions with enhanced context capture (code diffs, tool sequences)
- Bidirectional Linking: Connect discussions to code changes for tracing feature evolution
- Semantic Code Chunking: Preserve logical code structures for more meaningful context retrieval
- Working Memory Tools: MCP commands for capturing and retrieving development context
- Validation System: Evidence-based validation for code changes and learning promotions
- Automated Test-Driven Learning: Fully automated workflow from test failure to verified fix and learning promotion. See the Pytest Plugin Usage Guide to integrate this into your projects.
See the Getting Started with your Second Brain guide for more details.
Quick Start
Installation
# Basic installation
pip install chroma-mcp-server
# Full installation with all embedding models
pip install "chroma-mcp-server[full]"
Running
# With in-memory storage (data lost on restart)
chroma-mcp-server --client-type ephemeral
# With persistent storage
chroma-mcp-server --client-type persistent --data-dir ./my_data
Cursor Integration
Add or modify .cursor/mcp.json in your project root:
{
"mcpServers": {
"chroma": {
"command": "uvx",
"args": [
"chroma-mcp-server"
],
"env": {
"CHROMA_CLIENT_TYPE": "persistent",
"CHROMA_DATA_DIR": "/path/to/your/data",
"CHROMA_LOG_DIR": "/path/to/your/logs",
"LOG_LEVEL": "INFO",
"MCP_LOG_LEVEL": "INFO",
"MCP_SERVER_LOG_LEVEL": "INFO"
}
}
}
}
Recent Improvements
- Enhanced Context Capture: Automatically extracts code diffs, tool sequences, and assigns confidence scores
- Bidirectional Linking: Creates navigable connections between chat discussions and code changes
- Semantic Code Chunking: Uses logical boundaries (functions, classes) instead of fixed-size chunks
- Server-Side Timestamp Enforcement: Ensures consistent timestamps across all collections
- Automatic Collection Creation: Essential collections (e.g.,
chat_history_v1,codebase_v1) are automatically created on server startup if they don't exist. - Enhanced Logging System: Per-execution log files prevent contamination of JSON communication in stdio mode
- Embedding Function Management: Tools to update collection metadata when changing embedding functions
- Collection Setup Command: Simplifies creation of multiple collections with consistent configuration
- Auto-Promote Workflow: Streamlined derived learning promotion with automatic handling of high-confidence entries
- Smart Defaults: Interactive promotion with intelligent defaults for all fields based on context
- Low Confidence Warnings: Visual indicators for entries that may need more careful review
- Automated Test Workflow: Fully automated capture of test failures, monitoring for fixes, and validated learning promotion
Documentation
Comprehensive documentation is available in the docs directory:
- Main Documentation - Complete guide to installation, configuration, and usage
- Getting Started - Detailed setup instructions
- Developer Guide - For contributors and developers
- IDE & Tool Integration Guides - Guides for integrating with IDEs and other tools.
- Automated Chat Logging - Enriched chat history with bidirectional linking
- Usage Guides - Detailed guides on how to use specific features and workflows.
- Enhanced Context Capture - Details on code diff extraction and tool sequencing
- Semantic Code Chunking - Logic-preserving code chunking for meaningful retrieval
- Automated Test Workflow (Pytest Plugin Usage) - Test-driven learning with automatic validation
- Thinking Tools & Utilities - Documentation for structured thinking and memory tools.
- Client and Developer Scripts - Guides for CLI tools and developer scripts.
- Logging Documentation - Overview of logging features and configuration.
- Server Logging - Details on the improved logging system
- Automation Documentation - Guides on automating development tasks.
- Project Rules & Guidelines - Development rules, guidelines, and best practices.
- Refactoring Plans - Documentation on various refactoring efforts and architectural plans.
- API Reference - Available MCP tools and parameters
License
Chroma MCP Server is licensed under the MIT License with Commons Clause. This means you can:
✅ Allowed:
- Use Chroma MCP Server for any purpose (personal, commercial, academic)
- Modify the code
- Distribute copies
- Create and sell products built using Chroma MCP Server
❌ Not Allowed:
- Sell Chroma MCP Server itself
- Offer Chroma MCP Server as a hosted service
- Create competing products based on Chroma MCP Server
See the LICENSE.md file for the complete license text.
Related Servers
Opera Omnia
Access a rich collection of JSON datasets for games, storytelling, and bot development from the Opera Omnia project.
SAP Fieldglass MCP Server by CData
A read-only MCP server for querying live SAP Fieldglass data, powered by the CData JDBC Driver.
Data Exploration
MCP server for autonomous data exploration on .csv-based datasets, providing intelligent insights with minimal effort.
PawSQL MCP Server
A SQL optimization service providing performance analysis and optimization suggestions through an API.
Google Directory by CData
A read-only MCP server for querying live Google Directory data using the CData JDBC Driver.
CData SAP HANA XS Advanced Server
A read-only MCP server for SAP HANA XS Advanced, powered by the CData JDBC Driver.
NCBI Entrez MCP Server
Access NCBI's suite of APIs, including E-utilities, BLAST, PubChem, and PMC services.
MCP Memory Server
An advanced memory system for Claude Desktop that provides persistent memory using MCP. Requires an Azure Cosmos DB account and an OpenAI API key.
Microsoft Fabric Analytics
An analytics server providing tools for interacting with the Microsoft Fabric data platform.
dbt CLI
An MCP server that wraps the dbt CLI, allowing AI agents to interact with dbt projects.