Smriti MCP

Smriti, LLM uygulamaları için kalıcı, grafik tabanlı bellek sağlayan bir Model Context Protocol (MCP) sunucusudur. LadybugDB (gömülü özellik grafik veritabanı) üzerine inşa edilmiş olup, EcphoryRAG esinli çok aşamalı erişim kullanır - ipucu çıkarma, grafik dolaşımı, vektör benzerliği ve çok atlamalı ilişkilendirmeyi birleştirerek insan benzeri bellek hatırlaması sunar.

GitHub

Dokümantasyon

Smriti MCP

Graph-Based AI Memory System with EcphoryRAG Retrieval and Leiden Clustering

Smriti is a Model Context Protocol (MCP) server that provides persistent, graph-based memory for LLM applications. It supports three database backends — LadybugDB (embedded), Neo4j, and FalkorDB — and uses EcphoryRAG-inspired multi-stage retrieval — combining cue extraction, graph traversal, vector similarity, and multi-hop association — to deliver human-like memory recall. Smriti uses the Leiden algorithm for automatic community detection, enabling cluster-aware retrieval that scales beyond thousands of memories.

Features

Graph-Based Memory — Engrams (memories) linked via Cues and Associations in a property graph
EcphoryRAG Retrieval — Multi-hop associative recall with cue extraction, vector similarity, and composite scoring
Leiden Community Detection — Automatic clustering of related memories using the Leiden algorithm with smart-cached resolution tuning, enabling cluster-aware scoring for efficient retrieval at scale
Multi-Backend Support — LadybugDB (embedded, zero-config), Neo4j (enterprise graph DB), or FalkorDB (Redis-based graph DB)
Multi-User Isolation — Per-file (LadybugDB), per-tenant property or per-database (Neo4j), or per-tenant property or per-graph (FalkorDB)
Automatic Consolidation — Exponential decay, pruning of weak memories, strengthening of frequently accessed ones, and periodic Leiden re-clustering
Flexible Backup — GitHub (system git) or S3 (AWS SDK) sync, plus noop for local-only
Lazy HNSW Indexing — Vector and FTS indexes created on-demand when dataset exceeds threshold
OpenAI-Compatible APIs — Works with any OpenAI-compatible LLM and embedding provider
3 MCP Tools — smriti_store, smriti_recall, smriti_manage

Architecture

graph TD
    Client["MCP Client<br/>(Cursor / Claude / Windsurf / etc.)"]
    Client -->|stdio| Server

    subgraph Server["Smriti MCP Server"]
        direction TB

        subgraph Tools["MCP Tools"]
            Store["smriti_store"]
            Recall["smriti_recall"]
            Manage["smriti_manage"]
        end

        subgraph Engine["Memory Engine"]
            Encoding["Encoding<br/>LLM + Embed + Link"]
            Retrieval["Retrieval<br/>Cue Match + Vector + Multi-hop<br/>+ Cluster-Aware Scoring"]
            Consolidation["Consolidation<br/>Decay + Prune + Leiden Clustering"]
        end

        subgraph DB["Graph Database"]
            direction LR
            Graph["(Engram)──[:EncodedBy]──▶(Cue)<br/>(Engram)──[:AssociatedWith]──▶(Engram)<br/>(Cue)──[:CoOccurs]──▶(Cue)"]
            DBType["LadybugDB | Neo4j | FalkorDB"]
        end

        subgraph Backup["Backup Provider (optional)"]
            Git["GitHub (git)"]
            S3["S3 (AWS SDK)"]
            Noop["Noop"]
        end

        Store & Recall & Manage --> Engine
        Encoding & Retrieval & Consolidation --> DB
        DB --> Backup
    end

    LLM["LLM / Embedding API<br/>(OpenAI-compatible)"]
    Engine --> LLM

Recall Pipeline

The default recall mode performs multi-stage retrieval:

Cue Extraction — LLM extracts entities and keywords from the query
Cue-Based Graph Traversal — Follows EncodedBy edges to find engrams linked to matching cues
Vector Similarity Search — Cosine similarity against all engram embeddings (HNSW index when available, fallback to brute-force)
Multi-Hop Expansion — Follows AssociatedWith edges to discover related memories
Cluster-Aware Composite Scoring — Blends vector similarity (40%), recency (20%), importance (20%), and decay (20%), with hop-depth penalty and soft-bounded cross-cluster penalty (0.5x for hop results outside the seed cluster)
Access Strengthening — Recalled engrams get their access count and decay factor bumped (reinforcement)

Leiden Clustering

Smriti uses the Leiden algorithm — an improvement over Louvain that guarantees well-connected communities — to automatically detect clusters of related memories in the graph.

How it works:

Runs automatically during each consolidation cycle
Builds a weighted undirected graph from AssociatedWith edges between engrams
Auto-tunes the resolution parameter using community profiling on the first run
Uses a smart cache: the tuned resolution is reused across runs and only re-tuned when the graph grows by more than 10%
Assigns a cluster_id to each engram, stored persistently in the database
New engrams inherit the cluster_id of their strongest neighbor at encode time

How it improves retrieval:

The recall pipeline determines a seed cluster (most common cluster among direct-match results)
Multi-hop results that cross into a different cluster receive a 0.5x score penalty (soft-bounded: they are penalized, not dropped)
This keeps retrieval focused within the most relevant topic cluster while still allowing cross-topic discovery

Performance characteristics:

Gracefully skips on small graphs (< 3 nodes or 0 edges)
Clustering 60 nodes: ~40ms (first run with auto-tune), ~14ms (cached resolution)
Per-user: each Engine instance maintains its own independent cache

Consolidation Pipeline

Consolidation runs periodically (default: every 3600 seconds) and performs:

Exponential Decay — Reduces decay_factor based on time since last access
Weak Memory Pruning — Removes engrams below minimum decay threshold
Frequency Strengthening — Boosts decay factor for frequently accessed memories
Orphaned Cue Cleanup — Removes cues no longer linked to any engram
Leiden Clustering — Re-clusters the memory graph (smart-cached, skips if graph hasn't changed significantly)
Index Management — Creates HNSW vector and FTS indexes when engram count exceeds threshold (50)

Requirements

Go 1.25+ — For building from source
Git 2.x+ — Required for GitHub backup provider (must be in PATH)
GCC/Build Tools — Required for CGO (LadybugDB backend)
- macOS: xcode-select --install
- Linux: sudo apt install build-essential
- Windows: Use Docker (recommended) or MinGW
liblbug (LadybugDB shared library) — Runtime dependency for LadybugDB backend, downloaded automatically by go-ladybug during build. If building manually, grab the latest release from LadybugDB/ladybug:

Platform Asset Library
macOS liblbug-osx-arm64.tar.gz / liblbug-osx-x86_64.tar.gz liblbug.dylib
Linux liblbug-linux-{arch}.tar.gz liblbug.so
Windows liblbug-windows-x86_64.zip liblbug.dll

The shared library must be on the system library path at runtime (e.g., DYLD_LIBRARY_PATH on macOS, LD_LIBRARY_PATH on Linux, or alongside the binary on Windows). Docker and release binaries bundle this automatically.
Neo4j 5.x+ — Required only when using DB_TYPE=neo4j. Must have APOC and GDS plugins for vector search and full-text indexing.
FalkorDB — Required only when using DB_TYPE=falkordb. Runs on Redis protocol (default port 6379).

Platform	Asset	Library
macOS	`liblbug-osx-arm64.tar.gz` / `liblbug-osx-x86_64.tar.gz`	`liblbug.dylib`
Linux	`liblbug-linux-{arch}.tar.gz`	`liblbug.so`
Windows	`liblbug-windows-x86_64.zip`	`liblbug.dll`

Quick Start

1. Build

# Build
CGO_ENABLED=1 go build -o smriti-mcp .

# Run (minimal config)
export LLM_API_KEY=your-api-key
export ACCESSING_USER=alice
./smriti-mcp

2. MCP Client Integration

Option 1: Native Binary

Cursor (~/.cursor/mcp_settings.json):

{
  "mcpServers": {
    "smriti": {
      "command": "/path/to/smriti-mcp",
      "env": {
        "LLM_API_KEY": "your-api-key",
        "EMBEDDING_API_KEY": "your-embedding-key"
      }
    }
  }
}

Claude Desktop (~/Library/Application Support/Claude/claude_desktop_config.json):

{
  "mcpServers": {
    "smriti": {
      "command": "/path/to/smriti-mcp",
      "args": [],
      "env": {
        "LLM_API_KEY": "your-api-key",
        "EMBEDDING_API_KEY": "your-embedding-key"
      }
    }
  }
}

Windsurf (~/.codeium/windsurf/mcp_config.json):

{
  "mcpServers": {
    "smriti": {
      "command": "/path/to/smriti-mcp",
      "env": {
        "LLM_API_KEY": "your-api-key",
        "EMBEDDING_API_KEY": "your-embedding-key"
      }
    }
  }
}

Option 2: Go Run

Run directly without installing — similar to npx for Node.js:

{
  "mcpServers": {
    "smriti": {
      "command": "go",
      "args": ["run", "github.com/tejzpr/smriti-mcp@latest"],
      "env": {
        "LLM_API_KEY": "your-api-key",
        "EMBEDDING_API_KEY": "your-embedding-key"
      }
    }
  }
}

Option 3: Docker Container

Simple mode (single user):

{
  "mcpServers": {
    "smriti": {
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "-v", "/Users/yourname/.smriti:/home/smriti/.smriti",
        "-e", "LLM_API_KEY=your-api-key",
        "-e", "EMBEDDING_API_KEY=your-embedding-key",
        "tejzpr/smriti-mcp"
      ]
    }
  }
}

Multi-user mode:

{
  "mcpServers": {
    "smriti": {
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "-v", "/Users/yourname/.smriti:/home/smriti/.smriti",
        "-e", "LLM_API_KEY=your-api-key",
        "-e", "EMBEDDING_API_KEY=your-embedding-key",
        "-e", "ACCESSING_USER=yourname",
        "tejzpr/smriti-mcp"
      ]
    }
  }
}

Note:

Replace /Users/yourname with your actual home directory path

MCP clients do not expand $HOME or ~ in JSON configs — use absolute paths

The .smriti volume mount persists your memory database

The container runs as non-root user smriti

Build locally (optional):

docker build -t smriti-mcp .

Then use smriti-mcp instead of tejzpr/smriti-mcp in your config.

Option 4: GitHub Release Binary

Download pre-built binaries from the Releases page. Binaries are available for:

Platform	Architecture	CGO
Linux	amd64	Enabled (native)
macOS	arm64 (Apple Silicon)	Enabled (native)
Windows	amd64	Enabled (native)

Each release includes a checksums-sha256.txt for verification.

Environment Variables

Core

Variable	Default	Description
`ACCESSING_USER`	OS username	User identifier (used for DB isolation)
`STORAGE_LOCATION`	`~/.smriti`	Root storage directory (LadybugDB only)
`DB_TYPE`	`ladybug`	Database backend: `ladybug`, `neo4j`, or `falkordb`

LLM

Variable	Default	Description
`LLM_BASE_URL`	`https://api.openai.com/v1`	LLM API endpoint (OpenAI-compatible)
`LLM_API_KEY`	(required)	LLM API key
`LLM_MODEL`	`gpt-4o-mini`	LLM model name

Embedding

Variable	Default	Description
`EMBEDDING_BASE_URL`	`https://api.openai.com/v1`	Embedding API endpoint
`EMBEDDING_API_KEY`	(falls back to LLM_API_KEY)	Embedding API key
`EMBEDDING_MODEL`	`text-embedding-3-small`	Embedding model name
`EMBEDDING_DIMS`	`1536`	Embedding vector dimensions

Backup

Variable	Default	Description
`BACKUP_TYPE`	`none`	`none`, `github`, or `s3`
`BACKUP_SYNC_INTERVAL`	`60`	Seconds between backup syncs (0 = disabled)
`GIT_BASE_URL`	(empty)	Git remote base URL (required if `github`)
`S3_ENDPOINT`	(empty)	S3 endpoint (for non-AWS providers)
`S3_REGION`	(empty)	S3 region (required if `s3`)
`S3_ACCESS_KEY`	(empty)	S3 access key (required if `s3`)
`S3_SECRET_KEY`	(empty)	S3 secret key (required if `s3`)

Neo4j (when DB_TYPE=neo4j)

Variable	Default	Description
`NEO4J_URI`	(required)	Bolt URI (e.g. `bolt://localhost:7687`)
`NEO4J_USERNAME`	(required)	Neo4j username
`NEO4J_PASSWORD`	(required)	Neo4j password
`NEO4J_DATABASE`	`neo4j`	Database name (overridden by username in `database` isolation mode)
`NEO4J_ISOLATION`	`tenant`	`tenant` (property-based, Community Edition) or `database` (per-DB, Enterprise Edition)

FalkorDB (when DB_TYPE=falkordb)

Variable	Default	Description
`FALKOR_ADDR`	`localhost:6379`	FalkorDB Redis address
`FALKOR_PASSWORD`	(empty)	FalkorDB password (if auth enabled)
`FALKOR_GRAPH`	`smriti`	Graph name (overridden by `{user}_smriti` in `graph` isolation mode)
`FALKOR_ISOLATION`	`tenant`	`tenant` (property-based) or `graph` (per-graph isolation)

Consolidation

Variable	Default	Description
`CONSOLIDATION_INTERVAL`	`3600`	Seconds between consolidation runs (0 = disabled)

MCP Tools

smriti_store

"Remember this" — Store a new memory. Content is automatically analyzed by the LLM, embedded, and woven into the memory graph. New engrams inherit the cluster_id of their most similar existing neighbor.

{
  "content": "Kubernetes uses etcd as its backing store for all cluster data",
  "importance": 0.8,
  "tags": "kubernetes,etcd,infrastructure",
  "source": "meeting-notes"
}

Parameter	Type	Required	Description
`content`	string	yes	Memory content
`importance`	number	no	Priority 0.0–1.0 (default: 0.5)
`tags`	string	no	Comma-separated tags
`source`	string	no	Source/origin label

smriti_recall

"What do I know about X?" — Retrieve memories using multi-stage EcphoryRAG retrieval with cluster-aware scoring.

{
  "query": "container orchestration tools",
  "limit": 5,
  "mode": "recall"
}

Parameter	Type	Required	Description
`query`	string	no	Natural language query (omit for list mode)
`limit`	number	no	Max results (default: 5)
`mode`	string	no	`recall` (deep multi-hop), `search` (fast vector-only), or `list` (browse)
`memory_type`	string	no	Filter: `episodic`, `semantic`, `procedural`

Modes explained:

recall (default) — Full pipeline: cue extraction → graph traversal → vector search → multi-hop → cluster-aware composite scoring
search — Vector-only cosine similarity. Faster but shallower.
list — No search. Returns recent memories ordered by last access time.

smriti_manage

"Forget this / sync now" — Administrative operations.

{
  "action": "forget",
  "memory_id": "abc-123-def"
}

Parameter	Type	Required	Description
`action`	string	yes	`forget` (delete memory) or `sync` (push backup)
`memory_id`	string	if forget	Engram ID to delete

Graph Schema

Smriti stores memories in a property graph with the following structure:

Node Tables:
  Engram   — id, content, summary, memory_type, importance, access_count,
              created_at, last_accessed_at, decay_factor, embedding, source,
              tags, cluster_id
  Cue      — id, name, cue_type, embedding

Relationship Tables:
  EncodedBy      — (Engram) → (Cue)
  AssociatedWith — (Engram) → (Engram)  [strength, relation_type, created_at]
  CoOccurs       — (Cue) → (Cue)       [strength]

The cluster_id field on Engram nodes is managed by the Leiden algorithm. A value of -1 indicates the engram has not yet been assigned to a cluster (e.g., the graph is too small, or the engram has no associations).

Storage & Isolation

Smriti supports three database backends with different storage and isolation models:

LadybugDB (default)

Each user gets an isolated embedded database file:

~/.smriti/
└── {username}/
    └── memory.lbug     # LadybugDB property graph database

The STORAGE_LOCATION env var controls the root. The ACCESSING_USER env var selects which user's DB to open. Backup providers sync the user directory to remote storage.

Neo4j

Two isolation modes controlled by NEO4J_ISOLATION:

tenant (default) — All users share one database. Each node gets a user property and all queries filter by it. Works on Neo4j Community Edition.
database — Each user gets a separate Neo4j database. Requires Neo4j Enterprise Edition.

FalkorDB

Two isolation modes controlled by FALKOR_ISOLATION:

tenant (default) — All users share one graph. Each node gets a user property and all queries filter by it.
graph — Each user gets a separate graph (named {user}_smriti).

Schema migrations (e.g., adding cluster_id to existing databases) run automatically on startup.

Project Structure

smriti-mcp/
├── main.go              # Entry point, server setup, signal handling
├── config/              # Environment variable parsing
├── llm/                 # OpenAI-compatible HTTP client (LLM + embeddings)
├── db/                  # Database backends (LadybugDB, Neo4j, FalkorDB), schema, indexes, migrations
├── memory/
│   ├── engine.go        # Engine struct, consolidation loop
│   ├── types.go         # Engram, Cue, Association, SearchResult structs
│   ├── encoding.go      # Store pipeline: LLM extraction → embed → link → cluster inherit
│   ├── retrieval.go     # Recall pipeline: cue search → vector → multi-hop → cluster scoring
│   ├── search.go        # Search modes: list, vector-only, FTS, hybrid
│   ├── consolidation.go # Decay, prune, strengthen, orphan cleanup
│   └── leiden.go        # Leiden clustering: graph build, auto-tune, smart cache, batch write
├── backup/              # Backup providers: noop, github (git), s3 (AWS SDK)
├── tools/               # MCP tool definitions: store, recall, manage
└── testutil/            # Shared test helpers

Testing

# Run unit tests
CGO_ENABLED=1 go test ./...

# Verbose with all output
CGO_ENABLED=1 go test -v ./...

# Specific package
CGO_ENABLED=1 go test -v ./memory/...
CGO_ENABLED=1 go test -v ./tools/...

# Leiden clustering tests only
CGO_ENABLED=1 go test -v -run "TestRunLeiden|TestNeedsRetune|TestDetermineSeedCluster" ./memory/

E2E / Integration Tests

E2E tests require real LLM/embedding services and are gated behind the integration build tag:

# LadybugDB E2E (no external DB required)
CGO_ENABLED=1 go test -tags integration -v -run "TestE2E_LadybugDB" ./memory/

# Neo4j E2E (requires running Neo4j instance)
NEO4J_URI="bolt://localhost:7687" NEO4J_USERNAME="neo4j" NEO4J_PASSWORD="yourpass" \
  CGO_ENABLED=1 go test -tags integration -v -run "TestE2E_Neo4j" ./memory/

# FalkorDB E2E (requires running FalkorDB instance)
FALKOR_ADDR="localhost:6379" \
  CGO_ENABLED=1 go test -tags integration -v -run "TestE2E_FalkorDB" ./memory/

# All E2E tests
CGO_ENABLED=1 go test -tags integration -v -run "TestE2E_" ./memory/

All E2E tests require LLM_BASE_URL, LLM_API_KEY, LLM_MODEL, EMBEDDING_BASE_URL, EMBEDDING_MODEL, and EMBEDDING_API_KEY environment variables.

Contributing

Contributions are welcome! Please ensure:

All tests pass (CGO_ENABLED=1 go test ./...)
Code is properly formatted (go fmt ./...)
New code includes the SPDX license header

See CONTRIBUTORS.md for the contributor list.

License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0) from v1.0.7 onwards.

Versions prior to v1.0.7 are licensed under the Mozilla Public License 2.0.