CogmemAi

Persistent cognitive memory for Claude Code. Cloud-based semantic search, Ai-powered extraction, project scoping, and compaction recovery.

npm version License: MIT

CogmemAi — Cognitive Memory for Ai Coding Assistants

Your Ai coding assistant forgets everything between sessions. CogmemAi fixes that.

One command. Your assistant remembers your architecture, patterns, decisions, bugs, and preferences — permanently. Works with Claude Code, Cursor, Windsurf, Cline, Continue, and any MCP-compatible tool. Switch editors, switch models, switch machines — your knowledge stays.

What's New — Intelligence Engine + Auto-Skills (v3.5)

CogmemAi now gets smarter every time you use it. The Intelligence Engine is a self-improving memory system that learns what matters, connects related knowledge automatically, and synthesizes answers from your entire memory. Auto-Skills takes it further — CogmemAi doesn't just remember, it learns how to behave.

Auto-Skills (Closed-Loop Learning)

  • Behavioral skills — CogmemAi automatically synthesizes your corrections, preferences, and patterns into behavioral directives that tell your Ai assistant HOW to work, not just what to know
  • Closed learning loop — correct your assistant once, and CogmemAi detects the pattern. After enough evidence accumulates, it generates a skill that prevents the mistake from ever happening again
  • Confidence tracking — each skill has a confidence score that rises when it works and drops when it doesn't. Low-confidence skills are automatically retired
  • Self-evaluation — skills periodically review themselves against new evidence and adapt, strengthen, or retire as your practices evolve

Intelligence Engine

  • Self-improving recall — memories that consistently help you rank higher over time; memories you never use fade naturally. Your recall quality improves automatically with every session
  • Auto-linking knowledge graph — related memories are automatically connected when you save them. Your knowledge builds into a web of relationships, not a flat list
  • Contradiction detection — when recalled memories conflict with each other, CogmemAi flags the contradiction so you catch stale or outdated information before it causes problems
  • Context-aware ranking — tell CogmemAi what you're doing (debugging, planning, reviewing) and it boosts the right types of memories. Debugging? Bug reports and patterns surface first. Planning? Architecture decisions lead
  • Query synthesis — ask a question and get one coherent answer synthesized from all your relevant memories, not just a list of matches. Like asking a teammate who's read everything
  • Cross-project intelligence — patterns that appear across 3+ projects are automatically promoted to global scope. Your best practices follow you everywhere without manual effort
  • Proactive insights — at session start, CogmemAi tells you what you should know before you ask. Stale critical memories, duplicate subjects that need merging, patterns ready for promotion

Also in v3

  • Memory health score — 0-100 score with actionable factors
  • Session replay — pick up exactly where you left off with automatic session summaries
  • Self-tuning memory — importance adjusts based on real usage; stale memories auto-archive
  • Auto-ingest README — learn from your README on new projects instantly
  • Smart recall — relevant memories surface automatically as you switch topics
  • Auto-learning — CogmemAi learns from your sessions automatically
  • Task tracking — persistent tasks with status and priority
  • Correction learning — teach your assistant to avoid repeated mistakes
  • Session reminders — nudges that surface at the start of your next session
  • 30 tools — the most complete memory toolkit for Ai coding assistants

Quick Start

npx cogmemai-mcp setup

That's it. The setup wizard verifies your API key, configures Claude Code, installs automatic context recovery, and you're ready. Start Claude Code by typing claude and your memories are ready.

Don't have an API key yet? Get one free at hifriendbot.com/developer.

The Problem

Every time you start a new session, you lose context. You re-explain your tech stack, your architecture decisions, your coding preferences. Built-in memory in tools like Claude Code is a flat file with no search, no structure, and no intelligence.

CogmemAi gives your Ai assistant a real memory system:

  • Semantic search — finds relevant memories by meaning, not keywords
  • Ai-powered extraction — automatically identifies facts worth remembering from your conversations
  • Smart deduplication — detects duplicate and conflicting memories automatically
  • Privacy controls — auto-detects API keys, tokens, and secrets before storing
  • Document ingestion — feed in READMEs and docs to instantly build project context
  • Project scoping — memories tied to specific repos, plus global preferences that follow you everywhere
  • Smart context — intelligently ranked for maximum relevance to your current work
  • Compaction recovery — survives Claude Code context compaction automatically
  • Token-efficient — compact context loading that won't bloat your conversation
  • Zero setup — no databases, no Docker, no Python, no vector stores

Why Cloud Memory?

Local memory solutions come with maintenance overhead: database management, version conflicts, storage growth, and setup complexity. CogmemAi runs extraction and search server-side. Your MCP server is a thin HTTP client — zero local databases, zero RAM issues, zero maintenance. All memories are encrypted at rest, so your data is just as secure as a local database — with cloud portability and team features on top.

Your memory follows you everywhere. Memories created in Claude Code are instantly available in Cursor, Windsurf, Cline, and any MCP-compatible tool. Switch between Opus, Sonnet, Haiku, or any model your editor supports — your memories persist regardless. New laptop? New OS? Log in and your full project knowledge is waiting. A local SQLite file dies with your machine. Cloud memory is permanent.

Teams and collaboration. Cloud memory is the only way to share project knowledge across teammates. When one developer saves an architecture decision or documents a bug fix, every team member's Ai assistant knows about it instantly. No syncing, no merge conflicts, no stale local databases. Whether it's two developers or twenty, everyone's assistant has the same up-to-date context. This is impossible with local-only memory solutions.

Compaction Recovery

When your Ai assistant compacts your context, conversation history gets compressed and context is lost. CogmemAi handles this automatically — your context is preserved before compaction and seamlessly restored afterward. No re-explaining, no manual prompting.

The npx cogmemai-mcp setup command configures everything automatically.

Skill

CogmemAi includes a Claude Skill that teaches Claude best practices for memory management — when to save, importance scoring, memory types, and session workflows.

Claude Code:

/skill install https://github.com/hifriendbot/cogmemai-mcp/tree/main/skill/cogmemai-memory

Claude.ai: Upload the skill/cogmemai-memory folder in Settings > Skills.

Claude API: Use the Skills API to attach the skill to your requests.

CLI Commands

npx cogmemai-mcp setup          # Interactive setup wizard
npx cogmemai-mcp setup <key>    # Setup with API key
npx cogmemai-mcp verify         # Test connection and show usage
npx cogmemai-mcp --version      # Show installed version
npx cogmemai-mcp help           # Show all commands

Manual Setup

If you prefer to configure manually instead of using npx cogmemai-mcp setup:

Option A — Per project (add .mcp.json to your project root):

{
  "mcpServers": {
    "cogmemai": {
      "command": "cogmemai-mcp",
      "env": {
        "COGMEMAI_API_KEY": "cm_your_api_key_here"
      }
    }
  }
}

Option B — Global (available in every project):

claude mcp add cogmemai cogmemai-mcp -e COGMEMAI_API_KEY=cm_your_api_key_here --scope user

Works With

Claude Code (Recommended)

Automatic setup:

npx cogmemai-mcp setup

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "cogmemai": {
      "command": "npx",
      "args": ["-y", "cogmemai-mcp"],
      "env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
    }
  }
}

Windsurf

Add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "cogmemai": {
      "command": "npx",
      "args": ["-y", "cogmemai-mcp"],
      "env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
    }
  }
}

Cline (VS Code)

Open VS Code Settings > Cline > MCP Servers, add:

{
  "cogmemai": {
    "command": "npx",
    "args": ["-y", "cogmemai-mcp"],
    "env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
  }
}

Continue

Add to ~/.continue/config.yaml:

mcpServers:
  - name: cogmemai
    command: npx
    args: ["-y", "cogmemai-mcp"]
    env:
      COGMEMAI_API_KEY: cm_your_api_key_here

CogmemUI

CogmemUI is a free multi-model Ai workspace with built-in CogmemAi memory. Add your CogmemAi API key in Settings > API Keys and your memory is instantly available. CogmemUI also supports connecting any MCP-compatible tool server via Settings > MCP Servers — add endpoints, auto-discover tools, and use them in chat.

Get your free API key at hifriendbot.com/developer.

Tools

CogmemAi provides 30 tools that your Ai assistant uses automatically:

ToolDescription
save_memoryStore a fact explicitly (architecture decision, preference, etc.)
recall_memoriesSearch memories using natural language (semantic search)
extract_memoriesAi extracts facts from a conversation exchange automatically
get_project_contextLoad top memories at session start (with smart ranking, health score, and session replay)
list_memoriesBrowse memories with filters (paginated, with untyped filter)
update_memoryUpdate content, importance, scope, type, category, subject, and tags
delete_memoryPermanently delete a memory
bulk_deleteDelete up to 100 memories at once
bulk_updateUpdate up to 50 memories at once (content, type, category, tags, etc.)
get_usageCheck your usage stats and tier info
export_memoriesExport all memories as JSON for backup or transfer
import_memoriesBulk import memories from a JSON array
ingest_documentFeed in a document (README, API docs) to auto-extract memories
save_session_summarySave a summary of what was accomplished in this session
list_tagsView all tags in use across your memories
link_memoriesConnect related memories with named relationships
get_memory_linksExplore the knowledge graph around a memory
get_memory_versionsView edit history of a memory
get_analyticsMemory health dashboard with self-tuning insights (filterable by project)
promote_memoryPromote a project memory to global scope
consolidate_memoriesMerge related memories into comprehensive summaries using Ai
save_taskCreate a persistent task with status and priority tracking
get_tasksRetrieve tasks for the current project — pick up where you left off
update_taskChange task status, priority, or description as you work
save_correctionStore a "wrong approach → right approach" pattern to avoid repeated mistakes
set_reminderSet a reminder that surfaces at the start of your next session
get_stale_memoriesFind memories that may be outdated for review or cleanup
get_file_changesSee what files changed since your last session
feedback_memorySignal whether a recalled memory was useful or irrelevant to improve future recall
generate_skillsTrigger skill generation from your corrections and preferences — or preview candidates with dry run

SDKs

Build your own integrations with the CogmemAi API:

  • JavaScript/TypeScript: npm install cogmemai-sdknpm · GitHub
  • Python: pip install cogmemaiPyPI · GitHub

Memory Types

Memories are categorized for better organization and retrieval:

  • identity — Who you are, your role, team
  • preference — Coding style, tool choices, conventions
  • architecture — System design, tech stack, file structure
  • decision — Why you chose X over Y
  • bug — Known issues, fixes, workarounds
  • dependency — Version constraints, package notes
  • pattern — Reusable patterns, conventions
  • context — General project context
  • task — Persistent tasks with status and priority tracking
  • correction — Wrong approach → right approach patterns
  • reminder — Next-session nudges that auto-expire

Scoping

  • Project memories — Architecture, decisions, bugs specific to one repo. Auto-detected from your repository.
  • Global memories — Your coding preferences, identity, tool choices. Available in every project.

Pricing

FreeProTeamEnterprise
Price$0$14.99/mo$39.99/mo$99.99/mo
Memories5002,00010,00050,000
Extractions/mo5002,0005,00020,000
Projects52050200

Start free. Upgrade when you need more. Or pay per operation with USDC on-chain — no credit card required.

Privacy & Security

  • Encryption at rest. All memories are encrypted before they touch the database. Even in a data breach, your data is unreadable.
  • No source code leaves your machine. We store extracted facts (short sentences), never raw code.
  • API keys cryptographically hashed (irreversible) server-side.
  • All traffic over HTTPS.
  • No model training on your data. Ever.
  • Delete everything instantly via dashboard or MCP tool.
  • No cross-user data sharing.

Read our full privacy policy.

Environment Variables

VariableRequiredDescription
COGMEMAI_API_KEYYesYour API key (starts with cm_)
COGMEMAI_API_URLNoCustom API URL (default: hifriendbot.com)

Support

License

MIT — see LICENSE


Built by HiFriendbot — Better Friends, Better Memories, Better Ai.

Related Servers