CogmemAi
Persistent cognitive memory for Claude Code. Cloud-based semantic search, Ai-powered extraction, project scoping, and compaction recovery.
CogmemAi — Cognitive Memory for Ai Coding Assistants
Your Ai coding assistant forgets everything between sessions. CogmemAi fixes that.
One command. Your assistant remembers your architecture, patterns, decisions, bugs, and preferences — permanently. Works with Claude Code, Cursor, Windsurf, Cline, Continue, and any MCP-compatible tool. Switch editors, switch models, switch machines — your knowledge stays.
What's New — Intelligence Engine + Auto-Skills (v3.5)
CogmemAi now gets smarter every time you use it. The Intelligence Engine is a self-improving memory system that learns what matters, connects related knowledge automatically, and synthesizes answers from your entire memory. Auto-Skills takes it further — CogmemAi doesn't just remember, it learns how to behave.
Auto-Skills (Closed-Loop Learning)
- Behavioral skills — CogmemAi automatically synthesizes your corrections, preferences, and patterns into behavioral directives that tell your Ai assistant HOW to work, not just what to know
- Closed learning loop — correct your assistant once, and CogmemAi detects the pattern. After enough evidence accumulates, it generates a skill that prevents the mistake from ever happening again
- Confidence tracking — each skill has a confidence score that rises when it works and drops when it doesn't. Low-confidence skills are automatically retired
- Self-evaluation — skills periodically review themselves against new evidence and adapt, strengthen, or retire as your practices evolve
Intelligence Engine
- Self-improving recall — memories that consistently help you rank higher over time; memories you never use fade naturally. Your recall quality improves automatically with every session
- Auto-linking knowledge graph — related memories are automatically connected when you save them. Your knowledge builds into a web of relationships, not a flat list
- Contradiction detection — when recalled memories conflict with each other, CogmemAi flags the contradiction so you catch stale or outdated information before it causes problems
- Context-aware ranking — tell CogmemAi what you're doing (debugging, planning, reviewing) and it boosts the right types of memories. Debugging? Bug reports and patterns surface first. Planning? Architecture decisions lead
- Query synthesis — ask a question and get one coherent answer synthesized from all your relevant memories, not just a list of matches. Like asking a teammate who's read everything
- Cross-project intelligence — patterns that appear across 3+ projects are automatically promoted to global scope. Your best practices follow you everywhere without manual effort
- Proactive insights — at session start, CogmemAi tells you what you should know before you ask. Stale critical memories, duplicate subjects that need merging, patterns ready for promotion
Also in v3
- Memory health score — 0-100 score with actionable factors
- Session replay — pick up exactly where you left off with automatic session summaries
- Self-tuning memory — importance adjusts based on real usage; stale memories auto-archive
- Auto-ingest README — learn from your README on new projects instantly
- Smart recall — relevant memories surface automatically as you switch topics
- Auto-learning — CogmemAi learns from your sessions automatically
- Task tracking — persistent tasks with status and priority
- Correction learning — teach your assistant to avoid repeated mistakes
- Session reminders — nudges that surface at the start of your next session
- 30 tools — the most complete memory toolkit for Ai coding assistants
Quick Start
npx cogmemai-mcp setup
That's it. The setup wizard verifies your API key, configures Claude Code, installs automatic context recovery, and you're ready. Start Claude Code by typing claude and your memories are ready.
Don't have an API key yet? Get one free at hifriendbot.com/developer.
The Problem
Every time you start a new session, you lose context. You re-explain your tech stack, your architecture decisions, your coding preferences. Built-in memory in tools like Claude Code is a flat file with no search, no structure, and no intelligence.
CogmemAi gives your Ai assistant a real memory system:
- Semantic search — finds relevant memories by meaning, not keywords
- Ai-powered extraction — automatically identifies facts worth remembering from your conversations
- Smart deduplication — detects duplicate and conflicting memories automatically
- Privacy controls — auto-detects API keys, tokens, and secrets before storing
- Document ingestion — feed in READMEs and docs to instantly build project context
- Project scoping — memories tied to specific repos, plus global preferences that follow you everywhere
- Smart context — intelligently ranked for maximum relevance to your current work
- Compaction recovery — survives Claude Code context compaction automatically
- Token-efficient — compact context loading that won't bloat your conversation
- Zero setup — no databases, no Docker, no Python, no vector stores
Why Cloud Memory?
Local memory solutions come with maintenance overhead: database management, version conflicts, storage growth, and setup complexity. CogmemAi runs extraction and search server-side. Your MCP server is a thin HTTP client — zero local databases, zero RAM issues, zero maintenance. All memories are encrypted at rest, so your data is just as secure as a local database — with cloud portability and team features on top.
Your memory follows you everywhere. Memories created in Claude Code are instantly available in Cursor, Windsurf, Cline, and any MCP-compatible tool. Switch between Opus, Sonnet, Haiku, or any model your editor supports — your memories persist regardless. New laptop? New OS? Log in and your full project knowledge is waiting. A local SQLite file dies with your machine. Cloud memory is permanent.
Teams and collaboration. Cloud memory is the only way to share project knowledge across teammates. When one developer saves an architecture decision or documents a bug fix, every team member's Ai assistant knows about it instantly. No syncing, no merge conflicts, no stale local databases. Whether it's two developers or twenty, everyone's assistant has the same up-to-date context. This is impossible with local-only memory solutions.
Compaction Recovery
When your Ai assistant compacts your context, conversation history gets compressed and context is lost. CogmemAi handles this automatically — your context is preserved before compaction and seamlessly restored afterward. No re-explaining, no manual prompting.
The npx cogmemai-mcp setup command configures everything automatically.
Skill
CogmemAi includes a Claude Skill that teaches Claude best practices for memory management — when to save, importance scoring, memory types, and session workflows.
Claude Code:
/skill install https://github.com/hifriendbot/cogmemai-mcp/tree/main/skill/cogmemai-memory
Claude.ai: Upload the skill/cogmemai-memory folder in Settings > Skills.
Claude API: Use the Skills API to attach the skill to your requests.
CLI Commands
npx cogmemai-mcp setup # Interactive setup wizard
npx cogmemai-mcp setup <key> # Setup with API key
npx cogmemai-mcp verify # Test connection and show usage
npx cogmemai-mcp --version # Show installed version
npx cogmemai-mcp help # Show all commands
Manual Setup
If you prefer to configure manually instead of using npx cogmemai-mcp setup:
Option A — Per project (add .mcp.json to your project root):
{
"mcpServers": {
"cogmemai": {
"command": "cogmemai-mcp",
"env": {
"COGMEMAI_API_KEY": "cm_your_api_key_here"
}
}
}
}
Option B — Global (available in every project):
claude mcp add cogmemai cogmemai-mcp -e COGMEMAI_API_KEY=cm_your_api_key_here --scope user
Works With
Claude Code (Recommended)
Automatic setup:
npx cogmemai-mcp setup
Cursor
Add to ~/.cursor/mcp.json:
{
"mcpServers": {
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
}
Windsurf
Add to ~/.codeium/windsurf/mcp_config.json:
{
"mcpServers": {
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
}
Cline (VS Code)
Open VS Code Settings > Cline > MCP Servers, add:
{
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
Continue
Add to ~/.continue/config.yaml:
mcpServers:
- name: cogmemai
command: npx
args: ["-y", "cogmemai-mcp"]
env:
COGMEMAI_API_KEY: cm_your_api_key_here
CogmemUI
CogmemUI is a free multi-model Ai workspace with built-in CogmemAi memory. Add your CogmemAi API key in Settings > API Keys and your memory is instantly available. CogmemUI also supports connecting any MCP-compatible tool server via Settings > MCP Servers — add endpoints, auto-discover tools, and use them in chat.
Get your free API key at hifriendbot.com/developer.
Tools
CogmemAi provides 30 tools that your Ai assistant uses automatically:
| Tool | Description |
|---|---|
save_memory | Store a fact explicitly (architecture decision, preference, etc.) |
recall_memories | Search memories using natural language (semantic search) |
extract_memories | Ai extracts facts from a conversation exchange automatically |
get_project_context | Load top memories at session start (with smart ranking, health score, and session replay) |
list_memories | Browse memories with filters (paginated, with untyped filter) |
update_memory | Update content, importance, scope, type, category, subject, and tags |
delete_memory | Permanently delete a memory |
bulk_delete | Delete up to 100 memories at once |
bulk_update | Update up to 50 memories at once (content, type, category, tags, etc.) |
get_usage | Check your usage stats and tier info |
export_memories | Export all memories as JSON for backup or transfer |
import_memories | Bulk import memories from a JSON array |
ingest_document | Feed in a document (README, API docs) to auto-extract memories |
save_session_summary | Save a summary of what was accomplished in this session |
list_tags | View all tags in use across your memories |
link_memories | Connect related memories with named relationships |
get_memory_links | Explore the knowledge graph around a memory |
get_memory_versions | View edit history of a memory |
get_analytics | Memory health dashboard with self-tuning insights (filterable by project) |
promote_memory | Promote a project memory to global scope |
consolidate_memories | Merge related memories into comprehensive summaries using Ai |
save_task | Create a persistent task with status and priority tracking |
get_tasks | Retrieve tasks for the current project — pick up where you left off |
update_task | Change task status, priority, or description as you work |
save_correction | Store a "wrong approach → right approach" pattern to avoid repeated mistakes |
set_reminder | Set a reminder that surfaces at the start of your next session |
get_stale_memories | Find memories that may be outdated for review or cleanup |
get_file_changes | See what files changed since your last session |
feedback_memory | Signal whether a recalled memory was useful or irrelevant to improve future recall |
generate_skills | Trigger skill generation from your corrections and preferences — or preview candidates with dry run |
SDKs
Build your own integrations with the CogmemAi API:
- JavaScript/TypeScript:
npm install cogmemai-sdk— npm · GitHub - Python:
pip install cogmemai— PyPI · GitHub
Memory Types
Memories are categorized for better organization and retrieval:
- identity — Who you are, your role, team
- preference — Coding style, tool choices, conventions
- architecture — System design, tech stack, file structure
- decision — Why you chose X over Y
- bug — Known issues, fixes, workarounds
- dependency — Version constraints, package notes
- pattern — Reusable patterns, conventions
- context — General project context
- task — Persistent tasks with status and priority tracking
- correction — Wrong approach → right approach patterns
- reminder — Next-session nudges that auto-expire
Scoping
- Project memories — Architecture, decisions, bugs specific to one repo. Auto-detected from your repository.
- Global memories — Your coding preferences, identity, tool choices. Available in every project.
Pricing
| Free | Pro | Team | Enterprise | |
|---|---|---|---|---|
| Price | $0 | $14.99/mo | $39.99/mo | $99.99/mo |
| Memories | 500 | 2,000 | 10,000 | 50,000 |
| Extractions/mo | 500 | 2,000 | 5,000 | 20,000 |
| Projects | 5 | 20 | 50 | 200 |
Start free. Upgrade when you need more. Or pay per operation with USDC on-chain — no credit card required.
Privacy & Security
- Encryption at rest. All memories are encrypted before they touch the database. Even in a data breach, your data is unreadable.
- No source code leaves your machine. We store extracted facts (short sentences), never raw code.
- API keys cryptographically hashed (irreversible) server-side.
- All traffic over HTTPS.
- No model training on your data. Ever.
- Delete everything instantly via dashboard or MCP tool.
- No cross-user data sharing.
Read our full privacy policy.
Environment Variables
| Variable | Required | Description |
|---|---|---|
COGMEMAI_API_KEY | Yes | Your API key (starts with cm_) |
COGMEMAI_API_URL | No | Custom API URL (default: hifriendbot.com) |
Support
- Issues: GitHub Issues
- Docs: hifriendbot.com/developer
License
MIT — see LICENSE
Built by HiFriendbot — Better Friends, Better Memories, Better Ai.
Related Servers
MemOS
MemOS (Memory Operating System) is a memory management operating system designed for AI applications.
Weather MCP Server
Provides current weather data and allows for city comparisons.
Secure Ubuntu MCP Server
A security-focused MCP server for performing safe operations on an Ubuntu system, featuring robust security controls and audit logging.
Binance MCP Server
Interact with the Binance API to view portfolios, convert tokens, and execute trades with minimal market impact.
Earthdata MCP Server
Interact with NASA Earth Data for efficient dataset discovery and retrieval for geospatial analysis.
YouTube MCP
Manage YouTube videos, create Shorts, and get analytics using the YouTube API.
Amazon Product Advertising API
Integrates with the Amazon Product Advertising API to search for products.
Grok MCP Plugin
Integrate with the Grok AI API to access its powerful language models.
LlamaCloud
Connect to and manage data indexes on the LlamaCloud platform.
CMP MCP Server
An MCP server for CMP API integration, built to run on Cloudflare Workers.