bulhufas
MCP server that ingests project docs once and lets Claude search by meaning instead of reading everything — saving tokens on large codebases
bulhufas
RAG-powered project management that captures what PM tools miss.
Getting Started · How It Works · API · Self-Host · Contributing
Why "bulhufas"?
Bulhufas is Brazilian Portuguese slang for "zilch", "diddly-squat", "bupkis" — absolutely nothing.
As in: "How much does Claude know about that decision your team made on WhatsApp last Thursday?" Bulhufas.
"What about that blocker someone mentioned in standup?" Bulhufas.
"And that architecture decision from two sprints ago?" You guessed it. Bulhufas.
Now it knows.
Teams make decisions in Slack, WhatsApp, and meetings — then none of it reaches the PM tool. bulhufas captures raw conversations, extracts structured project artifacts (decisions, action items, blockers, scope changes), and makes them searchable via semantic embeddings.
Single binary. No external dependencies. Embeddings run in-process.
Features
- Conversation to Structure — Paste raw chat, get structured chunks: decisions, action items, blockers, requirements, scope changes
- Semantic Search — Find context by meaning, not keywords. "What did we decide about auth?" finds the right chunk even if "auth" isn't in the text
- CRUD on Knowledge — Update status, add context, archive outdated chunks. Your knowledge base stays current
- Single Binary — One Go binary with embedded vector store (chromem-go) and embedding model (hugot/all-MiniLM-L6-v2). No Ollama, no Docker, no external processes
- Self-Hostable — Deploy anywhere: Coolify, Railway, Hetzner, AWS, GCP. Runs on a 2GB VPS
How It Works

You paste a conversation into your AI assistant
|
The LLM extracts structured chunks with metadata
|
bulhufas stores chunks + generates embeddings in-process (hugot)
|
Later: "what's pending from last week?" -> semantic search returns relevant chunks
What Gets Captured
| Chunk Type | Example |
|---|---|
decision | "We chose WebSockets over polling for real-time updates" |
action_item | "Hugo will create read-only DB credentials by Friday" |
blocker | "Can't deploy until the SSL cert is renewed" |
requirement | "Client needs CSV export for the finance report" |
scope_change | "Auth module expanded to include SSO" |
context | "The legacy API returns XML, not JSON" |
research_finding | "pgvector outperforms pinecone for our dataset size" |
status_update | "Payment integration is live in staging" |
Installation
1. Build from source
# Requires Go 1.22+ with CGO enabled
git clone https://github.com/HugoluizMTB/bulhufas.git
cd bulhufas
make build
2. Add to Claude Code
claude mcp add --transport stdio --scope user bulhufas -- /absolute/path/to/bulhufas/bin/bulhufas --mcp
Replace
/absolute/path/towith the actual path where you cloned the repo. Use--scope userto make it available across all your projects. Use--scope projectto restrict it to the current project only.
3. Restart Claude Code and verify
Run /mcp inside Claude Code. You should see bulhufas connected with 6 tools:
| Tool | Description |
|---|---|
save_conversation | Save a conversation with extracted structured chunks |
search | Semantic search across all stored chunks |
list_chunks | List chunks with optional type/status filters |
update_status | Update chunk status by ID |
delete_chunk | Delete a chunk by ID |
list_actions | List all pending action items |
On first run, the embedding model (all-MiniLM-L6-v2, ~80MB) is downloaded automatically to ./data/models/.
Run as HTTP Server (optional)
./bin/bulhufas
Starts an HTTP API on port 8420. Use --mcp flag for MCP stdio mode instead.
Environment Variables
| Variable | Default | Description |
|---|---|---|
PORT | 8420 | Server port |
DATA_DIR | ./data | Persistent storage directory (SQLite db, model files, vector index) |
API
Save a conversation with chunks
curl -X POST http://localhost:8420/api/conversations \
-H "Content-Type: application/json" \
-d '{
"source": "whatsapp",
"summary": "Discussion about database access",
"participants": ["renan", "hugo"],
"chunks": [
{
"content": "Renan needs read-only access to PostgreSQL",
"type": "decision",
"tags": ["infra", "postgres"],
"people": ["renan"],
"status": "pending",
"action_item": "Create read-only credentials"
}
]
}'
Semantic search
curl -X POST http://localhost:8420/api/search \
-H "Content-Type: application/json" \
-d '{"text": "database access", "limit": 5}'
List chunks with filters
curl "http://localhost:8420/api/chunks?type=blocker&status=pending"
List pending action items
curl http://localhost:8420/api/actions
Update chunk status
curl -X PATCH http://localhost:8420/api/chunks/{id}/status \
-H "Content-Type: application/json" \
-d '{"status": "resolved"}'
Delete a chunk
curl -X DELETE http://localhost:8420/api/chunks/{id}
Health check
curl http://localhost:8420/healthz
Architecture
cmd/server/ -> entrypoint, wires everything together
internal/
domain/ -> core types: Conversation, Chunk, WorkItem, Relation
mcp/ -> HTTP server, handlers, request/response logic
store/ -> persistence interface + SQLite implementation
vectorstore/ -> embedded vector search via chromem-go
embedder/ -> in-process embeddings via hugot (all-MiniLM-L6-v2)
scripts/ -> test scripts
web/ -> React dashboard (future)
All external dependencies are behind interfaces. Swap SQLite for Postgres, or chromem-go for pgvector — without touching business logic.
Stack
| Component | Library | Runs in-process? |
|---|---|---|
| Embedding | hugot + all-MiniLM-L6-v2 (384 dim) | Yes |
| Vector store | chromem-go | Yes |
| Database | SQLite (mattn/go-sqlite3) | Yes |
| HTTP server | Go stdlib net/http | Yes |
No Ollama. No Docker. No external databases. One binary.
Self-Hosting
Binary
CGO_ENABLED=1 GOOS=linux go build -o bulhufas ./cmd/server
scp bulhufas your-server:/opt/bulhufas/
ssh your-server '/opt/bulhufas/bulhufas'
Docker
docker build -t bulhufas .
docker run -d --name bulhufas -p 8420:8420 -v bulhufas-data:/data bulhufas
Docker Compose
git clone https://github.com/HugoluizMTB/bulhufas.git
cd bulhufas
docker compose up -d
Works with Coolify, Railway, Hetzner, AWS, GCP, Oracle Cloud — anything that runs Docker.
Roadmap
- Core domain types and interfaces
- HTTP API with save/search/update/delete
- SQLite store implementation
- In-process embeddings via hugot (all-MiniLM-L6-v2)
- chromem-go vector store
- Semantic search with SQLite enrichment
- Action items endpoint
- MCP server protocol (stdio transport via mcp-go)
- Docker image
- React dashboard
- Slack plugin
- Remote MCP via SSE transport
Contributing
See CONTRIBUTING.md for setup instructions, code style, and PR process.
License
Apache License 2.0 — use it freely, even commercially. Patent protection included.
Created by @HugoluizMTB
Serveurs connexes
Alpha Vantage MCP Server
sponsorAccess financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more
Pathmode
Build structured intent specs through Socratic AI conversation. Describe a problem or paste a support ticket — Claude challenges vague thinking, asks pointed questions, and builds a structured spec. Exports as intent.md, .cursorrules, or CLAUDE.md.
Photon
A TypeScript framework that turns a single class into an MCP server, CLI tool, and web dashboard with a marketplace of 35 ready-made photons.
third-eye-mcp
Privacy-first screen capture MCP server for AI coding agents. Let Claude, Cursor, or any MCP-compatible AI see your screen with full control.
dr-mcp
Clean and audit messy MCP setups: stale servers, unused tools, context-heavy MCPs, outdated packages, and safe repair plans.
Petclinic
Interacts with the Swagger Petstore API using Petclinic v3 APIs, exposing tools for OpenAI models.
FileForge
Transforms raw code into polished solutions with optimized performance and vector embeddings support.
seite
AI-native static site generator with built-in MCP server. Build sites, create content, apply themes, search docs, and deploy via Claude Code or any MCP client.
Gemini Imagen 3.0
Generate high-quality images using Google's Imagen 3.0 model via the Gemini API.
Contentrain MCP
Extract, govern, and ship structured content from your codebase.
Bevy BRP MCP
Control, inspect, and mutate Bevy applications with AI coding assistants via the Bevy Remote Protocol (BRP).