bulhufas
MCP server that ingests project docs once and lets Claude search by meaning instead of reading everything — saving tokens on large codebases
bulhufas
RAG-powered project management that captures what PM tools miss.
Getting Started · How It Works · API · Self-Host · Contributing
Why "bulhufas"?
Bulhufas is Brazilian Portuguese slang for "zilch", "diddly-squat", "bupkis" — absolutely nothing.
As in: "How much does Claude know about that decision your team made on WhatsApp last Thursday?" Bulhufas.
"What about that blocker someone mentioned in standup?" Bulhufas.
"And that architecture decision from two sprints ago?" You guessed it. Bulhufas.
Now it knows.
Teams make decisions in Slack, WhatsApp, and meetings — then none of it reaches the PM tool. bulhufas captures raw conversations, extracts structured project artifacts (decisions, action items, blockers, scope changes), and makes them searchable via semantic embeddings.
Single binary. No external dependencies. Embeddings run in-process.
Features
- Conversation to Structure — Paste raw chat, get structured chunks: decisions, action items, blockers, requirements, scope changes
- Semantic Search — Find context by meaning, not keywords. "What did we decide about auth?" finds the right chunk even if "auth" isn't in the text
- CRUD on Knowledge — Update status, add context, archive outdated chunks. Your knowledge base stays current
- Single Binary — One Go binary with embedded vector store (chromem-go) and embedding model (hugot/all-MiniLM-L6-v2). No Ollama, no Docker, no external processes
- Self-Hostable — Deploy anywhere: Coolify, Railway, Hetzner, AWS, GCP. Runs on a 2GB VPS
How It Works

You paste a conversation into your AI assistant
|
The LLM extracts structured chunks with metadata
|
bulhufas stores chunks + generates embeddings in-process (hugot)
|
Later: "what's pending from last week?" -> semantic search returns relevant chunks
What Gets Captured
| Chunk Type | Example |
|---|---|
decision | "We chose WebSockets over polling for real-time updates" |
action_item | "Hugo will create read-only DB credentials by Friday" |
blocker | "Can't deploy until the SSL cert is renewed" |
requirement | "Client needs CSV export for the finance report" |
scope_change | "Auth module expanded to include SSO" |
context | "The legacy API returns XML, not JSON" |
research_finding | "pgvector outperforms pinecone for our dataset size" |
status_update | "Payment integration is live in staging" |
Installation
1. Build from source
# Requires Go 1.22+ with CGO enabled
git clone https://github.com/HugoluizMTB/bulhufas.git
cd bulhufas
make build
2. Add to Claude Code
claude mcp add --transport stdio --scope user bulhufas -- /absolute/path/to/bulhufas/bin/bulhufas --mcp
Replace
/absolute/path/towith the actual path where you cloned the repo. Use--scope userto make it available across all your projects. Use--scope projectto restrict it to the current project only.
3. Restart Claude Code and verify
Run /mcp inside Claude Code. You should see bulhufas connected with 6 tools:
| Tool | Description |
|---|---|
save_conversation | Save a conversation with extracted structured chunks |
search | Semantic search across all stored chunks |
list_chunks | List chunks with optional type/status filters |
update_status | Update chunk status by ID |
delete_chunk | Delete a chunk by ID |
list_actions | List all pending action items |
On first run, the embedding model (all-MiniLM-L6-v2, ~80MB) is downloaded automatically to ./data/models/.
Run as HTTP Server (optional)
./bin/bulhufas
Starts an HTTP API on port 8420. Use --mcp flag for MCP stdio mode instead.
Environment Variables
| Variable | Default | Description |
|---|---|---|
PORT | 8420 | Server port |
DATA_DIR | ./data | Persistent storage directory (SQLite db, model files, vector index) |
API
Save a conversation with chunks
curl -X POST http://localhost:8420/api/conversations \
-H "Content-Type: application/json" \
-d '{
"source": "whatsapp",
"summary": "Discussion about database access",
"participants": ["renan", "hugo"],
"chunks": [
{
"content": "Renan needs read-only access to PostgreSQL",
"type": "decision",
"tags": ["infra", "postgres"],
"people": ["renan"],
"status": "pending",
"action_item": "Create read-only credentials"
}
]
}'
Semantic search
curl -X POST http://localhost:8420/api/search \
-H "Content-Type: application/json" \
-d '{"text": "database access", "limit": 5}'
List chunks with filters
curl "http://localhost:8420/api/chunks?type=blocker&status=pending"
List pending action items
curl http://localhost:8420/api/actions
Update chunk status
curl -X PATCH http://localhost:8420/api/chunks/{id}/status \
-H "Content-Type: application/json" \
-d '{"status": "resolved"}'
Delete a chunk
curl -X DELETE http://localhost:8420/api/chunks/{id}
Health check
curl http://localhost:8420/healthz
Architecture
cmd/server/ -> entrypoint, wires everything together
internal/
domain/ -> core types: Conversation, Chunk, WorkItem, Relation
mcp/ -> HTTP server, handlers, request/response logic
store/ -> persistence interface + SQLite implementation
vectorstore/ -> embedded vector search via chromem-go
embedder/ -> in-process embeddings via hugot (all-MiniLM-L6-v2)
scripts/ -> test scripts
web/ -> React dashboard (future)
All external dependencies are behind interfaces. Swap SQLite for Postgres, or chromem-go for pgvector — without touching business logic.
Stack
| Component | Library | Runs in-process? |
|---|---|---|
| Embedding | hugot + all-MiniLM-L6-v2 (384 dim) | Yes |
| Vector store | chromem-go | Yes |
| Database | SQLite (mattn/go-sqlite3) | Yes |
| HTTP server | Go stdlib net/http | Yes |
No Ollama. No Docker. No external databases. One binary.
Self-Hosting
Binary
CGO_ENABLED=1 GOOS=linux go build -o bulhufas ./cmd/server
scp bulhufas your-server:/opt/bulhufas/
ssh your-server '/opt/bulhufas/bulhufas'
Docker
docker build -t bulhufas .
docker run -d --name bulhufas -p 8420:8420 -v bulhufas-data:/data bulhufas
Docker Compose
git clone https://github.com/HugoluizMTB/bulhufas.git
cd bulhufas
docker compose up -d
Works with Coolify, Railway, Hetzner, AWS, GCP, Oracle Cloud — anything that runs Docker.
Roadmap
- Core domain types and interfaces
- HTTP API with save/search/update/delete
- SQLite store implementation
- In-process embeddings via hugot (all-MiniLM-L6-v2)
- chromem-go vector store
- Semantic search with SQLite enrichment
- Action items endpoint
- MCP server protocol (stdio transport via mcp-go)
- Docker image
- React dashboard
- Slack plugin
- Remote MCP via SSE transport
Contributing
See CONTRIBUTING.md for setup instructions, code style, and PR process.
License
Apache License 2.0 — use it freely, even commercially. Patent protection included.
Created by @HugoluizMTB
相關伺服器
Scout Monitoring MCP
贊助Put performance and error data directly in the hands of your AI assistant.
Alpha Vantage MCP Server
贊助Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more
Apktool
A server for Android APK analysis and reverse engineering using Apktool.
Page Design Guide MCP Server
MCP server providing modern web design guidance, color psychology, typography, layout patterns, and UI best practices for AI assistants
Code Assistant
A server for code modification and generation using Large Language Models.
Figma
Integrate Figma design data with AI coding tools using a local MCP server.
Remote MCP Server (Authless)
An example of a remote MCP server deployable on Cloudflare Workers, without authentication.
Sionic AI Serverless RAG
Integrates LLM applications with RAG data sources and tools using the Model Context Protocol.
Puppeteer MCP
MCP server for browser automation via Puppeteer
MCP Proxy Server
Aggregates multiple MCP resource servers into a single interface with stdio/sse support.
VibeCoding System
A conversation-driven development framework for rapid MVP and POC creation.
CRAN Package README MCP Server
Fetch comprehensive information about CRAN packages, including READMEs, metadata, and search functionality.