Open Census MCP Server
Access and query U.S. Census demographic data using natural language.
Open Census MCP Server
Disclaimer
This is an independent, open-source experiment. It is not affiliated with, endorsed by, or sponsored by the U.S. Census Bureau or the Department of Commerce.
Data retrieved through this project remains subject to the terms of the original data providers (e.g., Census API Terms of Service).
What Is This?
An AI-powered statistical consultant for U.S. Census data. Ask questions in plain English, get accurate demographic data with proper statistical context, methodology guidance, and fitness-for-use caveats.
The insight: Census data has a pragmatics problem, not a search problem. Knowing WHICH data to use and HOW to interpret it matters more than finding it. This system encodes statistical consulting expertise into the AI interaction layer.
Status
🔬 Active Research & Rebuild — v3 architecture in progress. See docs/lessons_learned/ for the v1/v2 journey.
Vision
Census data influences billions in policy decisions, but accessing it effectively requires specialized knowledge. This project aims to make America's most valuable public dataset as easy to use as asking a question — with the statistical rigor of a professional consultant.
The opportunity: Every city council member, journalist, nonprofit director, and curious citizen should be able to fact-check claims and understand their communities with the same ease an eighth-grader uses a search engine. The data is public. The expertise to use it properly shouldn't be gatekept by technical complexity.
Architecture (v3)
Pure Python MCP server with pragmatic rules engine. No R dependency.
- Pragmatic Rules Layer: Fitness-for-use constraints (MOE thresholds, coverage bias, temporal validity, source selection)
- Census API Integration: Direct Python calls to Census Bureau APIs
- Knowledge Base: Methodology documentation for RAG-enhanced guidance
Details: docs/architecture/ (coming soon)
Project Structure
docs/ # Systems engineering documentation
requirements/ # ConOps, SRS
architecture/ # System architecture
decisions/ # ADRs, trade studies
design/ # Detailed design
verification/ # V&V, evaluation results
lessons_learned/ # Project narrative & lessons
knowledge-base/ # Source docs & pragmatic rules
source-docs/ # Census methodology PDFs (gitignored)
rules/ # Extracted pragmatic rules
methodology/ # Processed methodology content
src/ # MCP server source code
tests/ # Evaluation harness & unit tests
scripts/ # Build & utility scripts
Acknowledgments
- U.S. Census Bureau — for collecting and maintaining vital public data
- Kyle Walker — Analyzing US Census Data textbook as knowledge base source
- Anthropic — Model Context Protocol enabling AI tool integration
Contributing
Contributions welcome, especially:
- Domain expertise from Census data veterans
- Statistical methodology review
- Evaluation test cases (real-world query scenarios)
License
MIT License - see LICENSE file for details.
Servidores relacionados
Eugene Intelligence
Financial data for AI agents. SEC XBRL fundamentals, insider trades, 13F holdings, treasury yields. Source-traced.
Engram MCP Server
Engram is a hosted MCP server that provides reliable memory for AI agents:
ChromaDB
Provides AI assistants with persistent memory using ChromaDB vector storage.
claude-memory-fts
Long-term memory mcp server with sqlite fts5 full-text search, bm25 ranking, and access tracking. zero config via npx.com
Macrostrat
Access geologic data from the Macrostrat API, including units, columns, minerals, and timescales.
OpenAlex Author Disambiguation
Disambiguate authors and resolve institutions using the OpenAlex.org API.
Fedspeak MCP Server
Access and analyze Federal Reserve (FOMC) statements.
CData Reckon Accounts Hosted
A read-only MCP server by CData that enables LLMs to query live data from Reckon Accounts Hosted. Requires a separate CData JDBC Driver.
Memento MCP
A scalable knowledge graph memory system for LLMs with semantic retrieval and temporal awareness, using Neo4j as a backend.
mem0-mcp-selfhosted
Self-hosted mem0 MCP server for Claude Code. Run a complete memory server against self-hosted Qdrant + Neo4j + Ollama while using Claude as the main LLM.