Open Census MCP Server
Access and query U.S. Census demographic data using natural language.
Open Census MCP Server
Disclaimer
This is an independent, open-source experiment. It is not affiliated with, endorsed by, or sponsored by the U.S. Census Bureau or the Department of Commerce.
Data retrieved through this project remains subject to the terms of the original data providers (e.g., Census API Terms of Service).
What Is This?
An AI-powered statistical consultant for U.S. Census data. Ask questions in plain English, get accurate demographic data with proper statistical context, methodology guidance, and fitness-for-use caveats.
The insight: Census data has a pragmatics problem, not a search problem. Knowing WHICH data to use and HOW to interpret it matters more than finding it. This system encodes statistical consulting expertise into the AI interaction layer.
Status
š¬ Active Research & Rebuild ā v3 architecture in progress. See docs/lessons_learned/ for the v1/v2 journey.
Vision
Census data influences billions in policy decisions, but accessing it effectively requires specialized knowledge. This project aims to make America's most valuable public dataset as easy to use as asking a question ā with the statistical rigor of a professional consultant.
The opportunity: Every city council member, journalist, nonprofit director, and curious citizen should be able to fact-check claims and understand their communities with the same ease an eighth-grader uses a search engine. The data is public. The expertise to use it properly shouldn't be gatekept by technical complexity.
Architecture (v3)
Pure Python MCP server with pragmatic rules engine. No R dependency.
- Pragmatic Rules Layer: Fitness-for-use constraints (MOE thresholds, coverage bias, temporal validity, source selection)
- Census API Integration: Direct Python calls to Census Bureau APIs
- Knowledge Base: Methodology documentation for RAG-enhanced guidance
Details: docs/architecture/ (coming soon)
Project Structure
docs/ # Systems engineering documentation
requirements/ # ConOps, SRS
architecture/ # System architecture
decisions/ # ADRs, trade studies
design/ # Detailed design
verification/ # V&V, evaluation results
lessons_learned/ # Project narrative & lessons
knowledge-base/ # Source docs & pragmatic rules
source-docs/ # Census methodology PDFs (gitignored)
rules/ # Extracted pragmatic rules
methodology/ # Processed methodology content
src/ # MCP server source code
tests/ # Evaluation harness & unit tests
scripts/ # Build & utility scripts
Acknowledgments
- U.S. Census Bureau ā for collecting and maintaining vital public data
- Kyle Walker ā Analyzing US Census Data textbook as knowledge base source
- Anthropic ā Model Context Protocol enabling AI tool integration
Contributing
Contributions welcome, especially:
- Domain expertise from Census data veterans
- Statistical methodology review
- Evaluation test cases (real-world query scenarios)
License
MIT License - see LICENSE file for details.
Related Servers
Drug Gene Interaction Database (DGIdb)
A bridge to the Drug Gene Interaction Database (DGIdb) API, enabling AI clients to query drug-gene interaction data.
MCP OpenDART
Access financial data from Korea's OpenDART (Data Analysis, Retrieval and Transfer System) for AI language models.
Momento MCP Server
An MCP server providing a simple interface to Momento's serverless caching service.
Octagon
Deliver real-time investment research with extensive private and public market data.
Adobe Commerce MCP Server by CData
A read-only MCP server for Adobe Commerce, enabling LLMs to query live data using the CData JDBC driver.
Database
Database MCP server for MySQL, MariaDB, PostgreSQL & SQLite
Self-Hosted Supabase MCP Server
Interact with self-hosted Supabase instances for database management and introspection.
MongoDB MCP Server
A server for interacting with MongoDB databases and MongoDB Atlas.
CData Sync
A Model Context Protocol server for CData Sync, enabling data replication and transformation.
DART MCP Server
Access corporate disclosure information, financial data, and reports from the Korean electronic disclosure system (DART) API.