Open Census MCP Server
Access and query U.S. Census demographic data using natural language.
Open Census MCP Server
Disclaimer
This is an independent, open-source experiment. It is not affiliated with, endorsed by, or sponsored by the U.S. Census Bureau or the Department of Commerce.
Data retrieved through this project remains subject to the terms of the original data providers (e.g., Census API Terms of Service).
What Is This?
An AI-powered statistical consultant for U.S. Census data. Ask questions in plain English, get accurate demographic data with proper statistical context, methodology guidance, and fitness-for-use caveats.
The insight: Census data has a pragmatics problem, not a search problem. Knowing WHICH data to use and HOW to interpret it matters more than finding it. This system encodes statistical consulting expertise into the AI interaction layer.
Status
🔬 Active Research & Rebuild — v3 architecture in progress. See docs/lessons_learned/ for the v1/v2 journey.
Vision
Census data influences billions in policy decisions, but accessing it effectively requires specialized knowledge. This project aims to make America's most valuable public dataset as easy to use as asking a question — with the statistical rigor of a professional consultant.
The opportunity: Every city council member, journalist, nonprofit director, and curious citizen should be able to fact-check claims and understand their communities with the same ease an eighth-grader uses a search engine. The data is public. The expertise to use it properly shouldn't be gatekept by technical complexity.
Architecture (v3)
Pure Python MCP server with pragmatic rules engine. No R dependency.
- Pragmatic Rules Layer: Fitness-for-use constraints (MOE thresholds, coverage bias, temporal validity, source selection)
- Census API Integration: Direct Python calls to Census Bureau APIs
- Knowledge Base: Methodology documentation for RAG-enhanced guidance
Details: docs/architecture/ (coming soon)
Project Structure
docs/ # Systems engineering documentation
requirements/ # ConOps, SRS
architecture/ # System architecture
decisions/ # ADRs, trade studies
design/ # Detailed design
verification/ # V&V, evaluation results
lessons_learned/ # Project narrative & lessons
knowledge-base/ # Source docs & pragmatic rules
source-docs/ # Census methodology PDFs (gitignored)
rules/ # Extracted pragmatic rules
methodology/ # Processed methodology content
src/ # MCP server source code
tests/ # Evaluation harness & unit tests
scripts/ # Build & utility scripts
Acknowledgments
- U.S. Census Bureau — for collecting and maintaining vital public data
- Kyle Walker — Analyzing US Census Data textbook as knowledge base source
- Anthropic — Model Context Protocol enabling AI tool integration
Contributing
Contributions welcome, especially:
- Domain expertise from Census data veterans
- Statistical methodology review
- Evaluation test cases (real-world query scenarios)
License
MIT License - see LICENSE file for details.
Related Servers
Unofficial UniProt MCP Server
Access the UniProt protein database with specialized bioinformatics tools for protein research, comparative genomics, and structural biology.
MongoDB MCP Server
A server for interacting with MongoDB databases and MongoDB Atlas.
OrionBelt Semantic Layer
API-first engine and MCP server that transforms declarative YAML model definitions into optimized SQL for Postgres, Snowflake, ClickHouse, Dremio, and Databricks
Schema Search
In-memory natural language schema search over database schemas
Memory-Plus
a lightweight, local RAG memory store to record, retrieve, update, delete, and visualize persistent "memories" across sessions—perfect for developers working with multiple AI coders (like Windsurf, Cursor, or Copilot) or anyone who wants their AI to actually remember them.
Supabase Coolify MCP Server
Comprehensive MCP server for managing self-hosted Supabase on Coolify with full deployment, migrations, edge functions, and rollback support.
Global Database
Access comprehensive company data including financial records, ownership structures, and contact information. Search for businesses using domains, registration numbers, or LinkedIn profiles to streamline due diligence and lead generation. Retrieve historical financial performance and complex corporate group structures to support informed business analysis.
PyAirbyte
An AI-powered server that generates PyAirbyte pipeline code and instructions using OpenAI and connector documentation.
HubDB MCP Server by CData
A read-only MCP server by CData that enables LLMs to query live data from HubDB.
Amela MCP Memory Tool
A local, high-performance memory server for AI agents, built with SQLite, vector embeddings, and a knowledge graph. Packaged for npm and Docker.