BioMCP
Connects AI assistants to authoritative biomedical data sources like PubMed and ClinicalTrials.gov, enabling natural language queries.
BioMCP: Biomedical Model Context Protocol
BioMCP is an open source (MIT License) toolkit that empowers AI assistants and agents with specialized biomedical knowledge. Built following the Model Context Protocol (MCP), it connects AI systems to authoritative biomedical data sources, enabling them to answer questions about clinical trials, scientific literature, and genomic variants with precision and depth.
MCPHub Certification
BioMCP is certified by MCPHub. This certification ensures that BioMCP follows best practices for Model Context Protocol implementation and provides reliable biomedical data access.
Why BioMCP?
While Large Language Models have broad general knowledge, they often lack specialized domain-specific information or access to up-to-date resources. BioMCP bridges this gap for biomedicine by:
- Providing structured access to clinical trials, biomedical literature, and genomic variants
- Enabling natural language queries to specialized databases without requiring knowledge of their specific syntax
- Supporting biomedical research workflows through a consistent interface
- Functioning as an MCP server for AI assistants and agents
Biomedical Data Sources
BioMCP integrates with multiple biomedical data sources:
Literature Sources
- PubTator3/PubMed - Peer-reviewed biomedical literature with entity annotations
- bioRxiv/medRxiv - Preprint servers for biology and health sciences
- Europe PMC - Open science platform including preprints
Clinical & Genomic Sources
- ClinicalTrials.gov - Clinical trial registry and results database
- NCI Clinical Trials Search API - National Cancer Institute's curated cancer trials database
- Advanced search filters (biomarkers, prior therapies, brain metastases)
- Organization and intervention databases
- Disease vocabulary with synonyms
- BioThings Suite - Comprehensive biomedical data APIs:
- MyVariant.info - Consolidated genetic variant annotation
- MyGene.info - Real-time gene annotations and information
- MyDisease.info - Disease ontology and synonym information
- MyChem.info - Drug/chemical annotations and properties
- TCGA/GDC - The Cancer Genome Atlas for cancer variant data
- 1000 Genomes - Population frequency data via Ensembl
- cBioPortal - Cancer genomics portal with mutation occurrence data
Regulatory & Safety Sources
- OpenFDA - FDA regulatory and safety data:
- Drug Adverse Events (FAERS) - Post-market drug safety reports
- Drug Labels (SPL) - Official prescribing information
- Device Events (MAUDE) - Medical device adverse events, with genomic device filtering
Available MCP Tools
BioMCP provides 24 specialized tools for biomedical research:
Core Tools (3)
1. Think Tool (ALWAYS USE FIRST!)
CRITICAL: The think
tool MUST be your first step for ANY biomedical research task.
# Start analysis with sequential thinking
think(
thought="Breaking down the query about BRAF mutations in melanoma...",
thoughtNumber=1,
totalThoughts=3,
nextThoughtNeeded=True
)
The sequential thinking tool helps:
- Break down complex biomedical problems systematically
- Plan multi-step research approaches
- Track reasoning progress
- Ensure comprehensive analysis
2. Search Tool
The search tool supports two modes:
Unified Query Language (Recommended)
Use the query
parameter with structured field syntax for powerful cross-domain searches:
# Simple natural language
search(query="BRAF melanoma")
# Field-specific search
search(query="gene:BRAF AND trials.condition:melanoma")
# Complex queries
search(query="gene:BRAF AND variants.significance:pathogenic AND articles.date:>2023")
# Get searchable fields schema
search(get_schema=True)
# Explain how a query is parsed
search(query="gene:BRAF", explain_query=True)
Supported Fields:
- Cross-domain:
gene:
,variant:
,disease:
- Trials:
trials.condition:
,trials.phase:
,trials.status:
,trials.intervention:
- Articles:
articles.author:
,articles.journal:
,articles.date:
- Variants:
variants.significance:
,variants.rsid:
,variants.frequency:
Domain-Based Search
Use the domain
parameter with specific filters:
# Search articles (includes automatic cBioPortal integration)
search(domain="article", genes=["BRAF"], diseases=["melanoma"])
# Search with mutation-specific cBioPortal data
search(domain="article", genes=["BRAF"], keywords=["V600E"])
search(domain="article", genes=["SRSF2"], keywords=["F57*"]) # Wildcard patterns
# Search trials
search(domain="trial", conditions=["lung cancer"], phase="3")
# Search variants
search(domain="variant", gene="TP53", significance="pathogenic")
Note: When searching articles with a gene parameter, cBioPortal data is automatically included:
- Gene-level summaries show mutation frequency across cancer studies
- Mutation-specific searches (e.g., "V600E") show study-level occurrence data
- Cancer types are dynamically resolved from cBioPortal API
3. Fetch Tool
Retrieve full details for a single article, trial, or variant:
# Fetch article details (supports both PMID and DOI)
fetch(domain="article", id="34567890") # PMID
fetch(domain="article", id="10.1101/2024.01.20.23288905") # DOI
# Fetch trial with all sections
fetch(domain="trial", id="NCT04280705", detail="all")
# Fetch variant details
fetch(domain="variant", id="rs113488022")
Domain-specific options:
- Articles:
detail="full"
retrieves full text if available - Trials:
detail
can be "protocol", "locations", "outcomes", "references", or "all" - Variants: Always returns full details
Individual Tools (21)
For users who prefer direct access to specific functionality, BioMCP also provides 21 individual tools:
Article Tools (2)
- article_searcher: Search PubMed/PubTator3 and preprints
- article_getter: Fetch detailed article information (supports PMID and DOI)
Trial Tools (5)
- trial_searcher: Search ClinicalTrials.gov or NCI CTS API (via source parameter)
- trial_getter: Fetch all trial details from either source
- trial_protocol_getter: Fetch protocol information only (ClinicalTrials.gov)
- trial_references_getter: Fetch trial publications (ClinicalTrials.gov)
- trial_outcomes_getter: Fetch outcome measures and results (ClinicalTrials.gov)
- trial_locations_getter: Fetch site locations and contacts (ClinicalTrials.gov)
Variant Tools (2)
- variant_searcher: Search MyVariant.info database
- variant_getter: Fetch comprehensive variant details
NCI-Specific Tools (6)
- nci_organization_searcher: Search NCI's organization database
- nci_organization_getter: Get organization details by ID
- nci_intervention_searcher: Search NCI's intervention database (drugs, devices, procedures)
- nci_intervention_getter: Get intervention details by ID
- nci_biomarker_searcher: Search biomarkers used in trial eligibility criteria
- nci_disease_searcher: Search NCI's controlled vocabulary of cancer conditions
Gene, Disease & Drug Tools (3)
- gene_getter: Get real-time gene information from MyGene.info
- disease_getter: Get disease definitions and synonyms from MyDisease.info
- drug_getter: Get drug/chemical information from MyChem.info
Note: All individual tools that search by gene automatically include cBioPortal summaries when the include_cbioportal
parameter is True (default). Trial searches can expand disease conditions with synonyms when expand_synonyms
is True (default).
Quick Start
For Claude Desktop Users
-
Install
uv
if you don't have it (recommended):# MacOS brew install uv # Windows/Linux pip install uv
-
Configure Claude Desktop:
- Open Claude Desktop settings
- Navigate to Developer section
- Click "Edit Config" and add:
{ "mcpServers": { "biomcp": { "command": "uv", "args": ["run", "--with", "biomcp-python", "biomcp", "run"] } } }
- Restart Claude Desktop and start chatting about biomedical topics!
Python Package Installation
# Using pip
pip install biomcp-python
# Using uv (recommended for faster installation)
uv pip install biomcp-python
# Run directly without installation
uv run --with biomcp-python biomcp trial search --condition "lung cancer"
Configuration
Environment Variables
BioMCP supports optional environment variables for enhanced functionality:
# cBioPortal API authentication (optional)
export CBIO_TOKEN="your-api-token" # For authenticated access
export CBIO_BASE_URL="https://www.cbioportal.org/api" # Custom API endpoint
# Performance tuning
export BIOMCP_USE_CONNECTION_POOL="true" # Enable HTTP connection pooling (default: true)
export BIOMCP_METRICS_ENABLED="false" # Enable performance metrics (default: false)
Running BioMCP Server
BioMCP supports multiple transport protocols to suit different deployment scenarios:
Local Development (STDIO)
For direct integration with Claude Desktop or local MCP clients:
# Default STDIO mode for local development
biomcp run
# Or explicitly specify STDIO
biomcp run --mode stdio
HTTP Server Mode
BioMCP supports multiple HTTP transport protocols:
Legacy SSE Transport (Worker Mode)
For backward compatibility with existing SSE clients:
biomcp run --mode worker
# Server available at http://localhost:8000/sse
Streamable HTTP Transport (Recommended)
The new MCP-compliant Streamable HTTP transport provides optimal performance and standards compliance:
biomcp run --mode streamable_http
# Custom host and port
biomcp run --mode streamable_http --host 127.0.0.1 --port 8080
Features of Streamable HTTP transport:
- Single
/mcp
endpoint for all operations - Dynamic response mode (JSON for quick operations, SSE for long-running)
- Session management support (future)
- Full MCP specification compliance (2025-03-26)
- Better scalability for cloud deployments
Deployment Options
Docker
# Build the Docker image locally
docker build -t biomcp:latest .
# Run the container
docker run -p 8000:8000 biomcp:latest biomcp run --mode streamable_http
Cloudflare Workers
The worker mode can be deployed to Cloudflare Workers for global edge deployment.
Note: All APIs work without authentication, but tokens may provide higher rate limits.
Command Line Interface
BioMCP provides a comprehensive CLI for direct database interaction:
# Get help
biomcp --help
# Run the MCP server
biomcp run
# Article search examples
biomcp article search --gene BRAF --disease Melanoma # Includes preprints by default
biomcp article search --gene BRAF --no-preprints # Exclude preprints
biomcp article get 21717063 --full
# Clinical trial examples
biomcp trial search --condition "Lung Cancer" --phase PHASE3
biomcp trial search --condition melanoma --source nci --api-key YOUR_KEY # Use NCI API
biomcp trial get NCT04280705 Protocol
biomcp trial get NCT04280705 --source nci --api-key YOUR_KEY # Get from NCI
# Variant examples with external annotations
biomcp variant search --gene TP53 --significance pathogenic
biomcp variant get rs113488022 # Includes TCGA, 1000 Genomes, and cBioPortal data by default
biomcp variant get rs113488022 --no-external # Core annotations only
# NCI-specific examples (requires NCI API key)
biomcp organization search "MD Anderson" --api-key YOUR_KEY
biomcp organization get ORG123456 --api-key YOUR_KEY
biomcp intervention search pembrolizumab --api-key YOUR_KEY
biomcp intervention search --type Device --api-key YOUR_KEY
biomcp biomarker search "PD-L1" --api-key YOUR_KEY
biomcp disease search melanoma --source nci --api-key YOUR_KEY
Testing & Verification
Test your BioMCP setup with the MCP Inspector:
npx @modelcontextprotocol/inspector uv run --with biomcp-python biomcp run
This opens a web interface where you can explore and test all available tools.
Enterprise Version: OncoMCP
OncoMCP extends BioMCP with GenomOncology's enterprise-grade precision oncology platform (POP), providing:
- HIPAA-Compliant Deployment: Secure on-premise options
- Real-Time Trial Matching: Up-to-date status and arm-level matching
- Healthcare Integration: Seamless EHR and data warehouse connectivity
- Curated Knowledge Base: 15,000+ trials and FDA approvals
- Sophisticated Patient Matching: Using integrated clinical and molecular profiles
- Advanced NLP: Structured extraction from unstructured text
- Comprehensive Biomarker Processing: Mutation and rule processing
Learn more: GenomOncology
MCP Registries
Example Use Cases
Gene Information Retrieval
# Get comprehensive gene information
gene_getter(gene_id_or_symbol="TP53")
# Returns: Official name, summary, aliases, links to databases
Disease Synonym Expansion
# Get disease information with synonyms
disease_getter(disease_id_or_name="GIST")
# Returns: "gastrointestinal stromal tumor" and other synonyms
# Search trials with automatic synonym expansion
trial_searcher(conditions=["GIST"], expand_synonyms=True)
# Searches for: GIST OR "gastrointestinal stromal tumor" OR "GI stromal tumor"
Integrated Biomedical Research
# 1. Always start with thinking
think(thought="Analyzing BRAF V600E in melanoma treatment", thoughtNumber=1)
# 2. Get gene context
gene_getter("BRAF")
# 3. Search for pathogenic variants
variant_searcher(gene="BRAF", hgvsp="V600E", significance="pathogenic")
# 4. Find relevant clinical trials with disease expansion
trial_searcher(conditions=["melanoma"], interventions=["BRAF inhibitor"])
Documentation
For comprehensive documentation, visit https://biomcp.org
Developer Guides
- HTTP Client Guide - Using the centralized HTTP client
- Migration Examples - Migrating from direct HTTP usage
- Error Handling Guide - Comprehensive error handling patterns
- Integration Testing Guide - Best practices for reliable integration tests
- Third-Party Endpoints - Complete list of external APIs used
- Testing Guide - Running tests and understanding test categories
Development
Running Tests
# Run all tests (including integration tests)
make test
# Run only unit tests (excluding integration tests)
uv run python -m pytest tests -m "not integration"
# Run only integration tests
uv run python -m pytest tests -m "integration"
Note: Integration tests make real API calls and may fail due to network issues or rate limiting. In CI/CD, integration tests are run separately and allowed to fail without blocking the build.
BioMCP Examples Repo
Looking to see BioMCP in action?
Check out the companion repository: 👉 biomcp-examples
It contains real prompts, AI-generated research briefs, and evaluation runs across different models. Use it to explore capabilities, compare outputs, or benchmark your own setup.
Have a cool example of your own? We’d love for you to contribute! Just fork the repo and submit a PR with your experiment.
License
This project is licensed under the MIT License.
Related Servers
D&D 5E MCP Server
Access Dungeons & Dragons 5th Edition content, including spells, classes, and monsters, via the Open5e API.
RentCast
Access property data, valuations, and market statistics using the RentCast API.
Atlan
Official MCP Server from Atlan which enables you to bring the power of metadata to your AI tools
Teradata
A collection of tools for managing the platform, addressing data quality and reading and writing to Teradata Database.
Snowflake Cortex
An experimental MCP server to access Snowflake Cortex insights from your development environment.
Dynamics 365 MCP Server by CData
A read-only MCP server by CData that enables LLMs to query live data from Dynamics 365. Requires the CData JDBC Driver for Dynamics 365.
Polygon MCP Server
Provides on-chain tools to interact with the Polygon PoS blockchain.
MySQL MCP Server
Integrates with MySQL databases to provide secure database access for LLMs.
Chroma MCP Server
An MCP server for the Chroma embedding database, providing persistent, searchable working memory for AI-assisted development with features like automated context recall and codebase indexing.
Blockscout
Access blockchain data like balances, tokens, and NFTs from Blockscout APIs. Supports multi-chain and progress notifications.