Analyze large codebases and document collections using high-context models via OpenRouter, OpenAI, or Google AI -- very useful, e.g., with Claude Code
Consult7 is a Model Context Protocol (MCP) server that enables AI agents to consult large context window models for analyzing extensive file collections - entire codebases, document repositories, or mixed content that exceed the current agent's context limits. Supports providers Openrouter, OpenAI, and Google.
When working with AI agents that have limited context windows (like Claude with 200K tokens), Consult7 allows them to leverage models with massive context windows to analyze large codebases or document collections that would otherwise be impossible to process in a single query.
"For Claude Code users, Consult7 is a game changer."
Consult7 recursively collects all files from a given path that match your regex pattern (including all subdirectories), assembles them into a single context, and sends them to a large context window model along with your query. The result of this query is directly fed back to the agent you are working with.
".*\.py$"
(all Python files)/Users/john/my-python-project
".*\.(py|js|ts)$"
(Python, JavaScript, TypeScript files)/Users/john/backend
".*test.*\.py$|.*_test\.py$"
(test files)/Users/john/project
".*\.(py|js|ts)$"
"gemini-2.5-flash|thinking"
/Users/john/webapp
Simply run:
# OpenRouter
claude mcp add -s user consult7 uvx -- consult7 openrouter your-api-key
# Google AI
claude mcp add -s user consult7 uvx -- consult7 google your-api-key
# OpenAI
claude mcp add -s user consult7 uvx -- consult7 openai your-api-key
Add to your Claude Desktop configuration file:
{
"mcpServers": {
"consult7": {
"type": "stdio",
"command": "uvx",
"args": ["consult7", "openrouter", "your-api-key"]
}
}
}
Replace openrouter
with your provider choice (google
or openai
) and your-api-key
with your actual API key.
No installation required - uvx
automatically downloads and runs consult7 in an isolated environment.
uvx consult7 <provider> <api-key> [--test]
<provider>
: Required. Choose from openrouter
, google
, or openai
<api-key>
: Required. Your API key for the chosen provider--test
: Optional. Test the API connectionThe model is specified when calling the tool, not at startup. The server shows example models for your provider on startup.
Standard models:
"gemini-2.5-flash"
- Fast model"gemini-2.5-flash-lite-preview-06-17"
- Ultra fast lite model"gemini-2.5-pro"
- Intelligent model"gemini-2.0-flash-exp"
- Experimental modelWith thinking mode (add |thinking
suffix):
"gemini-2.5-flash|thinking"
- Fast with deep reasoning"gemini-2.5-flash-lite-preview-06-17|thinking"
- Ultra fast with deep reasoning"gemini-2.5-pro|thinking"
- Intelligent with deep reasoningStandard models:
"google/gemini-2.5-pro"
- Intelligent, 1M context"google/gemini-2.5-flash"
- Fast, 1M context"google/gemini-2.5-flash-lite-preview-06-17"
- Ultra fast, 1M context"anthropic/claude-sonnet-4"
- Claude Sonnet, 200k context"openai/gpt-4.1"
- GPT-4.1, 1M+ contextWith reasoning mode (add |thinking
suffix):
"anthropic/claude-sonnet-4|thinking"
- Claude with 31,999 reasoning tokens"google/gemini-2.5-flash-lite-preview-06-17|thinking"
- Ultra fast with reasoning"openai/gpt-4.1|thinking"
- GPT-4.1 with reasoning effort=highStandard models (include context length):
"gpt-4.1-2025-04-14|1047576"
- 1M+ context, very fast"gpt-4.1-nano-2025-04-14|1047576"
- 1M+ context, ultra fast"o3-2025-04-16|200k"
- Advanced reasoning model"o4-mini-2025-04-16|200k"
- Fast reasoning modelO-series models with |thinking marker:
"o1-mini|128k|thinking"
- Mini reasoning with |thinking marker"o3-2025-04-16|200k|thinking"
- Advanced reasoning with |thinking markerNote: For OpenAI, |thinking is only supported on o-series models and serves as an informational marker. The models use reasoning tokens automatically.
Advanced: You can specify custom thinking tokens with |thinking=30000
but this is rarely needed.
# Test OpenRouter
uvx consult7 openrouter sk-or-v1-... --test
# Test Google AI
uvx consult7 google AIza... --test
# Test OpenAI
uvx consult7 openai sk-proj-... --test
To remove consult7 from Claude Code (or before reinstalling):
claude mcp remove consult7 -s user
A Python server providing Retrieval-Augmented Generation (RAG) functionality. It indexes various document formats and requires a PostgreSQL database with pgvector.
Gentoro generates MCP Servers based on OpenAPI specifications.
A server for blockchain interactions, offering Ethereum vanity address generation, 4byte lookup, ABI encoding, and multi-chain RPC calls.
Enable AI Agents to fix Playwright test failures reported to Currents.
Obtains latest dependency details for Clojure libraries.
A starter project for building Model Context Protocol (MCP) servers with the mcp-framework.
Access and interact with Harness platform data, including pipelines, repositories, logs, and artifact registries.
Generate images using the Together AI API. Supports custom aspect ratios, save paths, and batch generation.
Interact with the Lean theorem prover via the Language Server Protocol (LSP), enabling LLM agents to understand, analyze, and modify Lean projects.
A universal control plane for managing MCP servers and providing intelligent tool discovery for AI agents.