CC Token Saver
Use a local LLM for smaller or specialized tasks within Claude to save tokens.
cc_token_saver_mcp
Allow Claude code to use local llm for smaller tasks to save token or for specialized task.
Reduce your Claude Code tokens with ‘CC token saver’ MCP server that intelligently delegates simple tasks to your local LLM while keeping Claude Code for complex coordination and architecture decisions.
The MCP server exposes your local LLM as tools that Claude Code can use for:
- Code snippet generation
- Simple refactoring tasks
- Documentation writing
- Code reviews
- Basic Q&A Claude Code automatically tries the local LLM first for simple tasks, only using premium tokens when necessary for complex reasoning and multi-step workflows.
MCP server config
Create a .env file with the LLM config
Example:
# Local LLM Configuration
OPENAI_API_KEY=none
OPENAI_BASE_URL=http://localhost:1234/v1
LOCAL_MODEL_NAME=qwen2.5-7b-instruct
LOCAL_LLM_TEMPERATURE=0.7
LOCAL_LLM_MAX_TOKENS=-1
Claude Code MCP config
edit the ~/.claude.json file
"mcpServers": {
"cc-token-saver": {
"type": "stdio",
"command": "python",
"args": [
"<path>/cc_token_saver_mcp/server.py"
]
}
},
Example usage:
Related Servers
Alpha Vantage MCP Server
sponsorAccess financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more
JetBrains
Work on your code with JetBrains IDEs
Trade Router MCP
Non-custodial Solana swap & limit-order MCP server. 21 tools across Raydium, PumpSwap, Orca, Meteora. Jito MEV-protected. Private key never leaves the agent.
MCP Hot-Reload
A Hot Module Replacement (HMR) proxy server for MCP servers that automatically restarts on file changes, buffers messages, and manages connections.
MCP Proxy Server
Aggregates multiple MCP resource servers into a single interface.
Ollama MCP Server
Integrate local Ollama LLM instances with MCP-compatible applications.
MCP DevTools
A development tools server for Git management, file operations, AI-assisted editing, and terminal execution, integrable with AI assistants and code editors.
Zeplin
Official Zeplin server for AI-assisted UI development.
OpenAPI Invoker
Invokes any OpenAPI specification through a Model Context Protocol (MCP) server.
Restk
Native macOS API client with embedded MCP server. 31+ tools for AI-assisted API development. Schema extraction ensures AI never sees real credentials or PII. Supports Claude Code, Claude Desktop, Cursor
Remote MCP Server (Authless)
A remote MCP server deployable on Cloudflare Workers without authentication.