CC Token Saver
Use a local LLM for smaller or specialized tasks within Claude to save tokens.
cc_token_saver_mcp
Allow Claude code to use local llm for smaller tasks to save token or for specialized task.
Reduce your Claude Code tokens with ‘CC token saver’ MCP server that intelligently delegates simple tasks to your local LLM while keeping Claude Code for complex coordination and architecture decisions.
The MCP server exposes your local LLM as tools that Claude Code can use for:
- Code snippet generation
- Simple refactoring tasks
- Documentation writing
- Code reviews
- Basic Q&A Claude Code automatically tries the local LLM first for simple tasks, only using premium tokens when necessary for complex reasoning and multi-step workflows.
MCP server config
Create a .env file with the LLM config
Example:
# Local LLM Configuration
OPENAI_API_KEY=none
OPENAI_BASE_URL=http://localhost:1234/v1
LOCAL_MODEL_NAME=qwen2.5-7b-instruct
LOCAL_LLM_TEMPERATURE=0.7
LOCAL_LLM_MAX_TOKENS=-1
Claude Code MCP config
edit the ~/.claude.json file
"mcpServers": {
"cc-token-saver": {
"type": "stdio",
"command": "python",
"args": [
"<path>/cc_token_saver_mcp/server.py"
]
}
},
Example usage:
Related Servers
MCP Domain Availability Checker
Check domain availability directly from MCP clients using the Namecheap API.
NEI MCP Server
A server for interacting with the NEI platform to query project resources.
OpenAI GPT Image
Generate and edit images using OpenAI's GPT-4o image generation and editing APIs with advanced prompt control.
Gemini CLI
Integrates with the unofficial Google Gemini CLI, allowing file access within configured directories.
Ansible & OpenShift Automation
Provides tools to interact with the Ansible Automation Platform API for automation tasks.
Metasploit MCP Server
An MCP server for integrating with the Metasploit Framework, enabling payload generation and management.
PHP MCP Server for Laravel
A Laravel wrapper for the php-mcp/server library to expose Laravel applications as MCP servers.
Code Scanner Server
Scans code files for definitions, respects .gitignore, and outputs in LLM-friendly formats like XML or Markdown.
Projet MCP Server-Client
An implementation of the Model Context Protocol (MCP) for communication between AI models and external tools, featuring server and client examples in Python and Spring Boot.
npm Package README
Fetch READMEs, metadata, and search for packages on the npm registry.