kokoro-tts-mcp-server
Model Context Protocol (MCP) server for Kokoro text-to-speech with female voice. 100% local, no Python required. Supports SSE and stdio transports.

MCP server for text-to-speech using Kokoro TTS
100% Local • No Python Required • Female Voice Default
Overview
A production-ready MCP server that provides text-to-speech capabilities using the Kokoro TTS model. Features a default female voice (af_heart) and runs 100% locally using native JavaScript, eliminating Python dependencies.
Built by Ross Technologies
📍 Beer Sheva, Israel | 📧 [email protected]
Features
✅ 100% Local - No external API calls, complete privacy
✅ Native JavaScript - Built with TypeScript and Node.js
✅ SSE & Stdio Support - Multiple transport modes
✅ NPX Ready - Run directly without installation
✅ Female Voice Default - Uses af_heart voice out of the box
✅ Auto Audio Playback - Automatically saves and plays generated audio
Quick Start
# Run with npx (no installation)
npx @ross_tchnologies/kokoro-tts-mcp-server
# Or install globally
npm install -g @ross_tchnologies/kokoro-tts-mcp-server
kokoro-tts-mcp
Installation
NPM
npm install @ross_tchnologies/kokoro-tts-mcp-server
NPX (No Installation)
npx @ross_tchnologies/kokoro-tts-mcp-server
Global Installation
npm install -g @ross_tchnologies/kokoro-tts-mcp-server
Visit Installation Guide for detailed instructions.
MCP Client Configuration
Add to your MCP configuration file (e.g., ~/.cursor/mcp.json):
{
  "mcpServers": {
    "kokoro-tts": {
      "command": "npx",
      "args": ["-y", "@ross_tchnologies/kokoro-tts-mcp-server"]
    }
  }
}
See Configuration Guide for more options.
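If your MCP configuration file already lists other servers, the kokoro-tts entry has to be merged in rather than pasted over them. The following is a minimal sketch of that merge, not part of this package: the McpConfig and McpServerEntry types, the withKokoroServer helper, and the "other" server are hypothetical names used for illustration.

```typescript
// Hypothetical types describing the config shape shown above.
interface McpServerEntry {
  command: string;
  args: string[];
}

interface McpConfig {
  mcpServers?: Record<string, McpServerEntry>;
}

// Hypothetical helper: returns a copy of the config with the kokoro-tts
// entry added, leaving any existing servers untouched.
function withKokoroServer(config: McpConfig): McpConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      "kokoro-tts": {
        command: "npx",
        args: ["-y", "@ross_tchnologies/kokoro-tts-mcp-server"],
      },
    },
  };
}

// Example: a config that already contains another (made-up) server.
const existing: McpConfig = {
  mcpServers: { other: { command: "node", args: ["server.js"] } },
};
console.log(JSON.stringify(withKokoroServer(existing), null, 2));
```

The helper returns a new object instead of mutating its input, so the original config can be kept as a backup before the result is written to disk.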
Usage
Text to Speech Tool
Call the text_to_speech tool with the text to speak. The voice and speed arguments are optional; the values shown here are the defaults.
{
  "name": "text_to_speech",
  "arguments": {
    "text": "Hello, world!",
    "voice": "af_heart",
    "speed": 1.0
  }
}
Available Voices: af_heart (default), af_bella, af_sarah, and more.
See API Reference for complete documentation.
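MCP clients send tool calls as JSON-RPC 2.0 requests with the method tools/call, so the name and arguments above end up inside a params object. The sketch below shows what such a request could look like. The buildTextToSpeechRequest helper and its type names are hypothetical, and the two input checks are client-side assumptions rather than rules documented by this server.

```typescript
// Arguments of the text_to_speech tool as shown in the usage example.
interface TextToSpeechArgs {
  text: string;
  voice?: string; // optional; af_heart is the documented default
  speed?: number; // optional; 1.0 is the documented default
}

// JSON-RPC 2.0 envelope that MCP uses for tool calls.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: TextToSpeechArgs };
}

// Hypothetical helper: builds the request a client would send.
function buildTextToSpeechRequest(id: number, args: TextToSpeechArgs): ToolCallRequest {
  // Assumed sanity checks; the server's own validation may differ.
  if (args.text.trim().length === 0) {
    throw new Error("text must not be empty");
  }
  if (args.speed !== undefined && !(args.speed > 0)) {
    throw new Error("speed must be a positive number");
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "text_to_speech", arguments: args },
  };
}

const request = buildTextToSpeechRequest(1, { text: "Hello, world!", voice: "af_bella" });
console.log(JSON.stringify(request, null, 2));
```

Leaving voice and speed out of the arguments lets the server apply its defaults, which keeps client code independent of any future change to those defaults.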
Documentation
Quick Links:
- 📖 Installation Guide
- 🚀 Getting Started
- ⚙️ Configuration
- 📖 API Reference
- 💻 Examples
- 🔧 Troubleshooting
- ❓ FAQ
- 🛠️ Development Guide
Requirements
- Node.js v18 or higher
- No Python or other external dependencies required! 🎉
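If you want to verify the Node.js requirement from a script before launching the server, a check along these lines may help. This is a sketch only: meetsNodeRequirement is a hypothetical helper, not something the package exports.

```typescript
// Returns true when a Node.js version string such as "18.17.0" or "v20.11.1"
// has a major version of at least minMajor.
function meetsNodeRequirement(version: string, minMajor: number = 18): boolean {
  // Drop an optional leading "v" and take the part before the first dot.
  const [majorPart = ""] = version.replace(/^v/, "").split(".");
  const major = Number.parseInt(majorPart, 10);
  return Number.isInteger(major) && major >= minMajor;
}

console.log(meetsNodeRequirement("18.17.0")); // true
console.log(meetsNodeRequirement("v16.20.2")); // false
```

In a real script you would pass process.versions.node, which holds the running Node.js version without the leading "v".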
Troubleshooting
- First run: The server downloads the ~300MB model once; this takes 2-5 minutes.
- Audio not playing: The file is still saved. Check the response message for its location.
- WSL users: Copy the audio file to the Windows Desktop and play it from there.
See Troubleshooting Guide for detailed solutions.
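For the WSL case, the copy destination can be worked out from the saved file's path. The sketch below assumes the default WSL mount point (/mnt/c) and a Desktop folder under C:\Users\<user>\Desktop, which may not hold if your Desktop is redirected elsewhere. windowsDesktopTarget, the /tmp/speech.wav path, and the user name alice are hypothetical.

```typescript
// Hypothetical helper: maps a file saved inside WSL to a destination path
// on the Windows Desktop, as seen from within WSL.
function windowsDesktopTarget(wslFilePath: string, windowsUser: string): string {
  // Keep only the last non-empty path segment (the file name).
  const segments = wslFilePath.split("/").filter((s) => s.length > 0);
  const fileName = segments[segments.length - 1];
  if (fileName === undefined) {
    throw new Error("path has no file name");
  }
  return `/mnt/c/Users/${windowsUser}/Desktop/${fileName}`;
}

console.log(windowsDesktopTarget("/tmp/speech.wav", "alice"));
// prints "/mnt/c/Users/alice/Desktop/speech.wav"
```

The returned path can be used as the target of a plain cp command, with the source path taken from the tool's response message.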
Contributing
Contributions are welcome!
Links & Resources
- 📦 NPM Package - Install from npm
- 🐙 GitHub Repository - Source code and issues
- 📚 Documentation Wiki - Complete documentation
- 🐛 Report Issues - Bug reports and feature requests
- 💬 Discussions - Community discussions
- 🔗 Model Context Protocol - Learn about MCP
- 🔗 Kokoro TTS Model - Underlying TTS engine
License
MIT License - See LICENSE for details.
Copyright © 2025 Ross Technologies. All rights reserved.
Made with ❤️ by Ross Technologies