kokoro-tts-mcp-server
Model Context Protocol (MCP) server for Kokoro text-to-speech with female voice. 100% local, no Python required. Supports SSE and stdio transports.

MCP server for text-to-speech using Kokoro TTS
100% Local • No Python Required • Female Voice Default
Overview
A production-ready MCP server that provides text-to-speech capabilities using the Kokoro TTS model. Features a default female voice (af_heart) and runs 100% locally using native JavaScript, eliminating Python dependencies.
Built by Ross Technologies
📍 Beer Sheva, Israel | 📧 [email protected]
Features
✅ 100% Local - No external API calls, complete privacy
✅ Native JavaScript - Built with TypeScript and Node.js
✅ SSE & Stdio Support - Multiple transport modes
✅ NPX Ready - Run directly without installation
✅ Female Voice Default - Uses af_heart voice out of the box
✅ Auto Audio Playback - Automatically saves and plays generated audio
Quick Start
# Run with npx (no installation)
npx @ross_tchnologies/kokoro-tts-mcp-server
# Or install globally
npm install -g @ross_tchnologies/kokoro-tts-mcp-server
kokoro-tts-mcp
Installation
NPM
npm install @ross_tchnologies/kokoro-tts-mcp-server
NPX (No Installation)
npx @ross_tchnologies/kokoro-tts-mcp-server
Global Installation
npm install -g @ross_tchnologies/kokoro-tts-mcp-server
Visit Installation Guide for detailed instructions.
MCP Client Configuration
Add to your MCP configuration file (e.g., ~/.cursor/mcp.json):
{
"mcpServers": {
"kokoro-tts": {
"command": "npx",
"args": ["-y", "@ross_tchnologies/kokoro-tts-mcp-server"]
}
}
}
See Configuration Guide for more options.
Usage
Text to Speech Tool
{
"name": "text_to_speech",
"arguments": {
"text": "Hello, world!",
"voice": "af_heart", // optional, default
"speed": 1.0 // optional, default
}
}
Available Voices: af_heart (default), af_bella, af_sarah, and more.
See API Reference for complete documentation.
Documentation
Quick Links:
- 📖 Installation Guide
- 🚀 Getting Started
- ⚙️ Configuration
- 📖 API Reference
- 💻 Examples
- 🔧 Troubleshooting
- ❓ FAQ
- 🛠️ Development Guide
Requirements
- Node.js v18 or higher
- No Python or other external dependencies required! 🎉
Troubleshooting
- First run: Downloads ~300MB model (one-time, takes 2-5 minutes)
- Audio not playing: File is saved - check response message for location
- WSL users: Copy audio files to Windows Desktop for playback
See Troubleshooting Guide for detailed solutions.
Contributing
Contributions are welcome! See:
Links & Resources
- 📦 NPM Package - Install from npm
- 🐙 GitHub Repository - Source code and issues
- 📚 Documentation Wiki - Complete documentation
- 🐛 Report Issues - Bug reports and feature requests
- 💬 Discussions - Community discussions
- 🔗 Model Context Protocol - Learn about MCP
- 🔗 Kokoro TTS Model - Underlying TTS engine
License
MIT License - See LICENSE for details.
Copyright © 2025 Ross Technologies. All rights reserved.
Made with ❤️ by Ross Technologies
関連サーバー
Reddit
Interact with Reddit by posting comments and searching for posts.
ElevenLabs
The official ElevenLabs MCP server
Gmail MCP Server
Allows AI agents to search Gmail threads, learn your writing style, and draft emails.
Aligo SMS
Send SMS messages using the Aligo SMS API.
mcp2mqtt
An MCP service for device communication and PWM control via the MQTT protocol.
Email Reader
Read, search, and manage emails from your inbox, including downloading attachments and listing folders.
Tldv
Connect your AI agents to Google-Meet, Zoom & Microsoft Teams through tl;dv
SourceGeek
Enable LinkedIn outreach and retrieving enriched Profile data
Telegram MCP Server
Interact with the Telegram messaging service to send and receive messages.
Audio MCP Server
Provides audio input and output capabilities for AI assistants.