OpenAI Speech-to-Text
Transcribe audio files using OpenAI's Speech-to-Text API.
OpenAI Speech-to-Text transcriptions MCP Server
An MCP server that provides audio transcription capabilities using OpenAI's API.
Installation
Setup
- Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
- Install dependencies:
npm install
- Build the server:
npm run build
- Set up your OpenAI API key in your environment variables.
- Add the server configuration to your environment:
{
  "mcpServers": {
    "audio-transcriber": {
      "command": "node",
      "args": [
        "/path/to/audio-transcriber-mcp/build/index.js"
      ],
      "env": {
        "OPENAI_API_KEY": "",
        "OPENAI_BASE_URL": "",
        "OPENAI_MODEL": ""
      }
    }
  }
}
Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository. OPENAI_API_KEY is required; OPENAI_BASE_URL and OPENAI_MODEL are optional.
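The configuration entry above can also be generated rather than written by hand, which avoids typos in the absolute path. The following is a minimal sketch, not part of the repository: the helper name buildServerEntry and the clone directory ./audio-transcriber-mcp are assumptions for illustration, and it assumes the server treats an absent optional variable the same as an empty one.

```typescript
import * as path from "node:path";

// Shape of one entry under "mcpServers", mirroring the JSON shown above.
interface ServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

// Hypothetical helper (not provided by the repository): builds the
// "audio-transcriber" entry from a clone directory and an API key.
function buildServerEntry(
  cloneDir: string,
  apiKey: string,
  optional: { baseUrl?: string; model?: string } = {},
): ServerEntry {
  const env: Record<string, string> = { OPENAI_API_KEY: apiKey };
  // Assumption: optional variables may simply be left out when unset.
  if (optional.baseUrl) env.OPENAI_BASE_URL = optional.baseUrl;
  if (optional.model) env.OPENAI_MODEL = optional.model;
  return {
    command: "node",
    // Resolve to an absolute path so the client can launch it from anywhere.
    args: [path.resolve(cloneDir, "build", "index.js")],
    env,
  };
}

const config = {
  mcpServers: {
    "audio-transcriber": buildServerEntry(
      "./audio-transcriber-mcp", // assumed clone location
      process.env.OPENAI_API_KEY ?? "",
    ),
  },
};

// Print the JSON so it can be pasted into the client's configuration file.
console.log(JSON.stringify(config, null, 2));
```

Reading the key from process.env keeps it out of source control; the printed JSON still contains the key, so treat the output as a secret.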
Features
Tools
- transcribe_audio: Transcribe audio files using OpenAI's API
  - Takes filepath as a required parameter
  - Optional parameters:
    - save_to_file: Boolean to save transcription to a file
    - language: ISO-639-1 language code (e.g., "en", "es")
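To make the parameter list concrete, the sketch below shows what a call to this tool might look like on the wire. MCP clients invoke tools with a JSON-RPC tools/call request; the argument names come from the list above, while the helper name buildTranscribeRequest and the file path are placeholders chosen for illustration.

```typescript
// Argument shape taken from the parameter list above.
interface TranscribeAudioArgs {
  filepath: string;       // required
  save_to_file?: boolean; // optional
  language?: string;      // optional ISO-639-1 code, e.g. "en"
}

// Hypothetical helper: wraps the arguments in an MCP "tools/call"
// JSON-RPC 2.0 request addressed to the transcribe_audio tool.
function buildTranscribeRequest(id: number, args: TranscribeAudioArgs) {
  return {
    jsonrpc: "2.0" as const,
    id,
    method: "tools/call" as const,
    params: { name: "transcribe_audio", arguments: args },
  };
}

const request = buildTranscribeRequest(1, {
  filepath: "/path/to/recording.mp3", // placeholder path
  save_to_file: true,
  language: "en",
});

console.log(JSON.stringify(request, null, 2));
```

In practice an MCP client library builds this request for you; the sketch only shows which fields carry the tool name and its arguments.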
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.