OpenAI Speech-to-Text
Transcribe audio files using OpenAI's Speech-to-Text API.
OpenAI Speech-to-Text transcriptions MCP Server
An MCP server that provides audio transcription capabilities using OpenAI's API.
Installation
Setup
- Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
- Install dependencies:
npm install
- Build the server:
npm run build
- Set up your OpenAI API key in your environment variables.
- Add the server configuration to your environment:
{
  "mcpServers": {
    "audio-transcriber": {
      "command": "node",
      "args": [
        "/path/to/audio-transcriber-mcp/build/index.js"
      ],
      "env": {
        "OPENAI_API_KEY": "",
        "OPENAI_BASE_URL": "",
        "OPENAI_MODEL": ""
      }
    }
  }
}
Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository. OPENAI_API_KEY is required; OPENAI_BASE_URL and OPENAI_MODEL are optional and can be omitted. Strict JSON does not allow comments, so do not annotate the entries inline.
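If you prefer to generate this entry rather than edit it by hand, the following is a minimal sketch. The helper name buildServerConfig, the TranscriberEnv type, and the placeholder key are hypothetical and not part of this repository. Only the environment variable names and the build/index.js path come from the configuration above. Optional variables are left out of the env block when not provided.

```typescript
// Hypothetical helper: builds the "mcpServers" entry shown above.
// Names of the function and interface are illustrative assumptions.
interface TranscriberEnv {
  apiKey: string;   // maps to OPENAI_API_KEY (required)
  baseUrl?: string; // maps to OPENAI_BASE_URL (optional)
  model?: string;   // maps to OPENAI_MODEL (optional)
}

function buildServerConfig(repoPath: string, envIn: TranscriberEnv) {
  const env: Record<string, string> = { OPENAI_API_KEY: envIn.apiKey };
  // Only emit optional variables when a value was supplied.
  if (envIn.baseUrl) env.OPENAI_BASE_URL = envIn.baseUrl;
  if (envIn.model) env.OPENAI_MODEL = envIn.model;

  // Drop trailing slashes so the joined path has exactly one separator.
  const trimmed = repoPath.replace(/\/+$/, "");
  return {
    mcpServers: {
      "audio-transcriber": {
        command: "node",
        args: [`${trimmed}/build/index.js`],
        env,
      },
    },
  };
}

// Placeholder path and key; substitute your own values.
const exampleConfig = buildServerConfig("/path/to/audio-transcriber-mcp", {
  apiKey: "sk-placeholder",
});
console.log(JSON.stringify(exampleConfig, null, 2));
```

The printed JSON can be pasted into whichever MCP client configuration file you use.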
Features
Tools
- transcribe_audio: Transcribe audio files using OpenAI's API
  - Takes filepath as a required parameter
  - Optional parameters:
    - save_to_file: Boolean to save transcription to a file
    - language: ISO-639-1 language code (e.g., "en", "es")
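To illustrate how a client might invoke this tool, here is a minimal sketch that builds an MCP tools/call JSON-RPC request for transcribe_audio. The parameter names come from the list above. The helper buildTranscribeRequest, the TranscribeArgs type, the two-letter language check, and the example file path are assumptions for illustration and are not taken from the server's source.

```typescript
// Arguments accepted by transcribe_audio, as described in this README.
interface TranscribeArgs {
  filepath: string;        // required
  save_to_file?: boolean;  // optional
  language?: string;       // optional ISO-639-1 code, e.g. "en"
}

// Hypothetical helper: wraps the arguments in a JSON-RPC 2.0 "tools/call" request.
function buildTranscribeRequest(id: number, args: TranscribeArgs) {
  if (!args.filepath) {
    throw new Error("filepath is required");
  }
  // Assumed client-side check: ISO-639-1 codes are two lowercase letters.
  if (args.language !== undefined && !/^[a-z]{2}$/.test(args.language)) {
    throw new Error(`expected a two-letter ISO-639-1 code, got "${args.language}"`);
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "transcribe_audio", arguments: args },
  };
}

// Placeholder file path; substitute a real audio file.
const exampleRequest = buildTranscribeRequest(1, {
  filepath: "/path/to/recording.mp3",
  language: "en",
});
console.log(JSON.stringify(exampleRequest));
```

In practice an MCP client library usually builds this envelope for you; the sketch only shows which fields the tool expects.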
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.