Speech AI

Production speech AI MCP server with pronunciation scoring, speech-to-text, and text-to-speech — 10 tools, 7 resources, 3 prompts.

Brainiall AI APIs

API Status License: MIT MCP Servers Azure Marketplace Models

Production AI APIs for speech, text, image, and LLM inference. Available as REST endpoints and MCP servers for AI agents.

Base URL: https://apim-ai-apis.azure-api.net Full API reference for LLMs: llms-full.txt | llms.txt

Products

ProductEndpointsLatencyNotes
Pronunciation Assessment/v1/pronunciation/assess/base64<500ms17MB ONNX, per-phoneme scoring (39 ARPAbet)
Text-to-Speech/v1/tts/synthesize<1s12 voices (American + British), 24kHz WAV
Speech-to-Text/v1/stt/transcribe/base64<500msCompact 17MB model, English, word timestamps
Whisper Pro/v1/whisper/transcribe/base64<3s99 languages, speaker diarization
NLP Suite/v1/nlp/{toxicity,sentiment,entities,pii,language}<50msCPU-only, ONNX, 5 endpoints
Image Processing/v1/image/{remove-background,upscale,restore-face}/base64<3sGPU (A10), BiRefNet + ESRGAN + GFPGAN
LLM Gateway/v1/chat/completionsvaries113+ models, OpenAI-compatible, streaming

Authentication

Include ONE of these headers in every request:

Ocp-Apim-Subscription-Key: YOUR_KEY
Authorization: Bearer YOUR_KEY
api-key: YOUR_KEY

Get API keys at the portal (GitHub sign-in, purchase credits, create key).

Quick Start

Python — LLM Gateway (OpenAI SDK)

from openai import OpenAI

client = OpenAI(
    base_url="https://apim-ai-apis.azure-api.net/v1",
    api_key="YOUR_KEY"
)

response = client.chat.completions.create(
    model="claude-sonnet",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Python — Pronunciation Assessment

import requests, base64

audio_b64 = base64.b64encode(open("audio.wav", "rb").read()).decode()
r = requests.post(
    "https://apim-ai-apis.azure-api.net/v1/pronunciation/assess/base64",
    headers={"Ocp-Apim-Subscription-Key": "YOUR_KEY"},
    json={"audio": audio_b64, "text": "Hello world", "format": "wav"}
)
print(r.json()["overallScore"])  # 0-100

Python — NLP Pipeline

import requests

headers = {"Ocp-Apim-Subscription-Key": "YOUR_KEY"}
base = "https://apim-ai-apis.azure-api.net/v1/nlp"

# Sentiment
r = requests.post(f"{base}/sentiment", headers=headers, json={"text": "I love this!"})
print(r.json())  # {"label": "positive", "score": 0.9987}

# PII detection with redaction
r = requests.post(f"{base}/pii", headers=headers, json={"text": "Email [email protected]", "redact": True})
print(r.json()["redacted_text"])  # "Email [EMAIL]"

Node.js — LLM Gateway

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://apim-ai-apis.azure-api.net/v1",
  apiKey: "YOUR_KEY"
});

const res = await client.chat.completions.create({
  model: "claude-sonnet",
  messages: [{ role: "user", content: "Hello!" }]
});
console.log(res.choices[0].message.content);

curl — Image Background Removal

curl -X POST https://apim-ai-apis.azure-api.net/v1/image/remove-background/base64 \
  -H "Ocp-Apim-Subscription-Key: YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d "{\"image\": \"$(base64 -i photo.jpg)\"}"

LLM Gateway — Popular Models

ModelAliasPrice ($/MTok in/out)
Claude Opus 4.6claude-opus$5 / $25
Claude Sonnet 4.6claude-sonnet$3 / $15
Claude Haiku 4.5claude-haiku$1 / $5
DeepSeek R1deepseek-r1$1.35 / $5.40
DeepSeek V3deepseek-v3$0.27 / $1.10
Llama 3.3 70Bllama-3.3-70b$0.72 / $0.72
Amazon Nova Pronova-pro$0.80 / $3.20
Amazon Nova Micronova-micro$0.035 / $0.14
Mistral Large 3mistral-large-3$2 / $6
Qwen3 32Bqwen3-32b$0.35 / $0.35

Full list: GET /v1/models (113+ models from 17 providers).

Supports: streaming SSE, tool calling, structured output (json_object/json_schema), extended thinking.

Works with: OpenAI SDK, LiteLLM, LangChain, Cline, Cursor, Aider, Continue, SillyTavern, Open WebUI.

MCP Servers (for AI Agents)

3 MCP servers with 20 tools total. Streamable HTTP transport.

ServerURLTools
Speech AIhttps://apim-ai-apis.azure-api.net/mcp/pronunciation/mcp10 tools + 8 resources + 3 prompts
NLP Toolshttps://apim-ai-apis.azure-api.net/mcp/nlp/mcp6 tools + 3 resources + 3 prompts
Image Toolshttps://apim-ai-apis.azure-api.net/mcp/image/mcp4 tools + 3 resources + 2 prompts

MCP Configuration (Claude Desktop / Cursor / Cline)

{
  "mcpServers": {
    "brainiall-speech": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/pronunciation/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    },
    "brainiall-nlp": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/nlp/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    },
    "brainiall-image": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/image/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    }
  }
}

Also available on: Smithery (score 95/100) | MCPize | Apify ($0.02/call) | MCP Registry

Examples

FileDescription
python/basic_usage.pySpeech APIs — assess, transcribe, synthesize
python/pronunciation_tutor.pyInteractive pronunciation tutor
javascript/basic_usage.jsNode.js examples for speech APIs
curl/examples.shcurl commands for every endpoint
mcp/claude-desktop-config.jsonMCP config for Claude Desktop
mcp/cursor-config.jsonMCP config for Cursor IDE
llms-full.txtComplete API reference for LLM consumption

Pricing

ProductPriceUnit
Pronunciation$0.02per call
TTS$0.01-0.03per 1K chars
STT (compact)$0.01per request
Whisper Pro$0.02per minute
NLP (any)$0.001-0.002per call
Image (any)$0.003-0.005per image
LLM Gatewaycompetitive pricingper MTok

Credit packages: $5, $10, $25, $50, $100. Portal | Azure Marketplace (search "Brainiall").

License

MIT — Brainiall

Máy chủ liên quan

NotebookLM Web Importer

Nhập trang web và video YouTube vào NotebookLM chỉ với một cú nhấp. Được tin dùng bởi hơn 200.000 người dùng.

Cài đặt tiện ích Chrome