soundside.ai
MCP-native AI media generation with x402 pay-per-call. Image, video, audio, and music from 6 providers — composable via resource IDs. USDC on Base.
Soundside — Developer Documentation
AI Media Production Platform for Agents
Soundside exposes 15 MCP tools for generating, editing, composing, extracting, and analyzing media — images, video, audio, music, text, and business artifacts. Connect any MCP client. Pay with an API key (credits) or crypto (x402 USDC on Base, no account needed).
Quick Start
# MCP endpoint
https://mcp.soundside.ai/mcp
# Auth: API key or x402 crypto payment
Authorization: Bearer <your-api-key>
POST https://mcp.soundside.ai/mcp
{"jsonrpc":"2.0","id":"1","method":"tools/list","params":{}}
Tools (15)
Generation
| Tool | What It Does | Providers |
|---|---|---|
create_image | Text-to-image, character references | Vertex AI, Grok, Runway, MiniMax, Luma |
create_video | Text-to-video, image-to-video, video extension | Vertex AI (Veo 3.1), Runway, MiniMax, Luma, Grok |
create_audio | TTS, deprecated transcribe compatibility shim, voice cloning, sound effects, voice design | MiniMax, Vertex AI, Runway, Creative Freedom |
create_music | Music from lyrics and style prompts | MiniMax, Creative Freedom |
create_text | LLM chat completions, structured output | Vertex AI (Gemini), Grok, MiniMax |
create_artifact | Charts, presentations, documents, diagrams; bundle mode for multi-artifact packages from a single brief | Plotly, PPTX, WeasyPrint, Mermaid, Gamma |
Editing & Analysis
| Tool | What It Does |
|---|---|
edit_video | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, custom FFmpeg |
edit_audio | Mix, replace, or pad audio on existing media |
compose_media | Add text, overlay media, or build split-screen composites |
apply_effect | Ken Burns, speed ramp, film grain, vignette |
extract_media | Extract frames, frame sets, or audio tracks |
analyze_media | Technical analysis, AI vision QA, transcription, segment detection, and EDL export |
Library Management
| Tool | What It Does |
|---|---|
lib_list | Browse projects, collections, resources, lineage, brand kits; query credit balance |
lib_manage | CRUD for projects, collections, resources, brand kits |
lib_share | Share projects with other users by email |
Pricing Philosophy
Soundside aims to break even on provider pass-through costs with a small margin (~10%). The real value is in the editing engine, library management, and self-hosted models — those are priced at $0.01/call ($0.03 for analyze_media vision_qa).
Live pricing is always available at:
GET https://mcp.soundside.ai/api/x402/status
This returns machine-readable per-tool, per-provider USDC prices. Prices are DB-driven and may change — always check the endpoint rather than hardcoding.
x402: Pay-Per-Call with Crypto
No API key needed. Pay with USDC on Base (L2) per tool call.
Network: eip155:8453 (Base mainnet)
Token: USDC
Facilitator: Coinbase CDP
See x402 Guide for full setup.
Guides
- Getting Started — First MCP connection in 5 minutes
- x402 Pay-Per-Call — Crypto payments, no account needed
- Tool Reference — Detailed docs for all 15 tools
Examples
- Python — API Key — Connect and generate with httpx
- Python — x402 — Pay-per-call with USDC
- TypeScript — API Key — Node.js MCP client
- OpenClaw Skill — One-line config for OpenClaw agents
Links
- Website: soundside.ai
- MCP Endpoint:
https://mcp.soundside.ai/mcp - Live Pricing:
https://mcp.soundside.ai/api/x402/status - GitHub: github.com/soundside-design/soundside-docs
関連サーバー
Oracle Service Cloud by CData
A read-only MCP server by CData that enables LLMs to query live data from Oracle Service Cloud.
MCP Google Map Server
Integrates Google Maps API for location-based queries and data processing.
AWS MCP Servers
Access AWS documentation, best practices, and service integrations via the Model Context Protocol.
Space Frontiers
Interfaces with the Space Frontiers API, enabling language models to interact with its data sources.
Lemon Squeezy Server
Integrates with Lemon Squeezy to manage subscriptions, checkouts, and products.
ConnectWise API Gateway
A comprehensive interface for interacting with the ConnectWise Manage API.
Domino Data Lab
Interact with the Domino Data Lab platform to execute and check the status of Domino jobs.
mcp-pfsense
MCP server for managing pfSense firewalls through AI assistants — firewall rules, DHCP, DNS, gateways, ARP, and services. 17 tools with two-step confirmation for destructive operations.
Remote MCP Proxy
A Docker-based proxy to access local MCP servers through Claude's web UI using the Remote MCP protocol.
Gemini OCR
Provides Optical Character Recognition (OCR) services using Google's Gemini API.