soundside.ai

MCP-native AI media generation with x402 pay-per-call. Image, video, audio, and music from 6 providers — composable via resource IDs. USDC on Base.

Soundside — Developer Documentation

AI Media Production Platform for Agents

Soundside exposes 19 MCP tools for generating, editing, composing, extracting, and analyzing media — images, video, audio, music, text, and business artifacts — plus LoRA adapter fine-tuning and server-side video composition. Connect any MCP client. Pay with an API key (credits) or crypto (x402 USDC on Base, no account needed).

Quick Start

# MCP endpoint
https://mcp.soundside.ai/mcp

# Auth: API key or x402 crypto payment
Authorization: Bearer <your-api-key>
POST https://mcp.soundside.ai/mcp
{"jsonrpc":"2.0","id":"1","method":"tools/list","params":{}}
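The request above can be sent from any HTTP client. Below is a minimal sketch using only the Python standard library. The helper names `build_tools_list_request` and `send` are illustrative, not part of Soundside. The `Accept` header follows the MCP HTTP transport convention. Whether the server expects an `initialize` handshake before `tools/list`, and whether it replies with plain JSON or an event stream, is not stated here, so the sketch returns the raw response text rather than parsing it.

```python
import json
import urllib.request

MCP_ENDPOINT = "https://mcp.soundside.ai/mcp"


def build_tools_list_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) the tools/list JSON-RPC request shown above."""
    body = {"jsonrpc": "2.0", "id": "1", "method": "tools/list", "params": {}}
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
        # MCP's HTTP transport asks clients to accept both response styles.
        "Accept": "application/json, text/event-stream",
    }
    return urllib.request.Request(
        MCP_ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the raw response body as text."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Usage would look like `print(send(build_tools_list_request("<your-api-key>")))`. Building and sending are kept separate so the request can be inspected or logged before any network call is made.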

Tools (19)

Generation

| Tool | What It Does | Providers |
| --- | --- | --- |
| create_image | Text-to-image, character references | Alibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI |
| create_video | Text-to-video, image-to-video, video extension | Alibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI (Veo 3.1) |
| create_audio | TTS, sound effects, voice cloning, voice design | MiniMax, Runway, Vertex AI |
| create_music | Music from lyrics and style prompts | MiniMax |
| create_text | LLM chat completions, structured output | Grok, MiniMax, Vertex AI (Gemini) |
| create_artifact | Charts, presentations, documents, diagrams; bundle mode for multi-artifact packages | plotly, pptx, docx, weasyprint, mermaid, gamma |

Composition

| Tool | What It Does |
| --- | --- |
| compose_video | Server-side pipeline: enrich plan, generate assets in parallel, assemble with transitions, audio ducking, and overlays |

Editing

| Tool | What It Does |
| --- | --- |
| edit_video | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, custom FFmpeg |
| edit_audio | Mix, replace, or pad audio on existing media |
| compose_media | Add text, overlay media, or build split-screen composites |
| apply_effect | Ken Burns, speed ramp, film grain, vignette |
| extract_media | Extract frames, frame sets, or audio tracks |

Analysis

| Tool | What It Does | Providers |
| --- | --- | --- |
| analyze_media | Technical metadata, vision QA, transcription, segment detection, EDL export | Anthropic, Grok, OpenAI, Qwen, Vertex (+ soundside.ai ffprobe) |

Adapters (LoRA)

| Tool | What It Does | Backends |
| --- | --- | --- |
| train_adapter | Train a LoRA adapter from library media | DashScope (Wan), Modal (Hunyuan/LTX) |
| list_adapters | List your LoRA adapters | |
| manage_adapter | Inspect, deploy, undeploy, delete, or select checkpoint | |

Library Management

| Tool | What It Does |
| --- | --- |
| lib_list | Browse projects, collections, resources, lineage, brand kits; query credit balance |
| lib_manage | CRUD for projects, collections, resources, brand kits |
| lib_share | Share projects with other users by email |
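Every tool listed above is invoked through the standard MCP `tools/call` method, and its accepted arguments are published as an `inputSchema` in the `tools/list` response. The sketch below shows both steps. The helper names are illustrative, and the `prompt` argument in the usage lines is hypothetical: this page does not document argument names, so read them from the schema before calling a tool.

```python
import json
from typing import Any, Optional


def build_tool_call(tool_name: str, arguments: dict, request_id: str = "1") -> str:
    """Serialize an MCP tools/call JSON-RPC request for the given tool."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    })


def input_schema_for(tools_list_response: dict, tool_name: str) -> Optional[Any]:
    """Return the inputSchema of one tool from a parsed tools/list response."""
    for tool in tools_list_response.get("result", {}).get("tools", []):
        if tool.get("name") == tool_name:
            return tool.get("inputSchema")
    return None  # tool not present in the listing


# Hypothetical argument name; check input_schema_for(...) for the real ones.
payload = build_tool_call("create_image", {"prompt": "a lighthouse at dusk"})
```

Looking up the schema first keeps client code from hardcoding parameters that may differ per tool or per provider.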

Pricing

Soundside prices provider-backed calls at pass-through cost plus a small margin (~10%), aiming roughly to break even. The editing engine and library are priced at $0.01/call; vision QA is $0.03.
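As a rough illustration of the flat rates, the sketch below estimates the cost of the flat-priced calls in a workflow. The constants are a snapshot of the figures quoted on this page, not a guarantee, and the function name is illustrative. Provider-backed generation calls are excluded because their prices vary.

```python
from decimal import Decimal

# Snapshot of the flat rates quoted in the text; live prices may differ.
EDIT_OR_LIBRARY_CALL = Decimal("0.01")
VISION_QA_CALL = Decimal("0.03")


def estimate_flat_cost(edit_or_library_calls: int, vision_qa_calls: int) -> Decimal:
    """Estimated USD cost of the flat-priced calls in a workflow."""
    return (edit_or_library_calls * EDIT_OR_LIBRARY_CALL
            + vision_qa_calls * VISION_QA_CALL)


# 12 edit/library calls plus 3 vision QA checks: 0.12 + 0.09
print(estimate_flat_cost(12, 3))  # 0.21
```

`Decimal` is used instead of floats so that small per-call amounts add up exactly.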

Live pricing is always available at:

GET https://mcp.soundside.ai/api/x402/status

This returns machine-readable per-tool, per-provider USDC prices. Prices are DB-driven and may change — always check the endpoint rather than hardcoding.
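One way to follow that advice without querying the endpoint on every call is a small time-limited cache. The sketch below assumes only that the endpoint returns JSON; the shape of the response is not documented here, so the cache stores it as-is. `PriceCache`, `fetch_status`, and the five-minute default are illustrative choices, not Soundside recommendations.

```python
import json
import time
import urllib.request
from typing import Any, Callable

STATUS_URL = "https://mcp.soundside.ai/api/x402/status"


def fetch_status(url: str = STATUS_URL, timeout: float = 10.0) -> Any:
    """Fetch and parse the live pricing document (assumed to be JSON)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


class PriceCache:
    """Re-fetch pricing only after ttl_seconds have elapsed."""

    def __init__(self, fetch: Callable[[], Any] = fetch_status,
                 ttl_seconds: float = 300.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._loaded = False
        self._value: Any = None
        self._fetched_at = 0.0

    def get(self) -> Any:
        now = self._clock()
        if not self._loaded or now - self._fetched_at >= self._ttl:
            self._value = self._fetch()
            self._fetched_at = now
            self._loaded = True
        return self._value
```

Calling `PriceCache().get()` returns the parsed pricing document, refreshing it at most once per TTL window. The `fetch` and `clock` parameters exist so the cache can be exercised without network access.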

x402: Pay-Per-Call with Crypto

No API key needed. Pay with USDC on Base (L2) per tool call via EIP-3009 transferWithAuthorization (off-chain signing, facilitator pays gas).

Network: eip155:8453 (Base mainnet)
Token: USDC
Facilitator: Coinbase CDP
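To make the signing step more concrete, the sketch below builds the EIP-712 typed data that an EIP-3009 `transferWithAuthorization` signature covers. The struct fields come from EIP-3009, and USDC uses 6 decimals. Everything else is an assumption to verify: the token contract address, the EIP-712 domain name and version, and the recipient must come from the server's payment requirements, and the function names are illustrative. Signing the data and packaging it into an x402 payment needs a wallet library and is not shown.

```python
import secrets
import time
from decimal import Decimal

USDC_DECIMALS = 6
BASE_CHAIN_ID = 8453  # eip155:8453, Base mainnet


def usdc_to_atomic(amount: str) -> int:
    """Convert a decimal USDC amount such as "0.01" to integer base units."""
    scaled = Decimal(amount).scaleb(USDC_DECIMALS)
    if scaled != scaled.to_integral_value():
        raise ValueError("amount has more than 6 decimal places")
    return int(scaled)


def build_transfer_authorization(payer: str, pay_to: str, amount_usdc: str,
                                 token_address: str, token_name: str,
                                 token_version: str,
                                 valid_for_seconds: int = 60) -> dict:
    """Build unsigned EIP-712 typed data for TransferWithAuthorization."""
    now = int(time.time())
    return {
        "types": {
            "EIP712Domain": [
                {"name": "name", "type": "string"},
                {"name": "version", "type": "string"},
                {"name": "chainId", "type": "uint256"},
                {"name": "verifyingContract", "type": "address"},
            ],
            "TransferWithAuthorization": [
                {"name": "from", "type": "address"},
                {"name": "to", "type": "address"},
                {"name": "value", "type": "uint256"},
                {"name": "validAfter", "type": "uint256"},
                {"name": "validBefore", "type": "uint256"},
                {"name": "nonce", "type": "bytes32"},
            ],
        },
        "primaryType": "TransferWithAuthorization",
        "domain": {
            "name": token_name,          # assumption: supplied by the server
            "version": token_version,    # assumption: supplied by the server
            "chainId": BASE_CHAIN_ID,
            "verifyingContract": token_address,
        },
        "message": {
            "from": payer,
            "to": pay_to,
            "value": usdc_to_atomic(amount_usdc),
            "validAfter": 0,
            "validBefore": now + valid_for_seconds,
            "nonce": "0x" + secrets.token_hex(32),  # random 32-byte nonce
        },
    }
```

Because the authorization is signed off-chain and submitted by the facilitator, the payer needs a USDC balance but no ETH for gas. A short `validBefore` window limits how long a leaked authorization stays usable.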

See x402 Guide for full setup.
