soundside.ai

resmi

MCP-native AI media generation with x402 pay-per-call. Image, video, audio, and music from 6 providers — composable via resource IDs. USDC on Base.

Soundside — Developer Documentation

AI Media Production Platform for Agents

Soundside exposes 19 MCP tools for generating, editing, composing, extracting, and analyzing media — images, video, audio, music, text, and business artifacts — plus LoRA adapter fine-tuning and server-side video composition. Connect any MCP client. Pay with an API key (credits) or crypto (x402 USDC on Base, no account needed).

Quick Start

# MCP endpoint
https://mcp.soundside.ai/mcp

# Auth: API key or x402 crypto payment
Authorization: Bearer <your-api-key>
POST https://mcp.soundside.ai/mcp
{"jsonrpc":"2.0","id":"1","method":"tools/list","params":{}}

Tools (19)

Generation

ToolWhat It DoesProviders
create_imageText-to-image, character referencesAlibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI
create_videoText-to-video, image-to-video, video extensionAlibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI (Veo 3.1)
create_audioTTS, sound effects, voice cloning, voice designMiniMax, Runway, Vertex AI
create_musicMusic from lyrics and style promptsMiniMax
create_textLLM chat completions, structured outputGrok, MiniMax, Vertex AI (Gemini)
create_artifactCharts, presentations, documents, diagrams; bundle mode for multi-artifact packagesplotly, pptx, docx, weasyprint, mermaid, gamma

Composition

ToolWhat It Does
compose_videoServer-side pipeline: enrich plan, generate assets in parallel, assemble with transitions, audio ducking, and overlays

Editing

ToolWhat It Does
edit_videoCore video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, custom FFmpeg
edit_audioMix, replace, or pad audio on existing media
compose_mediaAdd text, overlay media, or build split-screen composites
apply_effectKen Burns, speed ramp, film grain, vignette
extract_mediaExtract frames, frame sets, or audio tracks

Analysis

ToolWhat It DoesProviders
analyze_mediaTechnical metadata, vision QA, transcription, segment detection, EDL exportAnthropic, Grok, OpenAI, Qwen, Vertex (+ soundside.ai ffprobe)

Adapters (LoRA)

ToolWhat It DoesBackends
train_adapterTrain a LoRA adapter from library mediaDashScope (Wan), Modal (Hunyuan/LTX)
list_adaptersList your LoRA adapters
manage_adapterInspect, deploy, undeploy, delete, or select checkpoint

Library Management

ToolWhat It Does
lib_listBrowse projects, collections, resources, lineage, brand kits; query credit balance
lib_manageCRUD for projects, collections, resources, brand kits
lib_shareShare projects with other users by email

Pricing

Soundside aims to break even on provider pass-through costs with a small margin (~10%). The editing engine and library are priced at $0.01/call; vision QA is $0.03.

Live pricing is always available at:

GET https://mcp.soundside.ai/api/x402/status

This returns machine-readable per-tool, per-provider USDC prices. Prices are DB-driven and may change — always check the endpoint rather than hardcoding.

x402: Pay-Per-Call with Crypto

No API key needed. Pay with USDC on Base (L2) per tool call via EIP-3009 transferWithAuthorization (off-chain signing, facilitator pays gas).

Network: eip155:8453 (Base mainnet)
Token: USDC
Facilitator: Coinbase CDP

See x402 Guide for full setup.

Guides

Examples

Links

İlgili Sunucular

NotebookLM Web Importer

Web sayfalarını ve YouTube videolarını tek tıkla NotebookLM'e aktarın. 200.000'den fazla kullanıcı tarafından güveniliyor.

Chrome Eklentisini Yükle