Media Skills

ace-step
agentspace-so
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creative, audio, media
ace-step
doany-ai
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creative, media, audio
ai-avatar-video
qu-skills
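Create AI avatar and talking-head videos via the inference.sh CLI. Recommended: P-Video-Avatar (fastest, cheapest, built-in TTS). Also available: OmniHuman, Fabric, PixVerse. Audio: Inworld TTS-2 (100+ languages, character emotion steering), ElevenLabs, Kokoro. Capabilities: audio-driven avatars, text-to-avatar, lip-sync video, talking-head generation, virtual presenters, UGC content. Use for: AI presenters, explainer videos, virtual influencers, dubbing, marketing videos, UGC ads, game avatars...
video, creative, media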
ai-avatar-video
agentspace-so
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
video, creative, media
ai-avatar-video
runcomfy-com
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
video, creative, media
ai-avatar-video
doany-ai
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
video, creative, media
ai-image-generation
qu-skills
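Generate AI images via the inference.sh CLI with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve, and 50+ models. Models include: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, AI image, text to...
creative, media, image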
ai-image-generation
agentspace-so
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creative, media, image
ai-image-generation
doany-ai
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creative, media, image
ai-image-generation
runcomfy-com
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creative, media, image
ai-music
runcomfy-com
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creative, audio, media
ai-music
doany-ai
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creative, media, audio
ai-video-generation
qu-skills
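Generate AI videos via the inference.sh CLI with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok, and 40+ models. Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lip-sync, avatar animation, video upscaling, foley sound effects. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, AI video, ...
video, creative, media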
ai-video-generation
agentspace-so
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creative, video, media
ai-video-generation
runcomfy-com
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creative, video, media
ai-video-generation
doany-ai
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creative, video, media
character-design-sheet
qu-skills
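Keep AI-generated character images consistent using reference images and LoRA techniques. Covers turnaround views, expression sheets, color palettes, and style-consistency techniques. Use for: character design, game art, illustration, animation, comics, visual novels. Triggers: character design, character sheet, character consistency, character reference, turnaround, expression sheet, character art, consistent character, character concept, reference sheet, character creation, original character design...
creative, design, media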
ckm:design
nextlevelbuilder
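Comprehensive design skill: brand identity, design tokens, UI styling, logo generation (55 styles, Gemini AI), corporate identity program (50 deliverables, CIP mockups), HTML presentations (Chart.js), banner design (22 styles, social/ads/web/print), icon design (15 styles, SVG, Gemini 3.1 Pro), social photos (HTML→screenshot, multi-platform). Actions: design logo, create CIP, generate mockups, make slides, design banner, generate icons, create social photos, social media images, brand...
design, creative, media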
controlnet-pose
doany-ai
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creative, media, video
controlnet-pose
runcomfy-com
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creative, media, video
elevenlabs-music-generation
agentspace-so
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creative, audio, media
elevenlabs-music-generation
runcomfy-com
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creative, audio, media
elevenlabs-music-generation
doany-ai
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creative, audio, media
embedded-captions
heygen-com
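Add captions to talking-head videos. A catalog of 32 visual styles (CATALOG.md) built on two engines: column flow (captions composited into the scene: matte occlusion + blend modes; cream/ink/editorial/keynote/documentary/vivid/neon/glitch/chrome/speed) and themed composition (anchor/armory/terminal/neon sign/stardust/stomp/scoreboard/transit/VHS/arcade/archive/laser/thunder/hologram/bioluminescence/aurora/spectrum/papercut/popup/chalkboard/graffiti/brush/ink wash/ransom/end page/night city...
video, creative, media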
faceless-explainer
heygen-com
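faceless-explainer video workflow - arbitrary text (article / notes / topic / brief) -> narrator_scripts.json + audio (voice + background music) + section_plan.md -> typography / abstract graphics / charts / data-visualization video. Typical length is under about 3 minutes (sweet spot about 30-90 seconds); genuinely longer pieces belong to general-video, not this workflow. Generates its own narration (TTS); it does not sync to user-supplied or pre-recorded voiceover (that is general-video). Does not include website capture, real product screenshots....
video, creative, media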
flux-2-klein
doany-ai
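Generate images with Flux 2 Klein (Black Forest Labs' distilled fast variant of Flux 2) on RunComfy. Bundles the model's documented prompting patterns so the skill gets higher-quality output than naively prompting the same model. Documents Flux 2 Klein's strengths (sub-second latency, multi-reference brand styling, declarative subject-first prompting), step-count strategy (4–8 steps for fast iteration, about 25 steps for refinement), the 9B vs 4B variant trade-offs, and when to route to Flux 2 Pro /...
creative, image, media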
general-video
heygen-com
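Fallback for authoring custom HyperFrames HTML video compositions when no dedicated workflow matches. Covers longer or multi-scene pieces, brand / promo shorts, montages, title cards, long-form motion posters, static loops, and free-form compositions of any length or format. Not for marketing product promos (product-launch-video), general website-to-video capture (website-to-video), topic explainers (faceless-explainer), GitHub PR videos (pr-to-video), or captioning existing footage.
video, creative, media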
gpt-image
qu-skills
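Generate and edit images with OpenAI GPT-Image-2 via the inference.sh CLI. Model: GPT-Image-2. Capabilities: text-to-image, image editing, inpainting, mask-based editing, multi-image reference, batch generation. Use for: product mockups, marketing visuals, image editing, concept art, inpainting, photo manipulation. Triggers: gpt image, gpt-image-2, openai image, chatgpt image, dall-e, dalle, openai image generation, gpt image edit, gpt inpainting, openai dall-e, gpt 4o image
creative, image, media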
gpt-image-2
runcomfy-com
Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...
creative, image, media
happyhorse
qu-skills
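Generate and edit videos with Alibaba's HappyHorse 1.0 models via the inference.sh CLI. Models: HappyHorse T2V, I2V, R2V, Video Edit. Capabilities: text-to-video, image-to-video, reference-to-video, natural-language video editing, character preservation, 720P/1080P, up to 15 seconds. Use for: physically realistic video, video editing, character-consistent content, product demos, social media. Triggers: happyhorse, happy horse, alibaba video, happyhorse 1.0, dashscope video, alibaba...
creative, video, media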
happyhorse-1-0
agentspace-so
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creative, video, media
happyhorse-1-0
runcomfy-com
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creative, video, media
happyhorse-1-0
doany-ai
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creative, video, media
higgsfield-generate
higgsfield-ai
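Generate images / videos with Higgsfield AI. Defaults: GPT Image 2 for images / design / text, Seedance 2.0 for video, Nano Banana 2/Pro for character / reference-image work, Marketing Studio for ads (with avatars / products / hooks), settings, plus Soul V2/Cinema/Cast/Location and Kling 3.0. Use when: "generate an image", "make a video", "animate this photo", "image to video", "edit / stylize / remix this image", "make a short film", "create an ad", "make a UGC video", "product showcase", "unboxing", "brand video"...
creative, media, video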
higgsfield-soul-id
higgsfield-ai
Train a Soul Character — a personalized model on a person's face that Higgsfield uses for identity-faithful image and video generation. Use when: "create my Soul", "train my face", "make my digital twin", "build me an avatar", "learn my appearance", "create a character of me", "set up identity for video", "I want my face in generated images". Chain: train Soul (one-time, returns reference_id) → use in higgsfield-generate via `--soul-id ` with models like `text2image_soul_v2` or...
creative, media, video
hyperframes-core
heygen-com
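HyperFrames HTML composition contract. Use for composition structure, data attributes, clips, tracks, sub-compositions, variables, media playback, deterministic rendering rules, and validation of a minimal renderable project.
development, media, creative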
hyperframes-media
heygen-com
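Asset preprocessing for HyperFrames compositions: multi-provider TTS (HeyGen / ElevenLabs / local Kokoro), multi-provider BGM (Google Lyria / local MusicGen), Whisper transcription, background removal, and caption authoring. Use for npx hyperframes tts, bgm, transcribe, remove-background, voice / provider selection, music mood prompts, and caption / subtitle / lyric / karaoke / word-by-word styling.
media, audio, video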
hyperframes-read-first
heygen-com
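Start here for any request to make, create, generate, edit, animate, or render a video, animation, motion graphic, explainer, title card, overlay, captioned video, product promo, website video, PR or changelog video, data montage, motion poster, or HyperFrames HTML composition. Use this before other video or animation skills when the user wants HyperFrames to author or render a finished MP4 / web video, to choose a workflow, or to route among product-launch video, faceless explainer, website-to-video, and so on.
creative, video, media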
Image Enhancer
composiohq
This skill improves your images and screenshots, making them sharper, clearer, and more professional.
media
image-edit
agentspace-so
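Edit images on RunComfy. This skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Nano Banana Edit (batches of up to 20 images, identity preserved by default), OpenAI GPT Image 2 Edit (multilingual in-image text rewriting, multi-reference compositing, layout precision), Flux Kontext Pro (single-reference high-fidelity local edits), or Z-Image Turbo Inpaint (mask-driven precise region edits). Bundles each model's documented prompting patterns so the skill can...
creative, image, media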
image-edit
doany-ai
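Edit images on RunComfy. This skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Nano Banana Edit (batches of up to 20 images, identity preserved by default), OpenAI GPT Image 2 Edit (multilingual in-image text rewriting, multi-reference compositing, layout precision), Flux Kontext Pro (single-reference high-fidelity local edits), or Z-Image Turbo Inpaint (mask-driven precise region edits). Bundles each model's documented prompting patterns so the skill can...
creative, image, media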
image-edit
runcomfy-com
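Edit images on RunComfy. This skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Nano Banana Edit (batches of up to 20 images, identity preserved by default), OpenAI GPT Image 2 Edit (multilingual in-image text rewriting, multi-reference compositing, layout precision), Flux Kontext Pro (single-reference high-fidelity local edits), or Z-Image Turbo Inpaint (mask-driven precise region edits). Bundles each model's documented prompting patterns so the skill can...
creative, image, media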
image-inpainting
agentspace-so
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creative, image, media
image-inpainting
runcomfy-com
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creative, image, media
image-inpainting
doany-ai
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creative, image, media
image-outpainting
agentspace-so
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creative, image, media
image-outpainting
doany-ai
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creative, image, media
image-outpainting
runcomfy-com
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creative, image, media
image-to-video
agentspace-so
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creative, video, media
image-to-video
runcomfy-com
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creative, video, media
image-to-video
doany-ai
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creative, video, media
image-to-video
qu-skills
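Guide to still-to-video conversion: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, and Grok Video, and when to use each. Use for: animating images, making videos from stills, adding motion effects, product animation. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image
creative, video, media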
kling-3-0
agentspace-so
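Kling 3.0 video generation on RunComfy. Kling 3.0 (also known as Kling V3.0) is Kuaishou Technology's third-generation multi-shot video model, with native synchronized audio and consistent character identity across shots. This skill covers all six Kling 3.0 endpoints, spanning three rendering tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video). Runs `runcomfy run kling/kling-3.0/ /` through the local RunComfy CLI. Triggers on "kling", "kling 3.0", "kling v3", "kling pro", and so on.
creative, video, media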
kling-3-0
doany-ai
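Kling 3.0 video generation on RunComfy. Kling 3.0 (also known as Kling V3.0) is Kuaishou Technology's third-generation multi-shot video model, with native synchronized audio and consistent character identity across shots. This skill covers all six Kling 3.0 endpoints, spanning three rendering tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video). Runs `runcomfy run kling/kling-3.0/ /` through the local RunComfy CLI. Triggers on "kling", "kling 3.0", "kling v3", "kling pro", and so on.
video, creative, media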
kling-3-0
runcomfy-com
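Kling 3.0 video generation on RunComfy. Kling 3.0 (also known as Kling V3.0) is Kuaishou Technology's third-generation multi-shot video model, with native synchronized audio and consistent character identity across shots. This skill covers all six Kling 3.0 endpoints, spanning three rendering tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video). Runs `runcomfy run kling/kling-3.0/ /` through the local RunComfy CLI. Triggers on "kling", "kling 3.0", "kling v3", "kling pro", and so on.
video, creative, media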
lipsync
runcomfy-com
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to-video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent: portrait still + audio (avatar-style), source video + audio (mouth-swap on existing footage), or...
creative, video, media
lipsync
doany-ai
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to-video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent: portrait still + audio (avatar-style), source video + audio (mouth-swap on existing footage), or...
creative, video, media
nano-banana-2
agentspace-so
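Generate images with Google Nano Banana 2 (Gemini-family flash-tier text-to-image) on RunComfy. Bundles the model's documented prompting patterns so the skill produces sharper output than naively prompting the same model. Documents Nano Banana 2's strengths (fast iteration, in-image text rendering, predictable composition, optional web-grounded context), resolution-tier pricing, safety-tolerance tuning, and when to route to Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream...
creative, image, media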
p-image
qu-skills
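Generate images with Pruna P-Image models via the inference.sh CLI. Models: P-Image, P-Image-LoRA, P-Image-Edit, P-Image-Edit-LoRA. Capabilities: text-to-image, image editing, LoRA styles, multi-image compositing, fast inference. Pruna optimizes models for speed without compromising quality. Triggers: pruna, p-image, pruna image, fast image generation, optimized flux, pruna ai, p image, fast ai image, economic image generation, cheap image generation
creative, image, media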
p-video
qu-skills
Generate videos with Pruna P-Video and WAN models via the inference.sh CLI. Models: P-Video, WAN-T2V, WAN-I2V. Capabilities: text-to-video, image-to-video, audio support, 720p/1080p, fast inference. Pruna optimizes models for speed without sacrificing quality. Triggers: pruna video, p-video, pruna ai video, fast video generation, optimized video, wan t2v, wan i2v, economic video generation, cheap video generation, pruna text to video, pruna image to video
video, creative, media
p-video-avatar
qu-skills
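Generate talking-avatar videos with Pruna P-Video-Avatar via the inference.sh CLI. Turns a portrait photo into a realistic talking video with built-in TTS. 18× faster and 6× cheaper than competitors. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing...
video, creative, media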
pexo-agent
pexoai
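AI video generation skill that automatically selects among 10+ models including Seedance 2, Kling 3.0, and HappyHorse. Produces finished multi-shot videos (5–120 seconds) from text, images, URLs, scripts, or audio, with AI music, lip-sync, and multi-shot sequencing. No prompt writing or model selection needed. Use for: video production, AI video, making videos, product videos, brand videos, promo shorts, explainer videos, short-form video, TikTok videos, Instagram Reels, YouTube Shorts, product ads...
creative, video, media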
relight
agentspace-so
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creative, image, media
relight
doany-ai
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creative, image, media
relight
runcomfy-com
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creative, image, media
runcomfy-cli
agentspace-so
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creative, media, api
runcomfy-cli
runcomfy-com
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creative, media, api
runcomfy-cli
doany-ai
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creative, media, api
seedance
qu-skills
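Generate videos with ByteDance Seedance 2.0 via the inference.sh CLI. A unified model supporting text-to-video, image-to-video, and reference-to-video, with synchronized audio, up to 1080p, 4-15 seconds. Available in Pro and Fast variants. The Studio variant has a private asset library for consistent characters. Use for: social media videos, music videos, product demos, animated content, AI video with sound. Triggers: seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio...
creative, video, media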
seedance-v2
agentspace-so
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
creative, video, media
seedance-v2
doany-ai
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
video, creative, media
seedance-v2
runcomfy-com
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
creative, video, media
storyboard-creation
qu-skills
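Film and video storyboarding with shot vocabulary, continuity rules, and panel layout. Covers shot types, camera angles, camera movement, the 180-degree rule, and annotation format. Use for: video planning, film pre-production, ad storyboards, music video planning, animation. Triggers: storyboard, storyboarding, shot list, film planning, video planning, pre production, shot composition, camera angles, scene planning, visual script, animatic, storyboard panels, video storyboard
creative, media, video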
text-to-lottie
diffusionstudio
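Author a Lottie (Bodymovin) JSON animation that renders in the local Skia player. Use whenever the user asks to create, generate, edit, or fix a Lottie animation, or asks to load an "animation".
creative, design, media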
video
coreyhaines31
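When the user wants to create, generate, or produce video content with AI tools or programmatic frameworks. Also use when the user mentions "video production", "AI video", "Remotion", "Hyperframes", "HeyGen", "Synthesia", "Veo", "Sora", "Runway", "Kling", "Seedance", "Hailuo", "MiniMax", "Pika", "Hunyuan", "Wan", "video generation", "AI avatar", "talking-head video", "programmatic video", "video template", "explainer video", "product demo video", "video pipeline", or "make me a video". Use...
video, creative, media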
video-edit
agentspace-so
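Edit existing video on RunComfy. This skill is a smart router that matches user intent to the right editing model in the RunComfy catalog. Picks Wan 2.7 Edit-Video (general restyle / background swap / packaging swap, preserving identity and motion), Kling 2.6 Pro Motion Control (transfers the exact motion of a reference video onto a target character), or Lucy Edit Restyle (lightweight, identity-stable restyle / outfit swap). Consolidates each model's prompting patterns so that the skill...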
videocreativemedia
video-edit
runcomfy-com
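Edit existing video on RunComfy. This skill is a smart router that matches user intent to the right editing model in the RunComfy catalog. Picks Wan 2.7 Edit-Video (general restyle / background swap / packaging swap, preserving identity and motion), Kling 2.6 Pro Motion Control (transfers the exact motion of a reference video onto a target character), or Lucy Edit Restyle (lightweight, identity-stable restyle / outfit swap). Consolidates each model's prompting patterns so that the skill...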
videocreativemedia
video-edit
doany-ai
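Edit existing video on RunComfy. This skill is a smart router that matches user intent to the right editing model in the RunComfy catalog. Picks Wan 2.7 Edit-Video (general restyle / background swap / packaging swap, preserving identity and motion), Kling 2.6 Pro Motion Control (transfers the exact motion of a reference video onto a target character), or Lucy Edit Restyle (lightweight, identity-stable restyle / outfit swap). Consolidates each model's prompting patterns so that the skill...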
videocreativemedia
video-extend
agentspace-so
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-extend
doany-ai
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-extend
runcomfy-com
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-inpainting
agentspace-so
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-inpainting
runcomfy-com
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-inpainting
doany-ai
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-outpainting
agentspace-so
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
video-outpainting
doany-ai
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
video-outpainting
runcomfy-com
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
wan-2-7
agentspace-so
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wan-2-7
runcomfy-com
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wan-2-7
doany-ai
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wonda-cli
degausai
Generate images, video, music, and audio from the terminal with the Wonda CLI, with support for research and automation on LinkedIn, Reddit, and X/Twitter.
creativemediaresearch
YouTube Transcript Downloader
michalparkola
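Download YouTube video transcripts when the user provides a YouTube URL or asks to download, get, or fetch a transcript from YouTube. Also applies when the user wants to transcribe a YouTube video or get its subtitles/closed captions.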
mediayoutubefeatured