媒体技能
ace-step
agentspace-so
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creativeaudiomedia
ace-step
doany-ai
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creativemediaaudio
ai-avatar-video
qu-skills
通过inference.sh CLI创建AI虚拟形象和说话头像视频。推荐:P-Video-Avatar(最快、最便宜、内置TTS)。其他选项:OmniHuman、Fabric、PixVerse。音频:Inworld TTS-2(支持100多种语言、角色情感控制)、ElevenLabs、Kokoro。功能:音频驱动虚拟形象、文本转虚拟形象、唇形同步视频、说话头像生成、虚拟主持人、UGC内容。用途:AI主持人、解说视频、虚拟网红、配音、营销视频、UGC广告、游戏虚拟形象……
videocreativemedia
ai-avatar-video
agentspace-so
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-avatar-video
runcomfy-com
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-avatar-video
doany-ai
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-image-generation
qu-skills
通过 inference.sh CLI 使用 GPT-Image-2、FLUX、Gemini、Grok、Seedream、Reve 及 50 多个模型生成 AI 图像。模型包括:GPT-Image-2、FLUX Dev LoRA、FLUX.2 Klein LoRA、Gemini 3 Pro Image、Grok Imagine、Seedream 4.5、Reve、ImagineArt。功能:文本转图像、图像转图像、修复、LoRA、图像编辑、放大、文本渲染。用途:AI 艺术、产品模型、概念艺术、社交媒体图形、营销视觉、插图。触发词:flux、图像生成、AI 图像、文本转...
creativemediaimage
ai-image-generation
agentspace-so
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-image-generation
doany-ai
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-image-generation
runcomfy-com
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-music
runcomfy-com
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creativeaudiomedia
ai-music
doany-ai
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creativemediaaudio
ai-video-generation
qu-skills
通过 inference.sh CLI,使用 Google Veo、Seedance 2.0、HappyHorse、Wan、Grok 及 40 多个模型生成 AI 视频。模型包括:Veo 3.1、Veo 3、Seedance 2.0、HappyHorse 1.0、Wan 2.5、Grok Imagine Video、OmniHuman、Fabric、HunyuanVideo。功能涵盖:文本转视频、图像转视频、参考转视频、视频编辑、唇形同步、虚拟形象动画、视频增强、拟音音效。适用于:社交媒体视频、营销内容、解说视频、产品演示、AI 虚拟形象。触发词:视频生成、AI 视频……
videocreativemedia
ai-video-generation
agentspace-so
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
ai-video-generation
runcomfy-com
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
ai-video-generation
doany-ai
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
character-design-sheet
qu-skills
利用参考图和LoRA技术实现AI生成图像中角色的一致性。涵盖转面视图、表情表、配色方案及风格一致性技巧。适用于:角色设计、游戏美术、插画、动画、漫画、视觉小说。触发词:角色设计、角色表、角色一致性、角色参考、转面图、表情表、角色美术、一致角色、角色概念、参考图、角色创建、OC设计……
creativedesignmedia
ckm:design
nextlevelbuilder
全面设计技能:品牌识别、设计令牌、UI样式、标志生成(55种风格,Gemini AI)、企业识别方案(50项交付物,CIP样机)、HTML演示文稿(Chart.js)、横幅设计(22种风格,社交/广告/网页/印刷)、图标设计(15种风格,SVG,Gemini 3.1 Pro)、社交照片(HTML→截图,多平台)。操作:设计标志、创建CIP、生成样机、制作幻灯片、设计横幅、生成图标、创建社交照片、社交媒体图像、品牌……
designcreativemedia
controlnet-pose
doany-ai
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creativemediavideo
controlnet-pose
runcomfy-com
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creativemediavideo
elevenlabs-music-generation
agentspace-so
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
elevenlabs-music-generation
runcomfy-com
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
elevenlabs-music-generation
doany-ai
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
embedded-captions
heygen-com
为说话人视频添加字幕。一个包含32种视觉风格的目录(CATALOG.md),基于两种引擎:列流(字幕合成到场景中——遮罩遮挡+混合模式;奶油/墨水/编辑/主题演讲/纪录片/响亮/霓虹/故障/铬/速度)和主题构成(锚点/军械/终端/霓虹灯/星尘/跺脚/记分牌/交通/VHS/街机/档案/激光/雷声/全息/生物光/极光/光谱/剪纸/弹出/黑板/涂鸦/画笔/水墨/勒索/最后一页/夜城...
videocreativemedia
faceless-explainer
heygen-com
faceless-explainer 视频工作流程 - 任意文本(文章/笔记/主题/简报)→ narrator_scripts.json + 音频(语音+背景音乐)+ section_plan.md → 排版/抽象图形/图表/数据可视化视频。典型时长约3分钟以内(最佳时长约30-90秒);真正较长的内容属于通用视频,不适用此工作流程。生成自身的旁白(TTS)——不与用户提供/预先录制的画外音同步(那是通用视频)。不涉及网站抓取,无真实产品截图……
videocreativemedia
flux-2-klein
doany-ai
在RunComfy上使用Flux 2 Klein(Black Forest Labs的Flux 2蒸馏快速变体)生成图像——该技能内置了模型文档中的提示模式,因此相比对同一模型进行简单提示,能获得更清晰的输出。文档说明了Flux 2 Klein的优势(亚秒级延迟、多参考品牌风格、声明式主语优先提示)、步数策略(快速迭代用4–8步,精修用约25步)、9B与4B变体的权衡,以及何时转向Flux 2 Pro/...
creativeimagemedia
general-video
heygen-com
用作自定义HyperFrames HTML视频合成创作的后备方案,适用于无专门工作流程的场景。涵盖较长或多场景作品、品牌/宣传片、蒙太奇、标题卡、动态海报、静态循环以及任意长度或格式的自由创作。不适用于营销产品推广(product-launch-video)、通用网站转视频(website-to-video)、主题解说(faceless-explainer)、GitHub PR视频(pr-to-video)、为现有素材添加字幕等场景。
videocreativemedia
gpt-image
qu-skills
通过inference.sh CLI使用OpenAI GPT-Image-2生成和编辑图像。模型:GPT-Image-2。功能:文本转图像、图像编辑、修复、基于遮罩的编辑、多图像参考、批量生成。用途:产品模型、营销视觉、图像编辑、概念艺术、修复、照片处理。触发词:gpt image、gpt-image-2、openai image、chatgpt image、dall-e、dalle、openai image generation、gpt image edit、gpt inpainting、openai dall-e、gpt 4o image
creativeimagemedia
gpt-image-2
runcomfy-com
Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...
creativeimagemedia
happyhorse
qu-skills
通过inference.sh命令行界面,使用阿里巴巴HappyHorse 1.0模型生成和编辑视频。模型包括:HappyHorse T2V、I2V、R2V、Video Edit。功能:文本转视频、图像转视频、参考转视频、自然语言视频编辑、角色保留、720P/1080P分辨率、最长15秒。用途:物理逼真视频、视频编辑、角色一致性内容、产品演示、社交媒体。触发词:happyhorse、happy horse、alibaba video、happyhorse 1.0、dashscope video、alibaba...
creativevideomedia
happyhorse-1-0
agentspace-so
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
happyhorse-1-0
runcomfy-com
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
happyhorse-1-0
doany-ai
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
higgsfield-generate
higgsfield-ai
通过Higgsfield AI生成图像/视频。默认:GPT Image 2用于图像/设计/文字,Seedance 2.0用于视频,Nano Banana 2/Pro用于角色/参考图像工作,Marketing Studio用于包含头像/产品/钩子的广告、设置,以及Soul V2/Cinema/Cast/Location和Kling 3.0。使用场景:“生成一张图像”、“制作一段视频”、“让这张照片动起来”、“图像转视频”、“编辑/风格化/混音这张图像”、“制作一个短片”、“创建一则广告”、“制作UGC视频”、“产品演示”、“开箱”、“品牌视频”……
creativemediavideo
higgsfield-soul-id
higgsfield-ai
Train a Soul Character — a personalized model on a person's face that Higgsfield uses for identity-faithful image and video generation. Use when: "create my Soul", "train my face", "make my digital twin", "build me an avatar", "learn my appearance", "create a character of me", "set up identity for video", "I want my face in generated images". Chain: train Soul (one-time, returns reference_id) → use in higgsfield-generate via `--soul-id ` with models like `text2image_soul_v2` or...
creativemediavideo
hyperframes-core
heygen-com
HyperFrames HTML 组合合约。用于组合结构、数据属性、剪辑、轨道、子组合、变量、媒体播放、确定性渲染规则以及最小可渲染项目的验证。
developmentmediacreative
hyperframes-media
heygen-com
为HyperFrames合成提供资产预处理——多供应商TTS(HeyGen / ElevenLabs / Kokoro本地)、多供应商BGM(Google Lyria / 本地MusicGen)、Whisper转录、背景移除及字幕编写。用于npx hyperframes tts、bgm、transcribe、remove-background、语音/供应商选择、音乐情绪提示、字幕/副标题/歌词/卡拉OK/逐词样式。
mediaaudiovideo
hyperframes-read-first
heygen-com
对于任何制作、创建、生成、编辑、动画化或渲染视频、动画、动态图形、解说视频、标题卡、叠加层、带字幕视频、产品宣传片、网站视频、公关或更新日志视频、数据蒙太奇、动态海报或HyperFrames HTML合成的请求,请从此处开始。当用户希望HyperFrames创作或渲染完成的MP4/网络视频、选择工作流程,或在产品发布视频、无脸解说视频、网站转视频等之间路由时,请在其他视频或动画技能之前使用此功能。
creativevideomedia
Image Enhancer
composiohq
该技能可处理您的图片和截图,使其更清晰、更锐利、更专业。
media
image-edit
agentspace-so
在RunComfy上编辑图像——该技能是一个智能路由,能将用户意图匹配到RunComfy目录中的正确编辑模型。可选择Nano Banana Edit(最多批量处理20张,默认保留身份特征)、OpenAI GPT Image 2 Edit(多语言图像内文本重写、多参考合成、布局精准)、Flux Kontext Pro(单参考高保真局部编辑)或Z-Image Turbo Inpaint(遮罩驱动的精准区域编辑)。该技能整合了每个模型的文档化提示模式,从而...
creativeimagemedia
image-edit
doany-ai
在RunComfy上编辑图像——该技能是一个智能路由,能将用户意图匹配到RunComfy目录中的正确编辑模型。可选择Nano Banana Edit(最多批量处理20张,默认保留身份特征)、OpenAI GPT Image 2 Edit(多语言图像内文本重写、多参考合成、布局精准)、Flux Kontext Pro(单参考高保真局部编辑)或Z-Image Turbo Inpaint(遮罩驱动的精准区域编辑)。该技能整合了每个模型的文档化提示模式,从而...
creativeimagemedia
image-edit
runcomfy-com
在RunComfy上编辑图像——该技能是一个智能路由,能将用户意图匹配到RunComfy目录中的正确编辑模型。可选择Nano Banana Edit(最多批量处理20张,默认保留身份特征)、OpenAI GPT Image 2 Edit(多语言图像内文本重写、多参考合成、布局精准)、Flux Kontext Pro(单参考高保真局部编辑)或Z-Image Turbo Inpaint(遮罩驱动的精准区域编辑)。该技能整合了每个模型文档中的提示模式,从而...
creativeimagemedia
image-inpainting
agentspace-so
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-inpainting
runcomfy-com
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-inpainting
doany-ai
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-outpainting
agentspace-so
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-outpainting
doany-ai
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-outpainting
runcomfy-com
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-to-video
agentspace-so
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
image-to-video
runcomfy-com
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
image-to-video
doany-ai
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
image-to-video
qu-skills
静态转视频指南:模型选择、运动提示与镜头移动。涵盖Wan 2.5 i2v、Seedance、Fabric、Grok Video及各自适用场景。用途:图像动画化、从静态图像创建视频、添加运动效果、产品动画。触发词:image to video、i2v、animate image、still to video、add motion to image、image animation、photo to video、animate still、wan i2v、image2video、bring image to life、animate photo、motion from image
creativevideomedia
kling-3-0
agentspace-so
RunComfy上的Kling 3.0视频生成。Kling 3.0(也称Kling V3.0)是快手科技第三代多镜头视频模型,具备原生同步音频和跨镜头一致的角色身份。该技能涵盖全部六个Kling 3.0端点,覆盖三种渲染等级(标准、专业、4K)和两种模式(文生视频、图生视频)。通过本地RunComfy CLI调用runcomfy run kling/kling-3.0/ /。触发词为"kling"、"kling 3.0"、"kling v3"、"kling pro"等。
creativevideomedia
kling-3-0
doany-ai
RunComfy上的Kling 3.0视频生成。Kling 3.0(也称Kling V3.0)是快手科技第三代多镜头视频模型,具备原生同步音频及跨镜头一致的角色身份。该技能覆盖全部六个Kling 3.0端点,涵盖三种渲染级别(标准、专业、4K)和两种模式(文生视频、图生视频)。通过本地RunComfy CLI调用runcomfy run kling/kling-3.0/ /。触发词为"kling"、"kling 3.0"、"kling v3"、"kling pro"等。
videocreativemedia
kling-3-0
runcomfy-com
RunComfy上的Kling 3.0视频生成。Kling 3.0(也称Kling V3.0)是快手科技第三代多镜头视频模型,具备原生同步音频和跨镜头一致的角色身份。该技能涵盖全部六个Kling 3.0端点,覆盖三种渲染等级(标准、专业、4K)和两种模式(文生视频、图生视频)。通过本地RunComfy CLI调用runcomfy run kling/kling-3.0/ /。触发词为"kling"、"kling 3.0"、"kling v3"、"kling pro"等。
videocreativemedia
lipsync
runcomfy-com
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or...
creativevideomedia
lipsync
doany-ai
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or...
creativevideomedia
nano-banana-2
agentspace-so
使用RunComfy上的Google Nano Banana 2(Gemini系列闪级文生图模型)生成图像——该技能内置了模型文档中的提示模式,因此相比直接使用同一模型进行简单提示,能获得更精准的输出。文档说明了Nano Banana 2的优势(快速迭代、图像内文字渲染、可预测构图、可选网络上下文支持)、分辨率层级定价、安全容忍度调节,以及何时转向Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream等模型。
creativeimagemedia
p-image
qu-skills
通过 inference.sh CLI 使用 Pruna P-Image 模型生成图像。模型:P-Image、P-Image-LoRA、P-Image-Edit、P-Image-Edit-LoRA。功能:文本到图像、图像编辑、LoRA 风格、多图像合成、快速推理。Pruna 在不损失质量的前提下优化模型速度。触发词:pruna、p-image、pruna image、fast image generation、optimized flux、pruna ai、p image、fast ai image、economic image generation、cheap image generation
creativeimagemedia
p-video
qu-skills
通过 inference.sh CLI 使用 Pruna P-Video 和 WAN 模型生成视频。模型:P-Video、WAN-T2V、WAN-I2V。功能:文本转视频、图像转视频、音频支持、720p/1080p、快速推理。Pruna 在不损失质量的前提下优化模型速度。触发词:pruna video、p-video、pruna ai video、fast video generation、optimized video、wan t2v、wan i2v、economic video generation、cheap video generation、pruna text to video、pruna image to video
videocreativemedia
p-video-avatar
qu-skills
通过inference.sh CLI使用Pruna P-Video-Avatar生成说话头像视频。将肖像图像转化为逼真的说话视频,内置TTS功能。速度比竞品快18倍,成本低6倍。模型:P-Video-Avatar、P-Image(用于肖像生成)。功能:文本转头像、音频驱动头像、30种语音、10种语言、720p/1080p、内置TTS、动态背景、全身控制。用途:AI主播、产品演示、解说视频、虚拟网红、营销……
videocreativemedia
pexo-agent
pexoai
AI视频生成技能,自动在Seedance 2、Kling 3.0、HappyHorse及10余个模型间选择。可从文本、图片、URL、脚本或音频生成完整多镜头视频(5–120秒),包含AI音乐、唇形同步和多镜头序列。无需编写提示词,无需选择模型。用途:视频制作、AI视频、制作视频、产品视频、品牌视频、宣传短片、解说视频、短视频、TikTok视频、Instagram Reel、YouTube Short、产品广告……
creativevideomedia
relight
agentspace-so
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
relight
doany-ai
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
relight
runcomfy-com
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
runcomfy-cli
agentspace-so
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
runcomfy-cli
runcomfy-com
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
runcomfy-cli
doany-ai
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
seedance
qu-skills
通过 inference.sh CLI 使用字节跳动 Seedance 2.0 生成视频。统一模型支持文生视频、图生视频和参考生视频,同步音频,最高1080p,时长4-15秒。提供专业版和快速版。工作室版配备私有资产库,实现人像一致性。适用于:社交媒体视频、音乐视频、产品演示、动画内容、带声音的AI视频。触发词:seedance, seedance 2, bytedance video, seedance t2v, seedance i2v, seedance r2v, video with audio,...
creativevideomedia
seedance-v2
agentspace-so
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
creativevideomedia
seedance-v2
doany-ai
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
videocreativemedia
seedance-v2
runcomfy-com
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
creativevideomedia
storyboard-creation
qu-skills
影视与视频分镜设计,涵盖镜头术语、连续性规则及分格布局。包括镜头类型、摄影角度、运镜方式、180度法则及标注格式。适用于:视频策划、电影前期制作、广告分镜、音乐视频策划、动画制作。触发词:分镜、分镜设计、镜头列表、电影策划、视频策划、前期制作、镜头构图、摄影角度、场景规划、视觉脚本、动态分镜、分镜画格、视频分镜
creativemediavideo
text-to-lottie
diffusionstudio
编写一个可在本地Skia播放器中渲染的Lottie(Bodymovin)JSON动画。当用户要求创建、生成、编辑或修复Lottie动画,或要求加载“动画”时使用。
creativedesignmedia
video
coreyhaines31
当用户希望使用AI工具或程序化框架创建、生成或制作视频内容时。也包括用户提及“视频制作”、“AI视频”、“Remotion”、“Hyperframes”、“HeyGen”、“Synthesia”、“Veo”、“Sora”、“Runway”、“Kling”、“Seedance”、“Hailuo”、“MiniMax”、“Pika”、“Hunyuan”、“Wan”、“视频生成”、“AI数字人”、“说话头像视频”、“程序化视频”、“视频模板”、“解说视频”、“产品演示视频”、“视频流水线”或“给我做个视频”等场景。使用...
videocreativemedia
video-edit
agentspace-so
编辑RunComfy上的现有视频——此技能是一个智能路由器,将用户意图匹配到RunComfy目录中的正确编辑模型。选择Wan 2.7 Edit-Video(通用重风格化/背景替换/包装替换,保留身份+动作)、Kling 2.6 Pro Motion Control(将参考视频的精确动作迁移到目标角色)或Lucy Edit Restyle(轻量级身份稳定重风格化/服装替换)。整合每个模型记录的提示模式,使该技能...
videocreativemedia
video-edit
runcomfy-com
编辑RunComfy上的现有视频——此技能是一个智能路由器,将用户意图匹配到RunComfy目录中的正确编辑模型。选择Wan 2.7 Edit-Video(通用重风格化/背景替换/包装替换,保留身份与动作)、Kling 2.6 Pro Motion Control(将参考视频的精确动作迁移至目标角色)或Lucy Edit Restyle(轻量级身份稳定重风格化/服装替换)。整合各模型文档化的提示模式,使该技能...
videocreativemedia
video-edit
doany-ai
编辑RunComfy上的现有视频——此技能是一个智能路由,能将用户意图匹配到RunComfy目录中的正确编辑模型。选择Wan 2.7 Edit-Video(通用重风格化/背景替换/包装替换,保留身份与动作)、Kling 2.6 Pro Motion Control(将参考视频的精确动作迁移至目标角色)或Lucy Edit Restyle(轻量级身份稳定重风格化/服装替换)。整合各模型文档化的提示模式,使该技能...
videocreativemedia
video-extend
agentspace-so
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-extend
doany-ai
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-extend
runcomfy-com
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-inpainting
agentspace-so
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-inpainting
runcomfy-com
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-inpainting
doany-ai
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-outpainting
agentspace-so
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
video-outpainting
doany-ai
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
video-outpainting
runcomfy-com
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
wan-2-7
agentspace-so
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wan-2-7
runcomfy-com
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wan-2-7
doany-ai
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia
wonda-cli
degausai
通过终端使用Wonda CLI生成图像、视频、音乐和音频,并进行LinkedIn、Reddit和X/Twitter的研究与自动化操作
creativemediaresearch
YouTube Transcript Downloader
michalparkola
当用户提供YouTube网址或要求从YouTube下载/获取/抓取转录文本时,下载YouTube视频转录文本。也用于用户想要转录或获取YouTube视频的字幕/隐藏式字幕时。
mediayoutubefeatured