D

Doany Ai Skills

ace-step
doany-ai
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creativemediaaudio
ai-avatar-video
doany-ai
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-image-generation
doany-ai
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-music
doany-ai
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creativemediaaudio
ai-video-generation
doany-ai
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
controlnet-pose
doany-ai
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creativemediavideo
elevenlabs-music-generation
doany-ai
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
face-swap
doany-ai
Swap a face / character into video or images on RunComfy via the `runcomfy` CLI. Routes across community Wan 2-2 Animate (audio-driven character animation + identity swap), GPT Image 2 Edit (single-shot precise face swap on still images via reference composition), Nano Banana Edit (batch identity-preserving swap), Flux Kontext (single-ref high-fidelity local face edit), and Kling 2-6 Motion Control Pro (transfer motion from one performance onto a target character). Picks the right model for...
creativevideoimage
flux-2-klein
doany-ai
Generiere Bilder mit Flux 2 Klein (Black Forest Labs' destillierte schnelle Variante von Flux 2) auf RunComfy – gebündelt mit den dokumentierten Prompting-Mustern des Modells, sodass der Skill schärfere Ergebnisse liefert als naives Prompting mit demselben Modell. Dokumentiert die Stärken von Flux 2 Klein (Latenz unter einer Sekunde, Multi-Referenz-Markenstyling, deklarative Subjekt-zuerst-Prompts), die Schrittanzahl-Strategie (4–8 für schnelle Iteration, ~25 für Verfeinerung), den Trade-off zwischen der 9B- und 4B-Variante sowie die Entscheidung, wann zu Flux 2 Pro /... weitergeleitet werden sollte.
creativeimagemedia
flux-kontext
doany-ai
Bilder mit Flux 1 Kontext Pro (Black Forest Labs' präzises lokales Bildbearbeitungsmodell) auf RunComfy bearbeiten — gebündelt mit den dokumentierten Prompting-Mustern des Modells, sodass die Fertigkeit schärfere Ergebnisse liefert als naives Prompting mit demselben Modell. Dokumentiert die Stärken von Flux Kontext (präzise lokale Bearbeitungen mit einer Referenz, starke Prompt-Kontrolle, konsistente hochtreue Ausgaben), das Schema (einzelnes Bild + Prompt) und wann stattdessen zu Nano Banana Edit / GPT Image 2 edit / Flux 2 Klein weitergeleitet werden sollte. Ruft...
creativeimagedocument
gpt-image-2
doany-ai
Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...
creativeimageapi
gpt-image-edit
doany-ai
Edit images with OpenAI GPT Image 2 (the `/edit` endpoint of ChatGPT Images 2.0) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents GPT Image Edit's strengths (preservation language, multilingual in-image text editing, multi-reference up to 10 images, layout / typography precision), the schema, and when to route to Nano Banana Edit / Flux Kontext / GPT Image 2 t2i instead. Calls...
imagecreativeapi
happyhorse-1-0
doany-ai
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
image-edit
doany-ai
Bilder auf RunComfy bearbeiten – diese Fertigkeit ist ein intelligenter Router, der die Absicht des Nutzers mit dem passenden Bearbeitungsmodell im RunComfy-Katalog abgleicht. Wählt Nano Banana Edit (Stapelverarbeitung bis zu 20, standardmäßig identitätsbewahrend), OpenAI GPT Image 2 Edit (mehrsprachige Textumschreibung im Bild, Multi-Referenz-Komposition, Layout-Präzision), Flux Kontext Pro (Einzel-Referenz, hochgenaue lokale Bearbeitung) oder Z-Image Turbo Inpaint (maskengesteuerte präzise Bereichsbearbeitung). Bündelt die dokumentierten Eingabeaufforderungsmuster jedes Modells, sodass die Fertigkeit...
creativeimagemedia
image-inpainting
doany-ai
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-outpainting
doany-ai
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-to-video
doany-ai
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
kling-3-0
doany-ai
Kling 3.0 Videogenerierung auf RunComfy. Kling 3.0 (auch Kling V3.0 genannt) ist das Multi-Shot-Videomodell der dritten Generation von Kuaishou Technology mit nativem synchronisiertem Audio und konsistenter Charakteridentität über mehrere Aufnahmen hinweg. Diese Fertigkeit deckt alle sechs Kling 3.0-Endpunkte ab, die drei Rendering-Stufen (Standard, Pro, 4K) und zwei Modi (Text-zu-Video, Bild-zu-Video) umfassen. Aufrufe erfolgen über runcomfy run kling/kling-3.0/ / über die lokale RunComfy-CLI. Auslöser sind "kling", "kling 3.0", "kling v3", "kling pro",...
videocreativemedia
lipsync
doany-ai
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or...
creativevideomedia
nano-banana-2
doany-ai
Generiere Bilder mit Google Nano Banana 2 (Gemini-Familie Flash-Stufe Text-zu-Bild) auf RunComfy – gebündelt mit den dokumentierten Prompting-Mustern des Modells, sodass der Skill schärfere Ergebnisse liefert als naives Prompting mit demselben Modell. Dokumentiert die Stärken von Nano Banana 2 (schnelle Iteration, In-Bild-Typografie-Rendering, vorhersagbare Bildkomposition, optional webgestützter Kontext), die Preisgestaltung nach Auflösungsstufen, den Sicherheitstoleranz-Regler und wann auf Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream umgeleitet werden sollte...
creativeimageresearch
nano-banana-edit
doany-ai
Edit images with Google Nano Banana 2 (image-to-image edit endpoint) on RunComfy. Documents Nano Banana Edit's strengths (preserve subject identity, swap background, localize edits with spatial language, multi-image batch edits up to 20 inputs), the schema, and when to route to GPT Image 2 edit / Flux Kontext / Nano Banana 2 t2i instead. Calls `runcomfy run google/nano-banana-2/edit` through the local RunComfy CLI. Triggers on "nano banana edit", "edit with nano banana", "image edit nano...
creativeimageapi
relight
doany-ai
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
runcomfy-cli
doany-ai
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
seedance-v2
doany-ai
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
videocreativemedia
video-edit
doany-ai
Vorhandenes Video auf RunComfy bearbeiten – diese Fähigkeit ist ein intelligenter Router, der die Absicht des Benutzers mit dem richtigen Bearbeitungsmodell im RunComfy-Katalog abgleicht. Wählt Wan 2.7 Edit-Video (allgemeines Restyling / Hintergrundtausch / Verpackungstausch, Identitäts- und Bewegungserhaltung), Kling 2.6 Pro Motion Control (präzise Bewegungsübertragung von einem Referenzvideo auf eine Zielperson) oder Lucy Edit Restyle (leichtgewichtiges, identitätsstabiles Restyling / Outfitwechsel). Bündelt die dokumentierten Prompt-Muster jedes Modells, sodass die Fähigkeit...
videocreativemedia
video-extend
doany-ai
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-inpainting
doany-ai
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-outpainting
doany-ai
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
wan-2-7
doany-ai
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia