website-to-video

Author: heygen-com

Capture a general website/URL and turn it into a HyperFrames video (site tour, showcase, or social clip from the site's own visuals). Uses headless Chrome screenshots + brand assets. Use when intent is general — portfolio/blog/landing-page showcase or social clip from the site. NOT for: product/SaaS launch or promo (→ /product-launch-video, even from a URL); topic explainer with no site (→ /faceless-explainer); GitHub PR (→ /pr-to-video); adding captions to existing video (→...

npx skills add https://github.com/heygen-com/hyperframes --skill website-to-video

Website to HyperFrames

Capture a website, then produce a professional video from it.

Confirm the route before Step 0. This skill makes a video of / from a general site. If the user is really marketing / launching / promoting a product (even from this URL, even "promo for our site") → /product-launch-video. A topic explainer with no site → /faceless-explainer; a GitHub PR → /pr-to-video; re-cutting / recoloring / reordering an existing video file → out of scope. Routed here on a vague "make a video", or unsure launch-vs-general-site? Read /hyperframes first (full routing table + § What HyperFrames cannot do).

Users say things like:

  • "Turn this website into a 15-second social clip for Instagram"
  • "Make a 30-second site tour / showcase from https://..."
  • "Capture our homepage and build a video from its own visuals"

The workflow has 7 steps. Each produces an artifact that gates the next. By default it's collaborative — gates marked 💬 stop and ask the user. If the user signals autonomous mode ("decide for me", "surprise me"), 💬 user-preference gates are skipped; see step-2-brief.md for how that propagates.

Autonomous mode is NOT "skip all gates." Auto mode covers user-preference questions (TTS provider, voice, color emphasis, beat count, music yes/no, captions yes/no — where the agent decides on the user's behalf). It does NOT cover quality-verification gates. The following remain non-skippable in auto mode:

  • Asset Audit (Step 3) — viewing contact sheets and justifying USE/SKIP for each asset
  • Per-beat HTML read (Step 5) — structured evidence block per beat
  • DoD checklist (Step 6) — including animation-map, per-warning WCAG verification, audio/motion playback
  • Honest disclosure section (Step 6) — "What I did NOT verify" must appear in your final summary

If you find yourself reasoning "auto mode says bias toward action, so I'll skip X" — and X is a verification gate, not a preference question — that reasoning is wrong. Bias toward action applies to deciding what to build, not to deciding whether to verify.
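
The distinction above can be sketched as a small data model. This is a minimal illustration, not part of the skill itself: the names `GateKind`, `Gate`, and `must_run` are hypothetical, and the gate list only repeats the examples named in the text.

```python
from dataclasses import dataclass
from enum import Enum


class GateKind(Enum):
    PREFERENCE = "preference"      # user-preference question (the 💬 gates)
    VERIFICATION = "verification"  # quality check, never skippable


@dataclass(frozen=True)
class Gate:
    name: str
    step: int
    kind: GateKind


def must_run(gate: Gate, autonomous: bool) -> bool:
    """Return True if the gate still has to be executed.

    In autonomous mode the agent answers preference questions on the
    user's behalf, so only those gates are skipped. Verification gates
    run in every mode.
    """
    if gate.kind is GateKind.VERIFICATION:
        return True
    return not autonomous


# Hypothetical gate list built from the examples in the text above.
GATES = [
    Gate("TTS provider choice", 4, GateKind.PREFERENCE),
    Gate("Asset Audit", 3, GateKind.VERIFICATION),
    Gate("Per-beat HTML read", 5, GateKind.VERIFICATION),
    Gate("DoD checklist", 6, GateKind.VERIFICATION),
    Gate("Honest disclosure section", 6, GateKind.VERIFICATION),
]

# In autonomous mode only the preference gate drops out.
remaining = [g.name for g in GATES if must_run(g, autonomous=True)]
```

The point of keeping `kind` on the gate rather than on the mode is that "bias toward action" can never reach a verification gate: there is no code path that skips one.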


Step 0: Capture & Understand the Brand

Read: references/step-0-capture.md

Capture the site, then read the extracted data to understand the brand and product — what it does, who it's for, what voice it speaks in, what mood it lives in. The captured assets are a brand toolkit for later, not the building blocks the video is made from.

Gate: Site summary printed — strategy-first (what the product does, who it's for, brand voice) before the asset / color / font inventory.


Step 1: Brand Identity

Read: references/step-1-design.md

Write DESIGN.md — a brand cheat sheet covering the visual identity: colors, typography, component styles, layout principles. Use design-styles.json for exact computed values.

Speed option: For fast-pacing videos (billboard-per-beat), DESIGN.md can be a 50-line summary of colors + fonts + do's/don'ts — not a 300-line document. The sub-agent prompt in Step 5 pastes brand values directly, so DESIGN.md depth only matters for complex compositions.

Gate: DESIGN.md exists (any length) with at minimum: color palette, font choices, and do's/don'ts.


Step 2: Strategy & Messaging

Read: references/step-2-brief.md, references/capabilities.md (scan the Table of Contents — deep-dive sections only as needed)

Align with the user on what the video must communicate before talking visuals or assets. Parse the user's prompt — they probably already gave you the video type and style. Ask only what's missing: the ONE thing this video must say, the narrative arc, and the audience.

Gate: Video type, duration, format, and — critically — the message and narrative arc are locked. Without those, Step 3 can't write a concept-first storyboard.


Step 3: Storyboard + Script 💬

Read: references/step-3-storyboard.md

Write the storyboard concept-first: message → narrative arc → beats that serve the arc → techniques per beat → brand accents pass at the end. Then write the narration script to match. Present both to the user with a beat-by-beat summary. Iterate until they approve.

Gate: STORYBOARD.md + SCRIPT.md exist AND the user has approved the plan.


Step 4: VO, Timing + Captions 💬

Read: references/step-4-vo.md

If Step 2 said no narration — ask about background music, then skip to Step 5. Otherwise: ask the user which TTS provider (HeyGen TTS, ElevenLabs, or Kokoro), generate audio, transcribe, map timestamps to beats. Then ask about captions.

Gate: Either (a) no narration was requested and storyboard has manual beat timings, or (b) narration.wav + transcript.json exist and beat timings updated with real durations.
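
A minimal sketch of how this gate could be checked mechanically. The function name `step4_gate_passes` is hypothetical, and the sketch assumes `narration.wav` and `transcript.json` sit directly in the project directory, which the text does not state. Whether beat timings are ready is passed in as a flag because it cannot be inferred from file presence alone.

```python
import tempfile
from pathlib import Path

REQUIRED_NARRATION_FILES = ("narration.wav", "transcript.json")


def step4_gate_passes(project_dir: Path, narration_requested: bool,
                      beat_timings_ready: bool) -> bool:
    """Check the two branches of the Step 4 gate.

    (a) No narration: the storyboard needs manual beat timings.
    (b) Narration: both audio artifacts must exist and the beat
        timings must have been updated with real durations.
    """
    if not beat_timings_ready:
        return False
    if not narration_requested:
        return True
    # Assumption: artifacts live at the project root.
    return all((project_dir / name).is_file()
               for name in REQUIRED_NARRATION_FILES)


# Usage: the gate fails until both narration artifacts are present.
with tempfile.TemporaryDirectory() as tmp:
    project = Path(tmp)
    before = step4_gate_passes(project, True, True)
    (project / "narration.wav").write_bytes(b"")
    (project / "transcript.json").write_text("{}")
    after = step4_gate_passes(project, True, True)
```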


Step 5: Build Compositions

Read: The hyperframes skill (load it; every rule matters)

Read: references/step-5-build.md

Build index.html and compositions following the architecture and pacing chosen in the storyboard (Step 3). Sub-agents run hyperframes lint and hyperframes snapshot on each beat before reporting back.

Gate: Every compositions/beat-N.html has been read top-to-bottom by the main agent against DESIGN.md and STORYBOARD.md. The per-beat checklist lives in step-5-build.md.


Step 6: Validate & Deliver

Read: references/step-6-validate.md

Lint, validate, take snapshots scaled to video length (formula: max(beats × 3, ceil(duration_seconds / 2))), and review each one. Fix issues before delivering. Deliver the localhost Studio project URL — only render to MP4 on explicit user request. Surface that Studio URL only at handoff — it is the final, stable preview; the build-phase snapshots are headless, so do not pop a preview mid-build.
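
The snapshot formula above works out as follows. This is a small sketch; the function name `snapshot_count` is ours, and only the formula itself comes from the text.

```python
import math


def snapshot_count(beats: int, duration_seconds: float) -> int:
    """Number of snapshots to review: max(beats × 3, ceil(duration / 2))."""
    return max(beats * 3, math.ceil(duration_seconds / 2))


# A 15-second clip with 4 beats: max(12, 8) = 12 snapshots.
short_clip = snapshot_count(beats=4, duration_seconds=15)

# A 60-second video with 5 beats: max(15, 30) = 30 snapshots.
long_video = snapshot_count(beats=5, duration_seconds=60)
```

Short, beat-dense videos are driven by the per-beat term, while long videos with few beats are driven by the duration term.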

Deliver something you're proud of. Before handing off, ask yourself: would I post this on social media with my name on it? If not, fix what's wrong.

Gate: npx hyperframes lint and npx hyperframes validate pass with zero errors, and the final response includes the active Studio project URL.
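
One way to script this gate, as a hedged sketch. It assumes a zero exit code from each command means "zero errors", which the text does not confirm. The helper names (`run_command`, `failed_commands`, `step6_gate_passes`) are hypothetical, and the runner is injectable so the logic can be exercised without the CLI installed.

```python
import subprocess
from typing import Callable, List, Optional, Sequence

Runner = Callable[[Sequence[str]], int]

# The two commands named in the gate.
GATE_COMMANDS = (
    ("npx", "hyperframes", "lint"),
    ("npx", "hyperframes", "validate"),
)


def run_command(cmd: Sequence[str]) -> int:
    """Run a command and return its exit code."""
    return subprocess.run(list(cmd), check=False).returncode


def failed_commands(runner: Runner = run_command) -> List[str]:
    """Run every gate command and list the ones that did not exit 0.

    Assumption: a non-zero exit code signals at least one error.
    """
    return [" ".join(cmd) for cmd in GATE_COMMANDS if runner(cmd) != 0]


def step6_gate_passes(studio_url: Optional[str],
                      runner: Runner = run_command) -> bool:
    """Pass only if both commands succeed and a Studio URL is available."""
    if not studio_url:
        return False
    return not failed_commands(runner)
```

Running both commands before deciding, rather than stopping at the first failure, means a single pass reports every failing check.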


Quick Reference

Video Types

Typical constraints by video type — use as a starting point, not a formula. Beat count should follow from the content and the narration, not from a target range.

| Type | Typical duration | Duration driver | Narration |
| --- | --- | --- | --- |
| Social ad (IG/TikTok) | 10–15s | Platform limit | Optional |
| Product demo | 30–60s | Script length | Full narration |
| Feature announcement | 15–30s | Feature complexity | Full narration |
| Brand reel | 20–45s | Music track | Optional, music focus |
| Launch teaser | 10–20s | Hook energy | Minimal |

Beat count is not in this table intentionally — it should come from the storyboard, not from "social ad = 3-4 beats." A social ad for a complex product might need 5 well-timed beats. A brand reel with one strong visual thesis might need 3.

Format

  • Landscape: 1920x1080 (default)
  • Portrait: 1080x1920 (Instagram Stories, TikTok)
  • Square: 1080x1080 (Instagram feed)
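
The three formats can be kept as a small lookup. This is an illustrative sketch only: `FORMATS` and `resolve_format` are hypothetical names, not part of HyperFrames.

```python
from typing import Dict, Tuple

# (width, height) in pixels, taken from the list above.
FORMATS: Dict[str, Tuple[int, int]] = {
    "landscape": (1920, 1080),  # default
    "portrait": (1080, 1920),   # Instagram Stories, TikTok
    "square": (1080, 1080),     # Instagram feed
}


def resolve_format(name: str = "landscape") -> Tuple[int, int]:
    """Return (width, height) for a format name, case-insensitively."""
    key = name.strip().lower()
    if key not in FORMATS:
        raise ValueError(f"unknown format {name!r}; expected one of {sorted(FORMATS)}")
    return FORMATS[key]


width, height = resolve_format("Portrait")
```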

Reference Files

| File | When to read |
| --- | --- |
| step-0-capture.md | Step 0: capture, understand the brand and product, write strategy-first site summary |
| step-1-design.md | Step 1: write DESIGN.md brand cheat sheet (5 sections, 250-350 lines; 50-line fast-path for billboard-style social ads) |
| step-2-brief.md | Step 2: align on message, narrative arc, audience with user |
| capabilities.md | Steps 2 & 5: full inventory of what HyperFrames can do (24 sections). Scan the TOC during the brief, deep-dive specific sections during build |
| step-3-storyboard.md | Step 3: storyboard + script (combined) with user review gate |
| step-4-vo.md | Step 4: TTS provider choice, generation, timing |
| step-5-build.md | Step 5: build index.html + compositions |
| step-6-validate.md | Step 6: lint, validate, snapshots (scaled to video length), preview |
| techniques.md | Steps 3 & 5: 13 primitive animation techniques with code patterns (adapt, don't copy-paste) |
| html-in-canvas-patterns.md | Step 5: complete code patterns for HTML-in-Canvas effects (lives in the hyperframes skill) |
