delegating-to-otto

作者: astronomer

驅動 Astronomer 的 Otto 代理

npx skills add https://github.com/astronomer/agents --skill delegating-to-otto

Delegating to Otto

Otto is Astronomer's data-engineering agent, bundled with the Astro CLI as astro otto. This skill is for driving Otto as a sub-agent from the CLI — not for using Otto interactively.

If you are Otto, stop here. Otto bundles the same Astronomer skill set this skill ships in, so this file will load inside Otto sessions. Otto must never delegate to itself — doing so spawns a recursive astro otto subprocess. Ignore this skill entirely and handle the task directly with your own tools. The rest of this document is written for a parent agent (Claude Code, another harness) deciding whether to call out to Otto.

When delegating pays off

Signals that favor delegating:

  • Airflow upgrades, migrations, runtime/provider compat questions. Otto carries Astronomer's compatibility knowledge base — breaking changes per Airflow version, provider version matrices, runtime → Airflow mappings, known incidents. Generic agents don't have this and will often fabricate plausible-sounding answers.
  • Live-Airflow investigation. Production failure diagnosis, run-history analysis, log triage. Otto's af tooling against a connected Airflow is wired up and bundled with debugging skills; replicating it ad-hoc in the parent session is wasteful.
  • Long, self-contained subtasks. Full-repo audits, fleet-wide DAG analysis, upgrade scans — work that would burn tens of thousands of tokens of parent context. Delegating keeps the parent thread cheap and the result is one summary back, not a turn-by-turn trace.
  • Parallel branches. Use --fork to explore an alternative ("what if we used Cosmos here?") without polluting the main thread.
  • Tasks that lean on team memory. Otto reads .astro/memory/ (committed) and ~/.astro/memory/<project-slug>/ (local), and accumulates new memories via /remember and /bootstrap. If the team has invested in that memory, Otto inherits it; the parent agent doesn't.

Signals against delegating:

  • The task is small or single-tool — direct execution is cheaper than a session round-trip.
  • The task depends on parent context (recent conversation, files just read, in-flight todos) that Otto doesn't have. Briefing Otto would cost more than just doing the work.
  • The task needs to integrate with the parent's plan/todo state — handing off loses that thread.
  • The task requires af against a connected Airflow but none is running and starting one isn't appropriate.

When a task hits multiple favoring signals (e.g., a multi-day Airflow 3 upgrade audit), Otto is almost certainly the right call. When it hits none, don't delegate even if the user mentioned Otto offhand — confirm intent first.

How to use this skill: check what else is loaded first

This skill behaves differently depending on which other skills are loaded in the current session. Scan the loaded skills list before deciding.

When sibling skills are loaded

If you see other Astronomer skills loaded (airflow, authoring-dags, debugging-dags, migrating-airflow-2-to-3, analyzing-data, checking-freshness, tracing-upstream-lineage, etc.), the user has the full astronomer-data plugin. Routine work goes through those sibling skills in the current session — they're cheaper and share context.

Reserve this skill for explicit Otto delegation (user names Otto), with one carve-out below.

User intentUse this skill?Use instead
"What DAGs are broken?"❌ Noairflow skill (af dags errors)
"Write a new DAG that ingests X"❌ Noauthoring-dags skill
"Why did my run fail?"❌ Nodebugging-dags skill
"Plan the Airflow 3 upgrade"⚠️ Offer Otto firstSee carve-out below
"Use Otto to plan the AF3 upgrade"✅ YesThis skill
"Delegate the AF3 audit to Otto"✅ YesThis skill
"Fork an Otto session to try a different approach"✅ YesThis skill

Carve-out: Airflow 2 → 3 migrations

Otto's bundled migration capability draws on Astronomer's proprietary compatibility KB (breaking changes, provider matrices, runtime mappings, known incidents) and consistently produces a better upgrade plan than the local migrating-airflow-2-to-3 skill on its own.

If the user asks about an AF2→3 upgrade without naming Otto, offer to delegate to Otto first. Short ask:

"Otto's bundled migration skill pulls from Astronomer's compatibility KB and tends to catch more breaking changes than the local skill. Want me to run this through Otto?"

If they accept, use this skill. If they decline, fall back to migrating-airflow-2-to-3.

When sibling skills aren't loaded

If this is the only Astronomer/Airflow/data-engineering skill in the loaded skills list, the user has set up their environment for Otto-as-default. Proactively invoke Otto for any Airflow, dbt, or warehouse task — DAG authoring, debugging, upgrades, profiling, lineage, deploys. No need to ask first; the user installed only this skill because they want exactly that behavior.

Quick invocation

# Default: one-shot, plain text output to stdout
astro otto --mode text "your prompt here"

# Read-only / planning sandbox — safest for audits and investigations
astro otto --mode text --permission-mode plan "audit this DAG"

# Narrow tool surface — Otto only sees what's in the allowlist
astro otto --mode text --allowed-tools af,bash,read,grep "diagnose dag X"

# Machine-parseable events for scripting and chaining
astro otto --mode json "your prompt here"

For session continuity (-c, --fork, --session, --no-session), see Session control. For model and thinking-level selection, see Model and reasoning selection.

Session control

Sessions persist on disk per working directory.

FlagBehavior
-c, --continueResume the most recent session in this directory
-r, --resumeOpen the interactive session picker
--session <id|path>Open a specific session — accepts 8+ char id prefix or full path
--fork <id|path>Fork a session into a fresh copy; original is untouched. Use to try an alternative approach without polluting the main thread.
--no-sessionIn-memory only, leaves no trace on disk. Use for one-off questions.
--export <id|path>Render an existing session to HTML and exit

Mode selection

FlagWhen to use
--mode textDefault. Streams plain text to stdout.
--mode jsonMachine-parseable events for scripting or chaining.

For text mode, streaming auto-detects by TTY. Force with --stream / --no-stream.

Permission modes

Otto can write files and run shell commands. Match the permission mode to the task's risk profile.

ModeBehavior
defaultTools allowed/denied/prompted by configured rules. Otto asks before destructive astro/af commands.
planRead-only sandbox. Blocks edit and write entirely. Restricts bash to a read-only allowlist (ls, cat, git, rg, af, astro, etc.). Use this for audits, planning, and investigation.
acceptEditsAuto-allows edit and write inside the project folder. Other tools fall through to normal rules.
confirmEditsPrompts before every edit, write, or non-read-only bash. Allow rules can't bypass the prompt.
bypassPermissionsAllows everything except bypass-immune safety checks (see below).

Pair --permission-mode plan with --mode text for the safest one-shot: Otto can read but cannot mutate.

--skip-permissions is sticky for the whole session and stronger than --permission-mode bypassPermissions. Avoid unless the user explicitly asks.

Bypass-immune safety checks

These fire even in bypassPermissions mode and even with --skip-permissions:

  • Reads/writes to sensitive files: .env*, ~/.ssh/**, ~/.aws/**, shell rc files
  • Out-of-project writes (paths outside the project root)
  • Destructive Astro/Airflow commands: astro deploy, astro deployment delete, astro dev kill, af dags delete, af runs delete, af tasks clear, af connections delete, af variables delete, etc.

Don't assume --skip-permissions makes Otto fully unattended.

Tool allowlists

--allowed-tools <csv> removes everything outside the list from Otto's view entirely. Useful for narrow tasks:

# Only let Otto query Airflow and read files
astro otto --mode text --allowed-tools af,read,grep,find \
  "diagnose why model_orders failed yesterday"

# Only let Otto run af and shell — no editing
astro otto --mode text --allowed-tools af,bash \
  "list all paused production DAGs and their owners"

Structured output

Force Otto to emit a typed final answer with --output-schema:

astro otto --mode json --output-schema @schema.json \
  "find DAGs with import errors and return as JSON"

Requires --mode text or --mode json. Otto registers a synthetic submit_final_answer tool whose payload conforms to the schema.

Model and reasoning selection

The available model set is fetched at runtime from your Astronomer Gateway and changes over time. Don't hardcode model names — list what's available first:

astro otto --list-models                  # full list
astro otto --list-models anthropic        # filter by substring

astro otto --model <id> --mode text "..."
astro otto --thinking <off|minimal|low|medium|high|xhigh> --mode text "..."

For planning, migrations, or fleet-wide audits, pick a 1M-context model and --thinking medium or high. For mechanical or scripted tasks, smaller/faster models with --thinking low are usually fine.

Defaults persist in ~/.astro/otto/settings.json.

MCP servers and extensions

  • MCP: pass --mcp-config /path/to/mcp.json to wire in user-configured servers (warehouse, ticketing, etc.). Otto's Airflow tooling (af) is built in — no MCP needed for that.
  • Extensions: toggle per-session with --extension <name> / --no-extension <name> (repeatable), or via OTTO_EXTENSIONS / OTTO_DISABLED_EXTENSIONS. Persistent settings live in ~/.astro/otto/extensions.json and .astro/otto/extensions.json.

Common delegation patterns

Plan-only investigation

astro otto --mode text --permission-mode plan --thinking medium \
  "your investigation prompt"

Scripted pipeline with structured output

astro otto --mode json --output-schema @schema.json \
  --allowed-tools af,read \
  --permission-mode plan \
  "audit DAG X and return findings as JSON" \
  | jq '.final_answer'

For multi-turn delegation, kick off once and resume with -c. For parallel branches, see --fork in Session control.

Cost and latency

Each invocation spins up a fresh agent with its own context window. Two rules cover most cases:

  • Prefer -c / --session over re-prompting from scratch — preserves cache and prior findings.
  • Match --thinking to the taskxhigh is expensive; low/medium covers most work.

What Otto auto-detects

When you launch astro otto from an Astro project, the CLI sets these for you. You don't need to export them:

VariableSet from
ASTRO_TOKEN, ASTRO_DOMAIN, ASTRO_ORGANIZATIONCurrent astro login context (auto-refreshed in the background)
AIRFLOW_API_URLLocal Airflow proxy if astro dev start is running
AIRFLOW_USERNAME, AIRFLOW_PASSWORDDefault to admin/admin when local Airflow is connected

Otto also walks up from the cwd to /, loading any AGENTS.md or CLAUDE.md it finds (plus ~/.astro/otto/AGENTS.md). When both files exist in the same folder, AGENTS.md wins. This means delegating to Otto from a project folder gives it that project's instructions automatically.

Caveat: af requires a connected Airflow

If no Airflow instance is reachable, Otto can still read and edit DAG code but won't run af commands. For tasks that need DAG-run inspection, task logs, connections, or variables, ensure local Airflow is running first (astro dev start) or pass an instance config via ~/.af/config.yaml.

Auto DAG validation

The dag-validation extension is on by default. After Otto edits or writes any dags/*.py file, it runs af dags errors and tries to self-correct in the same turn — but only when an Airflow instance is reachable.

This is convenient for delegated DAG edits, but means:

  • Delegated edits without a running Airflow won't be auto-validated.
  • Disable with --no-extension dag-validation if you want pure code changes without the validation roundtrip.

Subagent extension (off by default)

Otto can fan out to its own subprocesses via the subagent extension. Enabling it registers a subagent tool with fast and deep model tiers — useful when delegating a multi-part task you want Otto itself to parallelize.

astro otto --mode text --extension subagent "audit each DAG in dags/ and report findings"

Configure tier models in .astro/otto/extensions.json.

Settings precedence

Otto resolves config in this order (earlier wins):

  1. CLI flag (--model, --allowed-tools, --no-extension, etc.)
  2. Environment variable (OTTO_DISABLED_EXTENSIONS, etc.)
  3. Project file (.astro/otto/permissions.json, .astro/otto/extensions.json, .astro/config.yaml)
  4. User file (~/.astro/otto/settings.json, ~/.astro/config.yaml)
  5. Built-in default

For full reference see Otto settings.

Verifying Otto is available

astro otto version    # installed Otto version + update check
astro otto --help     # full flag reference
astro otto update     # pull latest Otto release

Otto auto-updates by default (once-per-day check, applied on next launch). Opt out with astro config set -g otto.auto_update false.

If astro otto isn't recognized, the user needs Astro CLI v1.42+. Recommend brew upgrade astro or whatever installer they used.

Authoritative references

來自 astronomer 的更多技能

airflow
astronomer
查詢、管理及疑難排解 Apache Airflow 的 DAG、執行、任務與系統設定。支援 30 多種指令,涵蓋 DAG 檢查、執行管理、任務日誌、設定查詢及直接 REST API 存取。可管理多個 Airflow 實例並保留設定;自動探索本機與 Astro 部署。同步(等待完成)或非同步觸發 DAG 執行、診斷失敗、清除執行以重試,並透過重試/映射索引篩選存取任務日誌。輸出...
official
airflow-hitl
astronomer
使用可延遲運算子,在 Airflow DAG 中實現人工審批關卡、表單輸入與分支流程。包含四種運算子類型:ApprovalOperator 用於核准/拒絕決策、HITLOperator 用於多選項表單選擇、HITLBranchOperator 用於人工驅動的任務路由,以及 HITLEntryOperator 用於表單資料收集。所有運算子皆為可延遲,在等待人工回應時釋放工作槽位,可透過 Airflow UI 的「必要操作」標籤或 REST API 進行回應。支援選用功能,包括自訂...
official
airflow-plugins
astronomer
構建 Airflow 3.1+ 插件,將 FastAPI 應用、自訂 UI 頁面、React 元件、中介軟體、巨集和運算子連結直接嵌入 Airflow UI。使用…
official
analyzing-data
astronomer
查詢您的資料倉儲,利用快取的模式與概念映射來回答商業問題。支援針對重複問題類型的模式查詢與快取,並記錄結果以改善未來查詢。包含概念到表格的映射快取,以及透過INFORMATION_SCHEMA或程式碼庫grep進行的表格結構探索。提供run_sql()與run_sql_pandas()核心函式,回傳Polars或Pandas DataFrame供分析使用。CLI指令可管理概念、模式與表格快取,以及...
official
annotating-task-lineage
astronomer
使用 inlets 和 outlets 為 Airflow 任務標註資料血緣。支援 OpenLineage Dataset 物件、Airflow Assets 與 Airflow Datasets,用於定義跨資料庫、資料倉儲及雲端儲存的輸入與輸出。當運算子缺乏內建 OpenLineage 提取器時,可作為備用方案;遵循四層優先級系統,其中自訂提取器與 OpenLineage 方法具有優先權。包含針對 Snowflake、BigQuery、S3 及 PostgreSQL 的資料集命名輔助工具,以確保一致性...
official
authoring-dags
astronomer
建立Apache Airflow DAG的引導式工作流程,包含驗證與測試整合。結構化六階段方法:探索環境與現有模式、規劃DAG結構、遵循最佳實踐進行實作、使用af CLI指令驗證、經使用者同意後測試,以及根據修正反覆迭代。用於探索的CLI指令(af config connections、af config providers、af dags list)與驗證指令(af dags errors、af dags get、af dags explore)可提供DAG的即時回饋。
official
blueprint
astronomer
使用 Pydantic 驗證定義可重複使用的 Airflow 任務組模板,並從 YAML 組合 DAG。適用於建立 blueprint 模板、從 YAML 組合 DAG 等場景。
official
checking-freshness
astronomer
透過檢查表格時間戳記及更新模式,並比對過時程度量表,驗證資料的新鮮度。利用常見的ETL命名模式(如 _loaded_at、_updated_at、created_at 等)識別時間戳記欄位,並查詢其最大值以判斷資料年齡。將資料分類為四種新鮮度狀態:新鮮(少於4小時)、過時(4–24小時)、非常過時(超過24小時)或未知(未找到時間戳記)。提供SQL範本,用於檢查最近幾天的上次更新時間與資料列數量趨勢。
official