use-my-browser

作者: xixu-me

當工作依賴於用戶的即時瀏覽器工作階段或可見的渲染狀態,而非靜態擷取時使用,特別適用於瀏覽器除錯情境、開發者工具選取的元素或請求、已登入的儀表板或CMS流程、本地主機應用程式、表單、上傳、下載、媒體檢查、DOM或iframe檢查、Shadow DOM,以及看似軟性404、驗證牆、反機器人檢查或速率限制的瀏覽器故障。

npx skills add https://github.com/xixu-me/skills --skill use-my-browser

Do not treat this skill as a generic browsing default. Route from the evidence you need, not from tool preference.

Every task must be classified before you choose a route:

  • static-capable: the evidence can be produced without live browser state, visible confirmation, or page interaction
  • browser-required: the evidence depends on rendered state, interaction, live session behavior, or browser-only structures

Only static-capable tasks may fall back to static retrieval, curl, or other non-browser paths. Once a task is browser-required, stay on the browser path and mark missing capability as blocked instead of silently downgrading.

Prerequisite check

This skill is for work inside the user's live browser session, not for launching a separate fresh automation browser.

Before doing browser automation, confirm that your environment already has access to a live browser stack that can provide the capabilities the task depends on, such as page inventory, task-owned page creation, page selection, snapshots or visible-state reads, DOM inspection, text or form input, uploads, dialogs, console inspection, and network inspection. The exact stack does not matter here: confirm capability, not brand.

If the live browser stack is unavailable, do not attempt browser automation through this skill. Only static-capable work may fall back to static retrieval.

Live browser automation can trigger anti-bot or anti-automation defenses on some sites. Use browser interaction only when the task truly needs it, and avoid unnecessary repetitive actions once the needed evidence has been obtained.

Experience loop

Treat site patterns as part of the browser protocol, not as optional background reading.

For browser-required work, run this loop:

  1. As soon as the target domain is known, check whether a matching note already exists under references/site-patterns/.
  2. If a note exists, read it before the first meaningful browser mutation on that domain.
  3. During the run, watch for verified site-specific facts that would change how a future run should operate.
  4. Before you consider the task complete, decide whether the run produced a reusable fact, disproved an existing fact, or produced no reusable site-specific learning.
  5. If the run verified something reusable or disproved an existing claim, update the matching note before finishing.

Do not create a domain note for one-off noise. Do not skip the end-of-run review just because the task itself succeeded.

Writeback is expected when a run verifies any of the following:

  • a stable route shape or required query parameter
  • a login, session inheritance, or isolatedContext quirk
  • a reliable interaction primitive such as hover, keyboard entry, upload sequencing, or a selector bridge pattern
  • a domain where DOM-generated links are reliable but hand-built URLs are not
  • predictable anti-automation friction or a misleading platform error state
  • a reusable media extraction or iframe / Shadow DOM access pattern

Decision guide

Start with the outcome, not the tool. Make the user's goal explicit, define what counts as done, and choose the cheapest route that can still produce the right evidence.

Use this routing order:

  1. Decide whether the task is static-capable or browser-required.
  2. If the task is static-capable, load references/task-routing.md and stay on the cheapest route that still satisfies the evidence target.
  3. If the task is browser-required, load references/browser-playbook.md.
  4. If browser-required capability is uncertain in a fresh host session, also load references/browser-capability-matrix.md.
  5. If the user already has an active browser debugging context, such as a selected inspector element or network request, also load references/debug-handoff.md.
  6. If the browser-required task touches a logged-in dashboard, admin surface, CMS, editor, or any save / publish / update flow, also load references/control-plane-workflows.md.
  7. If the current failure shape suggests a soft 404, content-unavailable state, suspicious no-op interaction, auth wall, rate limit, or anti-automation defense, also load references/anti-automation-friction.md.
  8. If the browser-required task includes iframe, Shadow DOM, collapsed content, or lazy-loaded evidence, also load references/deep-dom.md.
  9. If the important evidence lives in an image, audio clip, or video, also load references/media-inspection.md.
  10. If browser work can be divided across independent page owners or sub-agents, also load references/parallel-browser-ownership.md.
  11. If you already know a reliable selector but need an MCP-native uid target, also load references/selector-bridge.md.
  12. If page actions leave state ambiguous, a page unexpectedly navigates, an old uid may have gone stale, or console / network inspection is now needed to explain the next browser decision, also load references/browser-recovery.md.
  13. If the target site already has a matching domain note under references/site-patterns/, read that note before operating on the site.

Treat the following as browser-required by default:

  • localhost, 127.0.0.1, or benchmark-style local fixtures
  • uploads, downloads, drag-and-drop, hover, keyboard-native entry, or visible confirmation states
  • same-origin iframe inspection, Shadow DOM inspection, details / collapsed evidence, or lazy-loaded content
  • any task where "what the page visibly shows" is itself the evidence

The normal happy path for a common task is this entrypoint plus one or two references, not the entire reference set.

Hard rules

  • Use browser interaction only when live browser state is part of the evidence or required action.
  • Once a task is browser-required, do not silently downgrade.
  • Treat this file as the entrypoint and each reference file as a single-purpose authority. Do not duplicate rules across files.
  • Keep reference loading one level deep. Decide the next file from this entrypoint instead of turning one reference into a hub that links to more references.
  • Do not ask the user to log in just because a page looks restricted. First confirm whether the target content or action is actually blocked.
  • Prefer site-generated DOM links over hand-built URLs once the page has shown you the path it expects.
  • Prefer MCP-native actions over script-driven interaction when the task is genuinely an in-browser action.
  • Only close pages you created.
  • Prefer primary sources over aggregators or repeated secondary reporting.
  • If a matching site pattern note exists, read it before the first meaningful browser mutation on that domain.
  • Do not finish a browser-required task without explicitly checking whether the run should create, update, downgrade, or remove a site-pattern claim.
  • If an existing site-pattern claim fails under comparable conditions, stop trusting it, fall back to the generic workflow, and update the note instead of retrying the stale assumption.
  • Do not use curl, Invoke-WebRequest, or shell HTTP fetches for browser-required tasks.
  • Do not treat a generic page-opening tool as evidence that localhost deep interaction is available.
  • Do not switch routes just because a browser capability probe failed. Record the missing capability and stop.
  • When the user indicates an active browser debugging context, prefer handoff from that current context over fresh reproduction from scratch.

Reference index

來自 xixu-me 的更多技能

github-actions-docs
xixu-me
當使用者詢問如何撰寫、解釋、自訂、遷移、保護或疑難排解 GitHub Actions 工作流程、工作流程語法、觸發器、矩陣、執行器、可重複使用工作流程、成品、快取、密碼、OIDC、部署、自訂動作或 Actions Runner Controller 時使用,特別是當他們需要官方 GitHub 文件、精確連結或基於文件的 YAML 指引時。
developmentdevopsdocument
readme-i18n
xixu-me
當使用者想要翻譯儲存庫的README、讓儲存庫支援多語言、在地化文件、加入語言切換器、國際化README,或是在GitHub風格的儲存庫中更新已在地化的README版本時使用。
documentdevelopmentapi
openclaw-secure-linux-cloud
xixu-me
在雲端伺服器上自行託管 OpenClaw 時使用,用於強化遠端 OpenClaw 閘道、選擇 SSH 隧道、Tailscale 或反向代理暴露方式,或審查 Podman、配對、沙箱、令牌驗證及工具權限預設值,以確保安全的個人部署。
devopssecurity
develop-userscripts
xixu-me
在構建、除錯、打包或發佈用於 Tampermonkey 或 ScriptCat 的瀏覽器使用者腳本時使用,包括 GM API、元數據區塊、權限問題、@match/@grant/@connect 設定、ScriptCat 背景或排程腳本、UserConfig 區塊或訂閱工作流程。
developmentbrowser-automationweb-scraping
secure-linux-web-hosting
xixu-me
在設定、強化或檢視用於自架站的雲端伺服器時使用,包括 DNS、SSH、防火牆、Nginx、靜態網站託管、應用程式的反向代理、使用 Let's Encrypt 或 ACME 用戶端的 HTTPS、安全的 HTTP 到 HTTPS 重新導向,或可選的啟動後網路調校(如 BBR)。
devopssecurityaws
opensource-guide-coach
xixu-me
當使用者希望獲得關於啟動、貢獻、發展、治理、資助、保護或維持開源專案的指導,或詢問貢獻者入門、社群健康、維護者倦怠、行為準則、指標、法律基礎或開源專案採用的相關問題時使用。
developmentresearch
running-claude-code-via-litellm-copilot
xixu-me
在透過本地 LiteLLM 代理將 Claude Code 路由至 GitHub Copilot 時使用,以減少直接 Anthropic 花費、設定 ANTHROPIC_BASE_URL 或 ANTHROPIC_MODEL 覆寫,或疑難排解 Copilot 代理設定失敗,例如模型未找到、無 localhost 流量或 GitHub 401/403 驗證錯誤。
developmentapidevops
skills-cli
xixu-me
Use when users ask to discover, install, list, check, update, remove, back up, restore, sync, or initialize Agent Skills, mention `bunx skills`, `npx skills`, `skills.sh`, or `skills-lock.json`, ask "find a skill for X", or want help extending agent capabilities with installable skills.
developmentapiproductivity