use-my-browser

作者: xixu-me

当工作依赖于用户的实时浏览器会话或可见渲染状态而非静态抓取时使用,尤其适用于浏览器调试场景、DevTools选中的元素或请求、已登录的仪表盘或CMS流程、本地应用、表单、上传、下载、媒体检查、DOM或iframe检查、Shadow DOM,以及表现为软404、认证墙、反爬虫检测或速率限制的浏览器故障。

npx skills add https://github.com/xixu-me/skills --skill use-my-browser

Do not treat this skill as a generic browsing default. Route from the evidence you need, not from tool preference.

Every task must be classified before you choose a route:

  • static-capable: the evidence can be produced without live browser state, visible confirmation, or page interaction
  • browser-required: the evidence depends on rendered state, interaction, live session behavior, or browser-only structures

Only static-capable tasks may fall back to static retrieval, curl, or other non-browser paths. Once a task is browser-required, stay on the browser path and mark missing capability as blocked instead of silently downgrading.

Prerequisite check

This skill is for work inside the user's live browser session, not for launching a separate fresh automation browser.

Before doing browser automation, confirm that your environment already has access to a live browser stack that can provide the capabilities the task depends on, such as page inventory, task-owned page creation, page selection, snapshots or visible-state reads, DOM inspection, text or form input, uploads, dialogs, console inspection, and network inspection. The exact stack does not matter here: confirm capability, not brand.

If the live browser stack is unavailable, do not attempt browser automation through this skill. Only static-capable work may fall back to static retrieval.

Live browser automation can trigger anti-bot or anti-automation defenses on some sites. Use browser interaction only when the task truly needs it, and avoid unnecessary repetitive actions once the needed evidence has been obtained.

Experience loop

Treat site patterns as part of the browser protocol, not as optional background reading.

For browser-required work, run this loop:

  1. As soon as the target domain is known, check whether a matching note already exists under references/site-patterns/.
  2. If a note exists, read it before the first meaningful browser mutation on that domain.
  3. During the run, watch for verified site-specific facts that would change how a future run should operate.
  4. Before you consider the task complete, decide whether the run produced a reusable fact, disproved an existing fact, or produced no reusable site-specific learning.
  5. If the run verified something reusable or disproved an existing claim, update the matching note before finishing.

Do not create a domain note for one-off noise. Do not skip the end-of-run review just because the task itself succeeded.

Writeback is expected when a run verifies any of the following:

  • a stable route shape or required query parameter
  • a login, session inheritance, or isolatedContext quirk
  • a reliable interaction primitive such as hover, keyboard entry, upload sequencing, or a selector bridge pattern
  • a domain where DOM-generated links are reliable but hand-built URLs are not
  • predictable anti-automation friction or a misleading platform error state
  • a reusable media extraction or iframe / Shadow DOM access pattern

Decision guide

Start with the outcome, not the tool. Make the user's goal explicit, define what counts as done, and choose the cheapest route that can still produce the right evidence.

Use this routing order:

  1. Decide whether the task is static-capable or browser-required.
  2. If the task is static-capable, load references/task-routing.md and stay on the cheapest route that still satisfies the evidence target.
  3. If the task is browser-required, load references/browser-playbook.md.
  4. If browser-required capability is uncertain in a fresh host session, also load references/browser-capability-matrix.md.
  5. If the user already has an active browser debugging context, such as a selected inspector element or network request, also load references/debug-handoff.md.
  6. If the browser-required task touches a logged-in dashboard, admin surface, CMS, editor, or any save / publish / update flow, also load references/control-plane-workflows.md.
  7. If the current failure shape suggests a soft 404, content-unavailable state, suspicious no-op interaction, auth wall, rate limit, or anti-automation defense, also load references/anti-automation-friction.md.
  8. If the browser-required task includes iframe, Shadow DOM, collapsed content, or lazy-loaded evidence, also load references/deep-dom.md.
  9. If the important evidence lives in an image, audio clip, or video, also load references/media-inspection.md.
  10. If browser work can be divided across independent page owners or sub-agents, also load references/parallel-browser-ownership.md.
  11. If you already know a reliable selector but need an MCP-native uid target, also load references/selector-bridge.md.
  12. If page actions leave state ambiguous, a page unexpectedly navigates, an old uid may have gone stale, or console / network inspection is now needed to explain the next browser decision, also load references/browser-recovery.md.
  13. If the target site already has a matching domain note under references/site-patterns/, read that note before operating on the site.

Treat the following as browser-required by default:

  • localhost, 127.0.0.1, or benchmark-style local fixtures
  • uploads, downloads, drag-and-drop, hover, keyboard-native entry, or visible confirmation states
  • same-origin iframe inspection, Shadow DOM inspection, details / collapsed evidence, or lazy-loaded content
  • any task where "what the page visibly shows" is itself the evidence

The normal happy path for a common task is this entrypoint plus one or two references, not the entire reference set.

Hard rules

  • Use browser interaction only when live browser state is part of the evidence or required action.
  • Once a task is browser-required, do not silently downgrade.
  • Treat this file as the entrypoint and each reference file as a single-purpose authority. Do not duplicate rules across files.
  • Keep reference loading one level deep. Decide the next file from this entrypoint instead of turning one reference into a hub that links to more references.
  • Do not ask the user to log in just because a page looks restricted. First confirm whether the target content or action is actually blocked.
  • Prefer site-generated DOM links over hand-built URLs once the page has shown you the path it expects.
  • Prefer MCP-native actions over script-driven interaction when the task is genuinely an in-browser action.
  • Only close pages you created.
  • Prefer primary sources over aggregators or repeated secondary reporting.
  • If a matching site pattern note exists, read it before the first meaningful browser mutation on that domain.
  • Do not finish a browser-required task without explicitly checking whether the run should create, update, downgrade, or remove a site-pattern claim.
  • If an existing site-pattern claim fails under comparable conditions, stop trusting it, fall back to the generic workflow, and update the note instead of retrying the stale assumption.
  • Do not use curl, Invoke-WebRequest, or shell HTTP fetches for browser-required tasks.
  • Do not treat a generic page-opening tool as evidence that localhost deep interaction is available.
  • Do not switch routes just because a browser capability probe failed. Record the missing capability and stop.
  • When the user indicates an active browser debugging context, prefer handoff from that current context over fresh reproduction from scratch.

Reference index

来自 xixu-me 的更多技能

github-actions-docs
xixu-me
当用户询问如何编写、解释、自定义、迁移、保护或排查GitHub Actions工作流、工作流语法、触发器、矩阵、运行器、可复用工作流、制品、缓存、密钥、OIDC、部署、自定义操作或Actions Runner Controller时使用,尤其是当用户需要官方GitHub文档、精确链接或基于文档的YAML指导时。
developmentdevopsdocument
readme-i18n
xixu-me
当用户想要翻译仓库的README、使仓库支持多语言、本地化文档、添加语言切换器、国际化README,或更新GitHub风格仓库中的本地化README变体时使用。
documentdevelopmentapi
openclaw-secure-linux-cloud
xixu-me
在云服务器上自托管OpenClaw时使用,用于加固远程OpenClaw网关、选择SSH隧道、Tailscale或反向代理暴露方式,或审查Podman、配对、沙箱、令牌认证及工具权限默认设置,以实现安全的个人部署。
devopssecurity
develop-userscripts
xixu-me
在构建、调试、打包或发布用于Tampermonkey或ScriptCat的浏览器用户脚本时使用,包括GM API、元数据块、权限问题、@match/@grant/@connect设置、ScriptCat后台或定时脚本、UserConfig块或订阅工作流。
developmentbrowser-automationweb-scraping
secure-linux-web-hosting
xixu-me
Use when setting up, hardening, or reviewing a cloud server for self-hosting, including DNS, SSH, firewalls, Nginx, static-site hosting, reverse-proxying an app, HTTPS with Let's Encrypt or ACME clients, safe HTTP-to-HTTPS redirects, or optional post-launch network tuning such as BBR.
devopssecurityaws
opensource-guide-coach
xixu-me
当用户希望获得关于启动、贡献、发展、治理、资助、保护或维护开源项目的指导,或询问贡献者入职、社区健康、维护者倦怠、行为准则、指标、法律基础或开源项目采用等问题时使用。
developmentresearch
running-claude-code-via-litellm-copilot
xixu-me
在通过本地LiteLLM代理将Claude Code路由到GitHub Copilot时使用,以减少直接Anthropic支出、配置ANTHROPIC_BASE_URL或ANTHROPIC_MODEL覆盖,或排查Copilot代理设置失败问题,例如模型未找到、无本地流量或GitHub 401/403认证错误。
developmentapidevops
skills-cli
xixu-me
Use when users ask to discover, install, list, check, update, remove, back up, restore, sync, or initialize Agent Skills, mention `bunx skills`, `npx skills`, `skills.sh`, or `skills-lock.json`, ask "find a skill for X", or want help extending agent capabilities with installable skills.
developmentapiproductivity