firecrawl-knowledge-ingest

작성자: firecrawl

Firecrawl 브라우저를 사용하여 공개 또는 인증된 지식 베이스와 문서 포털을 수집합니다. JS가 많이 사용된 문서, 로그인이 필요한 포털, 페이지가 나뉜 헬프 센터, 지원 지식 베이스, 또는 문서 사이트에서 구조화된 JSON/마크다운 추출에 사용하세요.

npx skills add https://github.com/firecrawl/firecrawl-workflows --skill firecrawl-knowledge-ingest

ZIP 다운로드 GitHub

107

Firecrawl Knowledge Ingest

Use this when a docs portal needs browser navigation, auth, pagination, or JS rendering.

Onboarding Interview

Infer the portal URL, output format, auth needs, and page limit from context. If the portal is clear, proceed immediately.

Ask at most 1-3 concise questions only if blocked, such as the portal URL, whether authentication is required, or the desired output format.

Firecrawl Collection Plan

Use Firecrawl browser to:

open the portal and inspect navigation
identify sections, categories, sidebar links, and article URLs
follow sidebar navigation, next links, pagination, load-more controls, or search
scrape article content as markdown
extract metadata such as title, section, last updated date, author, and tags

Try Firecrawl map as a supplement for public URLs, but use browser navigation for auth-gated or JS-heavy content.

Final Deliverable

# Knowledge Ingest: [Portal]

## Summary
[Pages extracted, sections covered, limitations]

## Output
[JSON/markdown/merged file path or content]

## Sections
[Section names and article counts]

## Failed Or Restricted Pages
[Any access/loading issues]

## Sources
[URLs extracted]

## Rerun Inputs
workflow: firecrawl-knowledge-ingest
url: [portal url]
format: [json/markdown/merged]
max_pages: [number]