apify-ecommerce

作者: apify

抓取30多个平台(包括Amazon、Walmart、eBay、Shopify、WooCommerce等)的电商数据,用于定价、评论、畅销品和卖家发现。

npx skills add https://github.com/apify/awesome-skills --skill apify-ecommerce

E-Commerce Cluster

Answer natural language e-commerce questions by routing to the right Apify Actor and delivering a synthesized answer via the apify CLI.

CLI rules: Always pass --user-agent apify-awesome-skills/apify-ecommerce, --json (or the relevant --format flag on datasets get-items), and 2>/dev/null. The --user-agent flag is critical for telemetry — never omit it.

Prerequisites

(No need to check it upfront)

  • Apify CLI v1.5.0+ (npm install -g apify-cli)
  • jq (recommended for quick extraction and filtering; brew install jq on macOS, apt install jq on Linux)
  • Authentication via one of:

Verify auth: apify info --user-agent apify-awesome-skills/apify-ecommerce — should show username and userId.

Workflow

Copy this checklist and track progress:

Task Progress:
- [ ] Step 1: Detect intent and select Actor
- [ ] Step 2: Fetch Actor schema
- [ ] Step 3: Ask user preferences (format, result count)
- [ ] Step 4: Run the Actor and fetch results
- [ ] Step 5: Analyze results and deliver synthesized answer

Step 1: Detect Intent and Select Actor

Classify the user's message into an intent, then pick the right Actor.

Intent signals:

Signals in user messageIntent
price, cost, cheapest, compare prices, pricingpricing
review, rating, sentiment, stars, feedbackreviews
bestseller, top selling, most popular, trendingbestsellers
seller, vendor, reseller, who sellssellers
all products from, scrape store, full catalogstore-scrape
what platform, built on, tech stack, Shopify or WooCommercetech-stack
SEO, listing quality, product page auditseo-audit
competitor funnel, competitor pricing, conversion elementscompetitor
search intent, keyword intent, SERP intentsearch-intent
match products, same product on different platformsproduct-matching
restaurant, food delivery, DoorDash, UberEats, TheForkfood-delivery
enrich store, store metadata, store liststore-enrichment
event, concert, ticket, Eventbriteevents
property, real estate, house listing, Realtorreal-estate
Facebook ads, Meta ads, ad library, competitor adsads-intelligence
classified, Craigslist, used item for saleclassifieds
car, used car, vehicle, automotive, Webmotorsautomotive
pins, inspiration, Pinterest boards, visual search, Pinterest trendscontent-discovery
TikTok Shop, TikTok store, TikTok creatortiktok-shop
website for sale, domain for sale, Flippawebsite-marketplace

If multiple intents are detected, ask: "Do you want [intent A] or [intent B]?"

Actor routing table — always try Primary first, switch to Fallback only if it fails or returns 0 results:

IntentPlatformPrimary ActorFallback Actor
pricingAmazon / Walmart / genericapify/e-commerce-scraping-tool
pricingeBayapify/e-commerce-scraping-toolivanvs/ebay-scraper-pay-per-result
pricingEtsyapify/e-commerce-scraping-toolepctex/etsy-scraper
pricingGoogle Shoppingapify/e-commerce-scraping-toolepctex/google-shopping-scraper
pricingFacebook Marketplaceapify/e-commerce-scraping-toolapify/facebook-marketplace-scraper
pricingSHEINapify/e-commerce-scraping-toolseamless_coffer/shein-product-scraper
pricingLazadaapify/e-commerce-scraping-toolfatihtahta/lazada-scraper
pricingCanadian Tireapify/e-commerce-scraping-toolazzouzana/canadiantire-ca-scraper
pricingTescoapify/e-commerce-scraping-toolradeance/tesco-scraper
pricingShopifyapify/e-commerce-scraping-tooltrovevault/shopify-products-scraper
pricingWooCommerceapify/e-commerce-scraping-tooltrovevault/woocommerce-products-scraper
reviewsAmazon / Walmart / genericapify/e-commerce-scraping-tooljunglee/amazon-reviews-scraper
reviewsTrustpilotapify/e-commerce-scraping-toolcasper11515/trustpilot-reviews-scraper
reviewsTheForkapify/e-commerce-scraping-tooljdtpnjtp/thefork-restaurant-scraper-advanced
bestsellersAmazonapify/e-commerce-scraping-tooljunglee/amazon-bestsellers
sellersAmazonapify/e-commerce-scraping-tooljunglee/amazon-seller-scraper
sellerseBayapify/e-commerce-scraping-toolivanvs/ebay-scraper-pay-per-result
store-scrapeShopifyapify/e-commerce-scraping-tooltrovevault/shopify-products-scraper
store-scrapeWooCommerceapify/e-commerce-scraping-tooltrovevault/woocommerce-products-scraper
store-scrapeAmazonapify/e-commerce-scraping-tooljunglee/Amazon-crawler
store-scrapeFlippaapify/e-commerce-scraping-toolscraped/flippa-scraper
tech-stackanyapify/e-commerce-scraping-tooltrovevault/e-commerce-tech-stack-detector
seo-auditanyapify/e-commerce-scraping-tooltrovevault/product-listing-seo-auditor
competitoranyapify/e-commerce-scraping-tooltrovevault/competitor-intelligence-scraper---funnel-pricing-conversion
search-intentanyapify/e-commerce-scraping-tooltrovevault/ai-serp-intent-extractor---search-intent-classifier
product-matchinganyapify/e-commerce-scraping-tooltri_angle/product-matching-vectorizer
store-enrichmentanyapify/e-commerce-scraping-tooltrovevault/e-commerce-store-data-enricher
food-deliveryDoorDashapify/e-commerce-scraping-tooltri_angle/doordash-store-details-scraper
food-deliveryUberEatsapify/e-commerce-scraping-toole-commerce/ubereats-reviews-scraper
food-deliveryTheForkapify/e-commerce-scraping-tooljdtpnjtp/thefork-restaurant-scraper-advanced
ads-intelligenceFacebook / Metaapify/e-commerce-scraping-toolapify/facebook-ads-scraper
classifiedsCraigslistapify/e-commerce-scraping-toolivanvs/craigslist-scraper-pay-per-result
automotiveWebmotorsapify/e-commerce-scraping-toolstealth_mode/webmotors-auto-search-scraper
eventsEventbriteapify/e-commerce-scraping-toolaitorsm/eventbrite
real-estateRealtor.comapify/e-commerce-scraping-toolpowerai/realtor-properties-search-scraper
content-discoveryPinterestapify/e-commerce-scraping-toolfatihtahta/pinterest-scraper-search
tiktok-shopTikTok Shopapify/e-commerce-scraping-toollemur/tiktok-shop-creators
website-marketplaceFlippaapify/e-commerce-scraping-toolscraped/flippa-scraper

Step 2: Fetch Actor Schema

Fetch the Actor summary, input schema, and README:

# Summary (title, description, pricing, stats)
apify actors info "ACTOR_ID" --user-agent apify-awesome-skills/apify-ecommerce --json 2>/dev/null

# Input schema (required and optional parameters; schema lives in
# .taggedBuilds.latest.build.inputSchema as an escaped JSON string)
apify actors info "ACTOR_ID" --user-agent apify-awesome-skills/apify-ecommerce --input --json 2>/dev/null

# README (capabilities, examples, gotchas)
apify actors info "ACTOR_ID" --user-agent apify-awesome-skills/apify-ecommerce --readme 2>/dev/null

Replace ACTOR_ID with the selected Actor (e.g., apify/e-commerce-scraping-tool).

Step 3: Ask User Preferences

Before running, ask:

  1. Output format:
    • Quick answer (default) — synthesized answer in chat, no file saved
    • CSV — full export saved to disk
    • JSON — full export saved to disk
  2. Result count — suggest defaults by intent:
IntentDefault
pricing50 products
reviews200 reviews
bestsellers100 items
sellers50 sellers
store-scrapeall (unlimited)
food-delivery50 restaurants
all others20–50

Cost safety: Always set a sensible result limit in the Actor input (e.g., maxResults, resultsLimit, maxCrawledPages, or equivalent field from the input schema). Default to the per-intent values above unless the user explicitly asks for more. Warn the user before running large scrapes (1000+ results) as they consume more Apify credits.

Step 4: Run the Actor and Fetch Results

Two steps: run the Actor (blocks until done), then fetch dataset items in the requested format.

Run the Actor — returns run metadata as JSON; extract defaultDatasetId for the next step:

apify actors call "ACTOR_ID" -i 'JSON_INPUT' \
  --user-agent apify-awesome-skills/apify-ecommerce --json 2>/dev/null

From the output use .id (run ID), .status (should be SUCCEEDED), and .defaultDatasetId.

Fetch results — pick the variant based on the user's preference:

# Quick answer: total count + fields + top 5 in chat (no file)
apify datasets info DATASET_ID --json \
  --user-agent apify-awesome-skills/apify-ecommerce 2>/dev/null \
  | jq '{itemCount, fields, consoleUrl}'
apify datasets get-items DATASET_ID --limit 5 \
  --user-agent apify-awesome-skills/apify-ecommerce --format json 2>/dev/null

# CSV file
apify datasets get-items DATASET_ID \
  --user-agent apify-awesome-skills/apify-ecommerce --format csv 2>/dev/null > YYYY-MM-DD_OUTPUT_FILE.csv

# JSON file
apify datasets get-items DATASET_ID \
  --user-agent apify-awesome-skills/apify-ecommerce --format json 2>/dev/null > YYYY-MM-DD_OUTPUT_FILE.json

Other --format options: jsonl, xlsx, xml, rss, html. Use --offset N to paginate large datasets.

Tip: for anything more than a quick peek, save the dataset to a local file first (with > file.json / > file.csv) and run further analysis from disk. apify datasets get-items always streams over the network, so piping it straight into jq re-downloads the whole thing every iteration.

Combining with jq for quick extraction:

Treat jq as a complement to apify datasets get-items, not a replacement: server-side --limit / --offset / --format keeps cost and bandwidth down. Use jq on a sample item or on a file you already saved.

# Discover real field names from one sample item (Actor outputs vary —
# use this before composing further jq queries)
apify datasets get-items DATASET_ID --limit 1 --format json \
  --user-agent apify-awesome-skills/apify-ecommerce 2>/dev/null \
  | jq '.[0]'

# Quick aggregation from a JSON file you already saved with the commands above
jq '[.[] | select(.rating != null and .rating >= 4.5)] | length' YYYY-MM-DD_OUTPUT_FILE.json

Step 5: Analyze Results and Deliver Answer

After the run completes, deliver a direct synthesized answer — not a data dump:

  • Pricing: price range, average, top 5 cheapest with URLs
  • Reviews: average rating, top 3 positive and negative themes, recent snippets
  • Bestsellers: top 10 by rank with name, price, rating, URL
  • Sellers: total sellers, price range per seller, unauthorized seller flags
  • Store-scrape: total products, category breakdown, price range, stock summary
  • Tech-stack: platform detected, confidence level, notable plugins
  • Food delivery: restaurant count, average rating, price tier breakdown
  • Ads intelligence: total ads, active/inactive split, top creative formats

Error Handling

  • Auth error → run apify login, or set APIFY_TOKEN env var
  • Actor not found → check Actor ID spelling in the routing table
  • Run status FAILED → open the console URL (.consoleUrl from run metadata) for logs
  • Timeout / very long run → pass --timeout <seconds> to apify actors call
  • No results → broaden the keyword or switch to a Fallback Actor from the routing table
  • proxy is required → add "proxy": {"useApifyProxy": true} to the Actor input
  • Platform not detected → default to apify/e-commerce-scraping-tool with generic intent

来自 apify 的更多技能

bug-triage
apify
对 apify/apify-mcp-server 上的开放 bug 问题进行分类。分析、草拟回复、获取批准、发布。
official
dig
apify
用于在Apify MCP服务器上探索、规划和指定工作的灵活技能。请勿编辑源文件——此技能仅用于理解和规划。
official
apify-actor-development
apify
创建、调试和部署用于网页抓取、自动化及数据处理的无服务器云程序。支持JavaScript、TypeScript和Python模板,集成Crawlee、Playwright和Cheerio库,用于HTTP和基于浏览器的爬取。包含通过apify run进行的本地测试(使用隔离存储)、输入/输出的模式验证,以及通过apify push部署到Apify平台。需要Apify CLI认证,并在.actor/actor.json中强制包含generatedBy元数据以用于AI...
official
apify-actorization
apify
将现有项目转换为无服务器Apify Actors,支持语言特定的SDK集成。支持JavaScript/TypeScript(使用Actor.init() / Actor.exit())、Python(异步上下文管理器)以及通过CLI包装器的任何语言。提供结构化工作流:使用apify init搭建脚手架,应用SDK封装,配置输入/输出模式,通过apify run进行本地测试,然后使用apify push进行部署。包含输入和输出模式验证、Docker容器化以及可选的按事件付费...
official
apify-audience-analysis
apify
从Facebook、Instagram、YouTube和TikTok提取受众人口统计、参与模式和行为数据。支持18+个专业Actor,涵盖所有四个平台的粉丝人口统计、参与指标、评论和资料分析。提供三种输出格式:快速聊天显示、CSV导出或JSON导出,用于下游分析。需要Apify令牌和mcpc CLI工具;使用动态模式获取来调整输入以适应每个Actor的要求。包括结构化...
official
apify-brand-reputation-monitoring
apify
监控Google Maps、Booking.com、TripAdvisor、Facebook、Instagram、YouTube和TikTok上的品牌声誉。支持16+个专用Apify Actor,覆盖所有主要平台的评论、评分、评论和提及内容。灵活的输出格式:在聊天中显示结果、导出为CSV或保存为JSON供下游分析使用。需要Apify令牌和Node.js 20.6+;使用mcpc CLI动态获取Actor架构和输入参数。工作流程引导用户选择平台...
official
apify-competitor-intelligence
apify
通过Apify Actors实现多平台竞争对手分析,覆盖Google Maps、Booking.com、Facebook、Instagram、YouTube和TikTok。包含七个平台25+个专用Actors,每个针对特定分析类型优化:商业数据提取、评论对比、广告策略监控、内容表现及受众洞察。需Apify令牌、Node.js 20.6+及mcpc CLI工具以动态获取Actor架构并运行分析。支持三种输出格式:快速聊天展示...
official
apify-content-analytics
apify
通过Apify Actors实现多平台内容分析,支持Instagram、Facebook、YouTube和TikTok。涵盖17+个专业Actors,覆盖所有四个平台的帖子、Reels、故事、评论、话题标签、粉丝和广告。使用mcpc CLI动态获取Actor模式,以确定所需输入和可用输出字段。结果以三种格式输出:快速聊天显示、CSV导出或JSON导出,并可自定义结果数量。需要在.env文件中配置Apify令牌,并安装Node.js 20.6+...
official