apify-ultimate-scraper

द्वारा apify

स्वचालित वेब स्क्रैपर जो Instagram, YouTube, Facebook, Google Maps और अन्य सहित 55+ प्लेटफार्मों के लिए इष्टतम एक्टर्स का चयन करता है। 8 प्रमुख प्लेटफार्मों पर 55+ पूर्व-कॉन्फ़िगर्ड एक्टर्स को कवर करता है, जिसमें उपयोग-मामला-विशिष्ट चयन मार्गदर्शन (लीड जनरेशन, इन्फ्लुएंसर खोज, ब्रांड

npx skills add https://github.com/apify/agent-skills --skill apify-ultimate-scraper

ZIP डाउनलोड करें GitHub

2.2k

Universal web scraper

AI-driven data extraction from ~100 Actors across 15+ platforms via the Apify CLI.

Rules for every apify command:

Pass --json for machine-readable output (stable across CLI versions).
Pass --user-agent apify-agent-skills/apify-ultimate-scraper for telemetry attribution.
Redirect stderr with 2>/dev/null (stderr contains progress messages that break JSON parsers).

Prerequisites

Apify CLI v1.5.0+ (npm install -g apify-cli)
Authenticated session (see below)

Authentication

If a CLI command fails with an auth error, authenticate using one of these methods:

OAuth (interactive): apify login (opens browser)
Environment variable: export APIFY_TOKEN=your_token_here
From .env file: source .env (if the file contains APIFY_TOKEN=...)

Generate token: https://console.apify.com/settings/integrations

Workflow

Step 1: Understand goal and select Actor

Identify the target platform and use case. Read references/actor-index.md to find the right Actor.

If the task involves a multi-step pipeline, also read the matching workflow guide:

Task involves...	Read
leads, contacts, emails, B2B	`references/workflows/lead-generation.md`
competitor, ads, pricing	`references/workflows/competitive-intel.md`
influencer, creator	`references/workflows/influencer-vetting.md`
brand, mentions, sentiment	`references/workflows/brand-monitoring.md`
reviews, ratings, reputation	`references/workflows/review-analysis.md`
SEO, SERP, crawl, content, RAG	`references/workflows/content-and-seo.md`
analytics, engagement, performance	`references/workflows/social-media-analytics.md`
trends, keywords, hashtags	`references/workflows/trend-research.md`
jobs, recruiting, candidates	`references/workflows/job-market-and-recruitment.md`
real estate, listings, hotels	`references/workflows/real-estate-and-hospitality.md`
price monitoring, e-commerce, products	`references/workflows/ecommerce-price-monitoring.md`
contact enrichment, email extraction	`references/workflows/contact-enrichment.md`
knowledge base, RAG, LLM data feed	`references/workflows/knowledge-base-and-rag.md`
company research, due diligence	`references/workflows/company-research.md`

If no Actor matches in the index, search dynamically:

apify actors search "KEYWORDS" --user-agent apify-agent-skills/apify-ultimate-scraper --json --limit 10 2>/dev/null

From results: items[].username/items[].name (Actor ID), items[].title, items[].stats.totalUsers30Days, items[].currentPricingInfo.pricingModel.

Step 2: Fetch Actor schema and check gotchas

Fetch the input schema dynamically:

apify actors info "ACTOR_ID" --user-agent apify-agent-skills/apify-ultimate-scraper --input --json 2>/dev/null

Also read references/gotchas.md to check for common pitfalls for the selected Actor.

For Actor documentation: apify actors info "ACTOR_ID" --user-agent apify-agent-skills/apify-ultimate-scraper --readme

Step 3: Configure and run

Skip user preferences for simple lookups (e.g., "Nike's follower count"). Go straight to running with quick answer mode.

For larger tasks, confirm output format (quick answer / CSV / JSON) and result count.

Standard run (blocking):

apify actors call "ACTOR_ID" --input-file input.json --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null

Prefer --input-file input.json for large or complex inputs. For tiny inputs, inline JSON is acceptable with shell quoting: --input '{"maxItems":10}'.

From output: .id (run ID), .status, .defaultDatasetId, .stats.durationMillis

Fetch results:

apify datasets get-items DATASET_ID --user-agent apify-agent-skills/apify-ultimate-scraper --format json

For CSV: apify datasets get-items DATASET_ID --user-agent apify-agent-skills/apify-ultimate-scraper --format csv

Quick answer mode: Fetch results as JSON, pick top 5, present formatted in chat.

Save to file: Fetch results, use Write tool to save as YYYY-MM-DD_descriptive-name.csv or .json.

Large/long-running scrapes:

apify actors start "ACTOR_ID" --input-file input.json --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null

Poll: apify runs info RUN_ID --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null (check .status for SUCCEEDED).

Step 4: Deliver results

Report: result count, file location (if saved), key data fields, and links:

Dataset: https://console.apify.com/storage/datasets/DATASET_ID
Run: https://console.apify.com/actors/runs/RUN_ID

For multi-step workflows: suggest the next pipeline step from the workflow guide.

Troubleshooting

Common errors and pitfalls are documented in references/gotchas.md. Read it before running PPE (pay-per-event) Actors.

apify की और Skills

bug-triage

apify

apify/apify-mcp-server पर खुले बग मुद्दों को ट्रायेज करें। विश्लेषण करें, प्रतिक्रिया का मसौदा तैयार करें, अनुमोदन प्राप्त करें, पोस्ट करें।

official

dig

apify

Apify MCP सर्वर पर कार्य की खोज, योजना और विशिष्टीकरण के लिए लचीला कौशल। स्रोत फ़ाइलों को संपादित न करें — यह कौशल केवल समझने और योजना बनाने के लिए है।

official

apify-actor-development

apify

वेब स्क्रैपिंग, ऑटोमेशन और डेटा प्रोसेसिंग के लिए सर्वरलेस क्लाउड प्रोग्राम बनाएं, डीबग करें और डिप्लॉय करें। JavaScript, TypeScript और Python टेम्पलेट्स को इंटीग्रेटेड Crawlee, Playwright और Cheerio लाइब्रेरीज़ के साथ HTTP और ब्राउज़र-आधारित क्रॉलिंग के लिए सपोर्ट करता है। इसमें apify run के माध्यम से आइसोलेटेड स्टोरेज के साथ लोकल टेस्टिंग, इनपुट/आउटपुट के लिए स्कीमा वैलिडेशन और apify push के माध्यम से Apify

official

apify-actorization

apify

मौजूदा प्रोजेक्ट्स को भाषा-विशिष्ट SDK एकीकरण के साथ सर्वरलेस Apify एक्टर्स में बदलें। JavaScript/TypeScript (Actor.init() / Actor.exit() के साथ), Python (एसिंक कॉन्टेक्स्ट मैनेजर), और CLI रैपर के माध्यम से किसी भी भाषा को सपोर्ट करता है। संरचित वर्कफ़्लो प्रदान करता है: स्कैफोल्ड करने के लिए apify init, SDK रैपिंग लागू करना, इनपुट/आउटपुट स्कीमा कॉन्फ़िगर करना, apify run के साथ स्थानीय रूप से परीक्षण करना, फिर apify push के स

official

apify-audience-analysis

apify

Facebook, Instagram, YouTube और TikTok से दर्शकों की जनसांख्यिकी, जुड़ाव पैटर्न और व्यवहार डेटा निकालें। सभी चार प्लेटफ़ॉर्म पर फ़ॉलोअर जनसांख्यिकी, जुड़ाव मीट्रिक्स, टिप्पणियाँ और प्रोफ़ाइल विश्लेषण को कवर करने वाले 18+ विशेषज्ञ एक्टर्स का समर्थन करता है। तीन आउटपुट प्रारूप प्रदान करता है: त्वरित चैट प्रदर्शन, CSV निर्यात, या डाउनस्ट्रीम विश्लेषण के लिए JSON निर्यात। Apify टोकन और mcpc CLI टूल की आवश्यकता है; प्रत्येक

official

apify-brand-reputation-monitoring

apify

Google Maps, Booking.com, TripAdvisor, Facebook, Instagram, YouTube और TikTok पर ब्रांड प्रतिष्ठा की निगरानी करें। समीक्षाओं, रेटिंग्स, टिप्पणियों और उल्लेखों को कवर करने वाले 16+ समर्पित Apify एक्टर्स का समर्थन करता है। लचीले आउटपुट प्रारूप: परिणाम चैट में प्रदर्शित करें, CSV में निर्यात करें, या डाउनस्ट्रीम विश्लेषण के लिए JSON के रूप में सहेजें। Apify टोकन और Node.js 20.6+ की आवश्यकता है; एक्टर स्कीमा और इनपुट पैरामीटर गतिशील रूप से प्राप्त करने के लिए mc

official

apify-competitor-intelligence

apify

एपिफाई एक्टर्स के माध्यम से गूगल मैप्स, बुकिंग.कॉम, फेसबुक, इंस्टाग्राम, यूट्यूब और टिकटॉक के लिए बहु-प्लेटफॉर्म प्रतिस्पर्धी विश्लेषण। सात प्लेटफॉर्म पर 25 से अधिक विशेष एक्टर्स को कवर करता है, प्रत्येक विशिष्ट विश्लेषण प्रकारों के लिए अनुकूलित: व्यवसाय डेटा निष्कर्षण, समीक्षा तुलना, विज्ञापन रणनीति निगरानी, सामग्री प्रदर्शन और दर्शक अंतर्दृष्टि। एक्टर स्कीमा प्राप्त करने और ग

official

apify-content-analytics

apify

Apify Actors के माध्यम से Instagram, Facebook, YouTube और TikTok के लिए मल्टी-प्लेटफ़ॉर्म कंटेंट एनालिटिक्स। सभी चार प्लेटफ़ॉर्म पर पोस्ट, रील्स, स्टोरीज़, कमेंट्स, हैशटैग, फ़ॉलोअर्स और विज्ञापनों को कवर करने वाले 17+ विशेषज्ञ Actors को सपोर्ट करता है। mcpc CLI का उपयोग करके आवश्यक इनपुट और उपलब्ध आउटपुट फ़ील्ड निर्धारित करने के लिए Actor स्कीमा को डायनामिक रूप से प्राप्त करता है। परिणाम तीन प्रारूपों में आउटपुट करता है: त्वर

official