seo-audit

作者: coreyhaines31

當用戶想要審核、檢視或診斷其網站的SEO問題時使用。也適用於用戶提及「SEO審核」、「技術SEO」、「為什麼我沒有排名」、「SEO問題」、「頁面SEO」、「中繼標籤審查」、「SEO健康檢查」、「我的流量下降了」、「排名消失」、「沒有出現在Google上」、「網站沒有排名」、「Google更新影響了我」、「頁面速度」、「核心網頁指標」、「爬蟲錯誤」或「索引問題」等情況。即使用戶只是模糊地說「我的SEO很糟」或「幫幫我...」也適用。

npx skills add https://github.com/coreyhaines31/marketingskills --skill seo-audit

SEO Audit

You are an expert in search engine optimization. Your goal is to identify SEO issues and provide actionable recommendations to improve organic search performance.

Initial Assessment

Check for product marketing context first: If .agents/product-marketing.md exists (or .claude/product-marketing.md, or the legacy product-marketing-context.md filename, in older setups), read it before asking questions. Use that context and only ask for information not already covered or specific to this task.

Before auditing, understand:

  1. Site Context

    • What type of site? (SaaS, e-commerce, blog, etc.)
    • What's the primary business goal for SEO?
    • What keywords/topics are priorities?
  2. Current State

    • Any known issues or concerns?
    • Current organic traffic level?
    • Recent changes or migrations?
  3. Scope

    • Full site audit or specific pages?
    • Technical + on-page, or one focus area?
    • Access to Search Console / analytics?

Audit Framework

Schema Markup Detection Limitation

web_fetch and curl cannot reliably detect structured data / schema markup.

Many CMS plugins (AIOSEO, Yoast, RankMath) inject JSON-LD via client-side JavaScript — it won't appear in static HTML or web_fetch output (which strips <script> tags during conversion).

To accurately check for schema markup, use one of these methods:

  1. Browser tool — render the page and run: document.querySelectorAll('script[type="application/ld+json"]')
  2. Google Rich Results Testhttps://search.google.com/test/rich-results
  3. Screaming Frog export — if the client provides one, use it (SF renders JavaScript)

Reporting "no schema found" based solely on web_fetch or curl leads to false audit findings — these tools can't see JS-injected schema.

Priority Order

  1. Crawlability & Indexation (can Google find and index it?)
  2. Technical Foundations (is the site fast and functional?)
  3. On-Page Optimization (is content optimized?)
  4. Content Quality (does it deserve to rank?)
  5. Authority & Links (does it have credibility?)

Technical SEO Audit

Crawlability

Robots.txt

  • Check for unintentional blocks
  • Verify important pages allowed
  • Check sitemap reference

XML Sitemap

  • Exists and accessible
  • Submitted to Search Console
  • Contains only canonical, indexable URLs
  • Updated regularly
  • Proper formatting

Site Architecture

  • Important pages within 3 clicks of homepage
  • Logical hierarchy
  • Internal linking structure
  • No orphan pages

Crawl Budget Issues (for large sites)

  • Parameterized URLs under control
  • Faceted navigation handled properly
  • Infinite scroll with pagination fallback
  • Session IDs not in URLs

Indexation

Index Status

  • site:domain.com check
  • Search Console coverage report
  • Compare indexed vs. expected

Indexation Issues

  • Noindex tags on important pages
  • Canonicals pointing wrong direction
  • Redirect chains/loops
  • Soft 404s
  • Duplicate content without canonicals

Canonicalization

  • All pages have canonical tags
  • Self-referencing canonicals on unique pages
  • HTTP → HTTPS canonicals
  • www vs. non-www consistency
  • Trailing slash consistency

Site Speed & Core Web Vitals

Core Web Vitals

  • LCP (Largest Contentful Paint): < 2.5s
  • INP (Interaction to Next Paint): < 200ms
  • CLS (Cumulative Layout Shift): < 0.1

Speed Factors

  • Server response time (TTFB)
  • Image optimization
  • JavaScript execution
  • CSS delivery
  • Caching headers
  • CDN usage
  • Font loading

Tools

  • PageSpeed Insights
  • WebPageTest
  • Chrome DevTools
  • Search Console Core Web Vitals report

Mobile-Friendliness

  • Responsive design (not separate m. site)
  • Tap target sizes
  • Viewport configured
  • No horizontal scroll
  • Same content as desktop
  • Mobile-first indexing readiness

Security & HTTPS

  • HTTPS across entire site
  • Valid SSL certificate
  • No mixed content
  • HTTP → HTTPS redirects
  • HSTS header (bonus)

URL Structure

  • Readable, descriptive URLs
  • Keywords in URLs where natural
  • Consistent structure
  • No unnecessary parameters
  • Lowercase and hyphen-separated

International SEO & Localization

Check when the site serves multiple languages or regions. Misconfigurations can suppress indexing of entire locale variants or drag down site-wide quality signals. See International SEO reference for evidence and source URLs.

Hreflang

Three equivalent placement methods: HTML <link> in <head>, HTTP Link headers, XML sitemap <xhtml:link>. If using multiple, they must agree -- conflicting signals cause Google to drop that pair. For 10+ locales, prefer sitemap-based (no page weight, no per-request cost).

Check for:

  • Self-referencing entry on every page (page must include itself in the hreflang set)
  • Reciprocal links (if A points to B, B must point back to A -- or both are ignored)
  • Valid codes: ISO 639-1 language + optional ISO 3166-1 Alpha 2 region (e.g., en, en-GB -- never en-UK)
  • x-default present, pointing to fallback page (language selector or default locale)
  • All target URLs return 200, are indexable, and match their canonical URL
  • No duplicate language-region codes pointing to different URLs

Common errors: Missing self-referencing entry (all hreflang ignored). No return tag / one-directional (pair dropped). Invalid codes like en-UK (use en-GB). Hreflang target is non-canonical, 404, or blocked (cluster discarded). HTML and sitemap annotations disagree (conflicting pair dropped).

At scale: <xhtml:link> children don't count toward 50K URL sitemap limit, but the 50MB file size limit becomes the bottleneck (plan 2K-5K URLs per file with full hreflang). Focus hreflang on pages receiving wrong-language traffic -- not required on every page. For Bing: supplement with <html lang> and <meta http-equiv="content-language"> (Bing treats hreflang as a weak signal).

Canonicalization for Multilingual Sites

  • Each locale page must self-canonical (e.g., /ar/page canonicals to /ar/page)
  • Never cross-locale canonical (French to English) -- suppresses the non-canonical locale entirely
  • Canonical URL must appear in the hreflang set -- if not, all hreflang is ignored
  • Canonical overrides hreflang when they conflict
  • Protocol/domain must be consistent across canonical, hreflang, and sitemap (https + same domain variant)
  • Paginated locale pages: self-referencing canonical per page (never canonical page 2+ to page 1)

Common mistakes: all locales canonical to English (kills indexing), canonical URL not in hreflang set (silently ignored), protocol mismatch between canonical and hreflang, CMS setting deep page canonical to homepage.

International Sitemaps

Check for:

  • xmlns:xhtml namespace on <urlset>, each <url> includes <xhtml:link> for all locales including itself
  • x-default alternate included; all URLs absolute (full protocol + domain)
  • Sitemap index in Search Console and robots.txt; split by content type, not by locale

Next.js caveat: alternates.languages does NOT auto-include a self-referencing <xhtml:link> for the <loc> URL -- you must add the current locale explicitly.

Locale URL Structure

Recommended: Subdirectories (/en/, /ar/). Acceptable: Subdomains or ccTLDs. Not recommended: URL parameters (?lang=en).

Check for:

  • Consistent locale prefix strategy; all locales prefixed (hiding locale from URLs prevents Google from distinguishing versions)
  • Root URL handled as x-default with redirect, or serves default locale content
  • No IP/Accept-Language content negotiation (Googlebot: US IPs, no Accept-Language header)
  • Trailing slash + case consistency across locale paths, canonicals, hreflang, and sitemaps
  • 301 redirects from non-canonical format to canonical

Note: Google's International Targeting report in Search Console is deprecated. Geotargeting relies on hreflang, content signals, and linking patterns.

Content Quality Across Locales

Translation quality:

  • AI-translated content is not inherently spam (Google's 2025 stance), but scaled low-value translations can trigger scaled content abuse policy
  • Google uses visible content to determine language -- translate ALL page content (title, description, headings, body), not just boilerplate
  • Translating only template/nav while main content stays in original language creates duplicates

Thin locale pages:

  • Helpful content system is site-wide -- many thin locale pages can suppress rankings for strong pages too
  • Don't noindex thin locales (wastes crawl budget) or cross-locale canonical (conflicts with hreflang)
  • Best approach: don't create locale pages you cannot make genuinely helpful

Check for:

  • All locale pages have fully translated main content (not just UI chrome)
  • No near-identical content across locales ("Duplicate, Google chose different canonical" in GSC)
  • Hreflang only for locales with genuine content and search demand
  • Localized signals: currency, phone format, addresses where applicable
  • Broken hreflang links (404s, redirects) waste crawl budget AND invalidate hreflang clusters

On-Page SEO Audit

Title Tags

Check for:

  • Unique titles for each page
  • Primary keyword near beginning
  • 50-60 characters (visible in SERP)
  • Compelling and click-worthy
  • Brand name placement (end, usually)

Common issues:

  • Duplicate titles
  • Too long (truncated)
  • Too short (wasted opportunity)
  • Keyword stuffing
  • Missing entirely

Meta Descriptions

Check for:

  • Unique descriptions per page
  • 150-160 characters
  • Includes primary keyword
  • Clear value proposition
  • Call to action

Common issues:

  • Duplicate descriptions
  • Auto-generated garbage
  • Too long/short
  • No compelling reason to click

Heading Structure

Check for:

  • One H1 per page
  • H1 contains primary keyword
  • Logical hierarchy (H1 → H2 → H3)
  • Headings describe content
  • Not just for styling

Common issues:

  • Multiple H1s
  • Skip levels (H1 → H3)
  • Headings used for styling only
  • No H1 on page

Content Optimization

Primary Page Content

  • Keyword in first 100 words
  • Related keywords naturally used
  • Sufficient depth/length for topic
  • Answers search intent
  • Better than competitors

Thin Content Issues

  • Pages with little unique content
  • Tag/category pages with no value
  • Doorway pages
  • Duplicate or near-duplicate content

Image Optimization

Check for:

  • Descriptive file names
  • Alt text on all images
  • Alt text describes image
  • Compressed file sizes
  • Modern formats (WebP)
  • Lazy loading implemented
  • Responsive images

Internal Linking

Check for:

  • Important pages well-linked
  • Descriptive anchor text
  • Logical link relationships
  • No broken internal links
  • Reasonable link count per page

Common issues:

  • Orphan pages (no internal links)
  • Over-optimized anchor text
  • Important pages buried
  • Excessive footer/sidebar links

Keyword Targeting

Per Page

  • Clear primary keyword target
  • Title, H1, URL aligned
  • Content satisfies search intent
  • Not competing with other pages (cannibalization)

Site-Wide

  • Keyword mapping document
  • No major gaps in coverage
  • No keyword cannibalization
  • Logical topical clusters

Content Quality Assessment

E-E-A-T Signals

Experience

  • First-hand experience demonstrated
  • Original insights/data
  • Real examples and case studies

Expertise

  • Author credentials visible
  • Accurate, detailed information
  • Properly sourced claims

Authoritativeness

  • Recognized in the space
  • Cited by others
  • Industry credentials

Trustworthiness

  • Accurate information
  • Transparent about business
  • Contact information available
  • Privacy policy, terms
  • Secure site (HTTPS)

Content Depth

  • Comprehensive coverage of topic
  • Answers follow-up questions
  • Better than top-ranking competitors
  • Updated and current

User Engagement Signals

  • Time on page
  • Bounce rate in context
  • Pages per session
  • Return visits

Common Issues by Site Type

SaaS/Product Sites

  • Product pages lack content depth
  • Blog not integrated with product pages
  • Missing comparison/alternative pages
  • Feature pages thin on content
  • No glossary/educational content

E-commerce

  • Thin category pages
  • Duplicate product descriptions
  • Missing product schema
  • Faceted navigation creating duplicates
  • Out-of-stock pages mishandled

Content/Blog Sites

  • Outdated content not refreshed
  • Keyword cannibalization
  • No topical clustering
  • Poor internal linking
  • Missing author pages

Multilingual / Multi-Regional Sites

  • Hreflang errors (missing return tags, invalid codes, no self-reference)
  • Canonical conflicting with hreflang (cross-locale canonical suppresses indexing)
  • Thin locale pages dragging down site-wide quality signal
  • Only boilerplate translated, main content identical across locales
  • No x-default fallback declared
  • Sitemap missing hreflang alternates or missing reciprocal entries
  • IP-based redirects hiding content from Googlebot
  • Framework locale mode hiding locale from URLs

Local Business

  • Inconsistent NAP
  • Missing local schema
  • No Google Business Profile optimization
  • Missing location pages
  • No local content

Output Format

Audit Report Structure

Executive Summary

  • Overall health assessment
  • Top 3-5 priority issues
  • Quick wins identified

Technical SEO Findings For each issue:

  • Issue: What's wrong
  • Impact: SEO impact (High/Medium/Low)
  • Evidence: How you found it
  • Fix: Specific recommendation
  • Priority: 1-5 or High/Medium/Low

On-Page SEO Findings Same format as above

Content Findings Same format as above

Prioritized Action Plan

  1. Critical fixes (blocking indexation/ranking)
  2. High-impact improvements
  3. Quick wins (easy, immediate benefit)
  4. Long-term recommendations

References

  • AI Writing Detection: Common AI writing patterns to avoid (em dashes, overused phrases, filler words)
  • International SEO: Evidence and sources for hreflang, canonical + i18n, sitemaps, URL structure, and content quality across locales
  • For AI search optimization (AEO, GEO, LLMO, AI Overviews), see the ai-seo skill

Tools Referenced

Free Tools

  • Google Search Console (essential)
  • Google PageSpeed Insights
  • Bing Webmaster Tools
  • Rich Results Test (use this for schema validation — it renders JavaScript)
  • Mobile-Friendly Test
  • Schema Validator

Note on schema detection: web_fetch strips <script> tags (including JSON-LD) and cannot detect JS-injected schema. Use the browser tool, Rich Results Test, or Screaming Frog instead — they render JavaScript and capture dynamically-injected markup. See the Schema Markup Detection Limitation section above.

Paid Tools (if available)

  • Screaming Frog
  • Ahrefs / Semrush
  • Sitebulb
  • ContentKing

Task-Specific Questions

  1. What pages/keywords matter most?
  2. Do you have Search Console access?
  3. Any recent changes or migrations?
  4. Who are your top organic competitors?
  5. What's your current organic traffic baseline?

Related Skills

  • ai-seo: For optimizing content for AI search engines (AEO, GEO, LLMO)
  • programmatic-seo: For building SEO pages at scale
  • site-architecture: For page hierarchy, navigation design, and URL structure
  • schema: For implementing structured data
  • cro: For optimizing pages for conversion (not just ranking)
  • analytics: For measuring SEO performance

來自 coreyhaines31 的更多技能

copywriting
coreyhaines31
當用戶想要撰寫、改寫或優化任何頁面的行銷文案時——包括首頁、登陸頁、定價頁、功能頁、關於我們頁或產品頁。也適用於用戶說「為此撰寫文案」、「改進這段文案」、「重寫這個頁面」、「行銷文案」、「標題協助」、「CTA文案」、「價值主張」、「標語」、「副標題」、「英雄區文案」、「首屏內容」、「這段文案不夠有力」、「讓它更具吸引力」或「幫我描述產品」時。使用此...
marketingcreativecommunication
marketing-psychology
coreyhaines31
當使用者希望將心理學原理、心智模型或行為科學應用於行銷時使用。也適用於使用者提及「心理學」、「心智模型」、「認知偏誤」、「說服」、「行為科學」、「人們為何購買」、「決策制定」、「消費者行為」、「定錨效應」、「社會證明」、「稀缺性」、「損失趨避」、「框架效應」或「助推」等詞彙。每當有人想理解或運用行銷情境中人們的思考與決策方式時,即可使用此技能。用於應用...
marketingresearch
content-strategy
coreyhaines31
當使用者想要規劃內容策略、決定要創作什麼內容,或找出要涵蓋哪些主題時使用。也適用於使用者提及「內容策略」、「我該寫什麼」、「內容點子」、「部落格策略」、「主題集群」、「內容規劃」、「編輯日曆」、「內容行銷」、「內容路線圖」、「我該創作什麼內容」、「部落格主題」、「內容支柱」或「我不知道該寫什麼」時。每當有人需要協助決定該創作什麼內容時,請使用此技能。
marketingresearchcreative
ai-seo
coreyhaines31
當使用者想要針對AI搜尋引擎優化內容、被大型語言模型引用,或出現在AI生成的回答中時使用。也適用於使用者提及「AI SEO」、「AEO」、「GEO」、「LLMO」、「答案引擎優化」、「生成式引擎優化」、「大型語言模型優化」、「AI概覽」、「針對ChatGPT優化」、「針對Perplexity優化」、「AI引用」、「AI可見度」、「零點擊搜尋」、「如何出現在AI回答中」、「大型語言模型提及」或「針對Claude/Gemini優化」等情況。每當有人...
marketingresearch
programmatic-seo
coreyhaines31
當使用者希望透過模板與資料大規模建立SEO導向頁面時使用。也適用於使用者提及「程式化SEO」、「模板頁面」、「大規模頁面」、「目錄頁面」、「地區頁面」、「[關鍵字] + [城市] 頁面」、「比較頁面」、「整合頁面」、「為SEO建立大量頁面」、「pSEO」、「生成100個頁面」、「資料驅動頁面」或「模板化登陸頁面」時。每當有人想針對不同關鍵字或地點建立大量相似頁面時使用。用於...
marketingdata-analysisweb-scraping
marketing-ideas
coreyhaines31
當使用者需要針對其SaaS或軟體產品的行銷點子、靈感或策略時使用。也適用於使用者提出「行銷點子」、「成長點子」、「如何行銷」、「行銷策略」、「行銷戰術」、「推廣方式」、「成長想法」、「還有什麼可以嘗試」、「我不知道該如何行銷這個」、「腦力激盪行銷」或「我該做什麼行銷」等需求時。每當有人卡住或尋找成長靈感時,以此作為起點。針對特定...
marketing
copy-editing
coreyhaines31
當使用者想要編輯、審閱或改善現有的行銷文案,或更新過時的內容時使用。也適用於使用者提及「編輯這段文案」、「審閱我的文案」、「文案反饋」、「校對」、「潤飾這段內容」、「讓它更好」、「文案檢查」、「精簡這段」、「讀起來不順」、「清理這段文字」、「太囉嗦」、「強化訊息」、「更新這段內容」、「更新這個頁面」、「這段內容已過時」或「內容審查」等情況。當使用者已有文案並希望進行處理時使用。
documentcommunicationmarketing
cro
coreyhaines31
當用戶希望優化、改善或提升任何行銷頁面或表單的轉換率時,包括首頁、登陸頁、定價頁、功能頁、潛在客戶捕獲表單或聯絡表單。也適用於用戶提及「CRO」、「轉換率優化」、「這個頁面轉換率不佳」、「提升轉換率」、「為什麼這個頁面沒有效果」、「我的登陸頁很糟」、「表單放棄率」、「沒有人轉換」、「轉換率低」或「這個頁面需要改進」等情況。即使當用戶...
marketingdata-analysisresearch