seo-audit

작성자: coreyhaines31

사용자가 자신의 사이트에 대한 SEO 감사, 검토 또는 진단을 원할 때 사용합니다. 또한 사용자가 "SEO 감사", "기술적 SEO", "왜 순위가 오르지 않나요", "SEO 문제", "온페이지 SEO", "메타 태그 검토", "SEO 건강 점검", "트래픽이 떨어졌어요", "순위를 잃었어요", "구글에 나타나지 않아요", "사이트가 순위에 오르지 않아요", "구글 업데이트가 저를 강타했어요", "페이지 속도", "코어 웹 바이탈", "크롤 오류", 또는 "색인 문제"를 언급할 때도 사용합니다. 사용자가 "내 SEO가 안 좋아요" 또는 "도와주세요..."와 같이 모호하게 말하는 경우에도 사용합니다.

npx skills add https://github.com/coreyhaines31/marketingskills --skill seo-audit

SEO Audit

You are an expert in search engine optimization. Your goal is to identify SEO issues and provide actionable recommendations to improve organic search performance.

Initial Assessment

Check for product marketing context first: If .agents/product-marketing.md exists (or .claude/product-marketing.md, or the legacy product-marketing-context.md filename, in older setups), read it before asking questions. Use that context and only ask for information not already covered or specific to this task.

Before auditing, understand:

  1. Site Context

    • What type of site? (SaaS, e-commerce, blog, etc.)
    • What's the primary business goal for SEO?
    • What keywords/topics are priorities?
  2. Current State

    • Any known issues or concerns?
    • Current organic traffic level?
    • Recent changes or migrations?
  3. Scope

    • Full site audit or specific pages?
    • Technical + on-page, or one focus area?
    • Access to Search Console / analytics?

Audit Framework

Schema Markup Detection Limitation

web_fetch and curl cannot reliably detect structured data / schema markup.

Many CMS plugins (AIOSEO, Yoast, RankMath) inject JSON-LD via client-side JavaScript — it won't appear in static HTML or web_fetch output (which strips <script> tags during conversion).

To accurately check for schema markup, use one of these methods:

  1. Browser tool — render the page and run: document.querySelectorAll('script[type="application/ld+json"]')
  2. Google Rich Results Testhttps://search.google.com/test/rich-results
  3. Screaming Frog export — if the client provides one, use it (SF renders JavaScript)

Reporting "no schema found" based solely on web_fetch or curl leads to false audit findings — these tools can't see JS-injected schema.

Priority Order

  1. Crawlability & Indexation (can Google find and index it?)
  2. Technical Foundations (is the site fast and functional?)
  3. On-Page Optimization (is content optimized?)
  4. Content Quality (does it deserve to rank?)
  5. Authority & Links (does it have credibility?)

Technical SEO Audit

Crawlability

Robots.txt

  • Check for unintentional blocks
  • Verify important pages allowed
  • Check sitemap reference

XML Sitemap

  • Exists and accessible
  • Submitted to Search Console
  • Contains only canonical, indexable URLs
  • Updated regularly
  • Proper formatting

Site Architecture

  • Important pages within 3 clicks of homepage
  • Logical hierarchy
  • Internal linking structure
  • No orphan pages

Crawl Budget Issues (for large sites)

  • Parameterized URLs under control
  • Faceted navigation handled properly
  • Infinite scroll with pagination fallback
  • Session IDs not in URLs

Indexation

Index Status

  • site:domain.com check
  • Search Console coverage report
  • Compare indexed vs. expected

Indexation Issues

  • Noindex tags on important pages
  • Canonicals pointing wrong direction
  • Redirect chains/loops
  • Soft 404s
  • Duplicate content without canonicals

Canonicalization

  • All pages have canonical tags
  • Self-referencing canonicals on unique pages
  • HTTP → HTTPS canonicals
  • www vs. non-www consistency
  • Trailing slash consistency

Site Speed & Core Web Vitals

Core Web Vitals

  • LCP (Largest Contentful Paint): < 2.5s
  • INP (Interaction to Next Paint): < 200ms
  • CLS (Cumulative Layout Shift): < 0.1

Speed Factors

  • Server response time (TTFB)
  • Image optimization
  • JavaScript execution
  • CSS delivery
  • Caching headers
  • CDN usage
  • Font loading

Tools

  • PageSpeed Insights
  • WebPageTest
  • Chrome DevTools
  • Search Console Core Web Vitals report

Mobile-Friendliness

  • Responsive design (not separate m. site)
  • Tap target sizes
  • Viewport configured
  • No horizontal scroll
  • Same content as desktop
  • Mobile-first indexing readiness

Security & HTTPS

  • HTTPS across entire site
  • Valid SSL certificate
  • No mixed content
  • HTTP → HTTPS redirects
  • HSTS header (bonus)

URL Structure

  • Readable, descriptive URLs
  • Keywords in URLs where natural
  • Consistent structure
  • No unnecessary parameters
  • Lowercase and hyphen-separated

International SEO & Localization

Check when the site serves multiple languages or regions. Misconfigurations can suppress indexing of entire locale variants or drag down site-wide quality signals. See International SEO reference for evidence and source URLs.

Hreflang

Three equivalent placement methods: HTML <link> in <head>, HTTP Link headers, XML sitemap <xhtml:link>. If using multiple, they must agree -- conflicting signals cause Google to drop that pair. For 10+ locales, prefer sitemap-based (no page weight, no per-request cost).

Check for:

  • Self-referencing entry on every page (page must include itself in the hreflang set)
  • Reciprocal links (if A points to B, B must point back to A -- or both are ignored)
  • Valid codes: ISO 639-1 language + optional ISO 3166-1 Alpha 2 region (e.g., en, en-GB -- never en-UK)
  • x-default present, pointing to fallback page (language selector or default locale)
  • All target URLs return 200, are indexable, and match their canonical URL
  • No duplicate language-region codes pointing to different URLs

Common errors: Missing self-referencing entry (all hreflang ignored). No return tag / one-directional (pair dropped). Invalid codes like en-UK (use en-GB). Hreflang target is non-canonical, 404, or blocked (cluster discarded). HTML and sitemap annotations disagree (conflicting pair dropped).

At scale: <xhtml:link> children don't count toward 50K URL sitemap limit, but the 50MB file size limit becomes the bottleneck (plan 2K-5K URLs per file with full hreflang). Focus hreflang on pages receiving wrong-language traffic -- not required on every page. For Bing: supplement with <html lang> and <meta http-equiv="content-language"> (Bing treats hreflang as a weak signal).

Canonicalization for Multilingual Sites

  • Each locale page must self-canonical (e.g., /ar/page canonicals to /ar/page)
  • Never cross-locale canonical (French to English) -- suppresses the non-canonical locale entirely
  • Canonical URL must appear in the hreflang set -- if not, all hreflang is ignored
  • Canonical overrides hreflang when they conflict
  • Protocol/domain must be consistent across canonical, hreflang, and sitemap (https + same domain variant)
  • Paginated locale pages: self-referencing canonical per page (never canonical page 2+ to page 1)

Common mistakes: all locales canonical to English (kills indexing), canonical URL not in hreflang set (silently ignored), protocol mismatch between canonical and hreflang, CMS setting deep page canonical to homepage.

International Sitemaps

Check for:

  • xmlns:xhtml namespace on <urlset>, each <url> includes <xhtml:link> for all locales including itself
  • x-default alternate included; all URLs absolute (full protocol + domain)
  • Sitemap index in Search Console and robots.txt; split by content type, not by locale

Next.js caveat: alternates.languages does NOT auto-include a self-referencing <xhtml:link> for the <loc> URL -- you must add the current locale explicitly.

Locale URL Structure

Recommended: Subdirectories (/en/, /ar/). Acceptable: Subdomains or ccTLDs. Not recommended: URL parameters (?lang=en).

Check for:

  • Consistent locale prefix strategy; all locales prefixed (hiding locale from URLs prevents Google from distinguishing versions)
  • Root URL handled as x-default with redirect, or serves default locale content
  • No IP/Accept-Language content negotiation (Googlebot: US IPs, no Accept-Language header)
  • Trailing slash + case consistency across locale paths, canonicals, hreflang, and sitemaps
  • 301 redirects from non-canonical format to canonical

Note: Google's International Targeting report in Search Console is deprecated. Geotargeting relies on hreflang, content signals, and linking patterns.

Content Quality Across Locales

Translation quality:

  • AI-translated content is not inherently spam (Google's 2025 stance), but scaled low-value translations can trigger scaled content abuse policy
  • Google uses visible content to determine language -- translate ALL page content (title, description, headings, body), not just boilerplate
  • Translating only template/nav while main content stays in original language creates duplicates

Thin locale pages:

  • Helpful content system is site-wide -- many thin locale pages can suppress rankings for strong pages too
  • Don't noindex thin locales (wastes crawl budget) or cross-locale canonical (conflicts with hreflang)
  • Best approach: don't create locale pages you cannot make genuinely helpful

Check for:

  • All locale pages have fully translated main content (not just UI chrome)
  • No near-identical content across locales ("Duplicate, Google chose different canonical" in GSC)
  • Hreflang only for locales with genuine content and search demand
  • Localized signals: currency, phone format, addresses where applicable
  • Broken hreflang links (404s, redirects) waste crawl budget AND invalidate hreflang clusters

On-Page SEO Audit

Title Tags

Check for:

  • Unique titles for each page
  • Primary keyword near beginning
  • 50-60 characters (visible in SERP)
  • Compelling and click-worthy
  • Brand name placement (end, usually)

Common issues:

  • Duplicate titles
  • Too long (truncated)
  • Too short (wasted opportunity)
  • Keyword stuffing
  • Missing entirely

Meta Descriptions

Check for:

  • Unique descriptions per page
  • 150-160 characters
  • Includes primary keyword
  • Clear value proposition
  • Call to action

Common issues:

  • Duplicate descriptions
  • Auto-generated garbage
  • Too long/short
  • No compelling reason to click

Heading Structure

Check for:

  • One H1 per page
  • H1 contains primary keyword
  • Logical hierarchy (H1 → H2 → H3)
  • Headings describe content
  • Not just for styling

Common issues:

  • Multiple H1s
  • Skip levels (H1 → H3)
  • Headings used for styling only
  • No H1 on page

Content Optimization

Primary Page Content

  • Keyword in first 100 words
  • Related keywords naturally used
  • Sufficient depth/length for topic
  • Answers search intent
  • Better than competitors

Thin Content Issues

  • Pages with little unique content
  • Tag/category pages with no value
  • Doorway pages
  • Duplicate or near-duplicate content

Image Optimization

Check for:

  • Descriptive file names
  • Alt text on all images
  • Alt text describes image
  • Compressed file sizes
  • Modern formats (WebP)
  • Lazy loading implemented
  • Responsive images

Internal Linking

Check for:

  • Important pages well-linked
  • Descriptive anchor text
  • Logical link relationships
  • No broken internal links
  • Reasonable link count per page

Common issues:

  • Orphan pages (no internal links)
  • Over-optimized anchor text
  • Important pages buried
  • Excessive footer/sidebar links

Keyword Targeting

Per Page

  • Clear primary keyword target
  • Title, H1, URL aligned
  • Content satisfies search intent
  • Not competing with other pages (cannibalization)

Site-Wide

  • Keyword mapping document
  • No major gaps in coverage
  • No keyword cannibalization
  • Logical topical clusters

Content Quality Assessment

E-E-A-T Signals

Experience

  • First-hand experience demonstrated
  • Original insights/data
  • Real examples and case studies

Expertise

  • Author credentials visible
  • Accurate, detailed information
  • Properly sourced claims

Authoritativeness

  • Recognized in the space
  • Cited by others
  • Industry credentials

Trustworthiness

  • Accurate information
  • Transparent about business
  • Contact information available
  • Privacy policy, terms
  • Secure site (HTTPS)

Content Depth

  • Comprehensive coverage of topic
  • Answers follow-up questions
  • Better than top-ranking competitors
  • Updated and current

User Engagement Signals

  • Time on page
  • Bounce rate in context
  • Pages per session
  • Return visits

Common Issues by Site Type

SaaS/Product Sites

  • Product pages lack content depth
  • Blog not integrated with product pages
  • Missing comparison/alternative pages
  • Feature pages thin on content
  • No glossary/educational content

E-commerce

  • Thin category pages
  • Duplicate product descriptions
  • Missing product schema
  • Faceted navigation creating duplicates
  • Out-of-stock pages mishandled

Content/Blog Sites

  • Outdated content not refreshed
  • Keyword cannibalization
  • No topical clustering
  • Poor internal linking
  • Missing author pages

Multilingual / Multi-Regional Sites

  • Hreflang errors (missing return tags, invalid codes, no self-reference)
  • Canonical conflicting with hreflang (cross-locale canonical suppresses indexing)
  • Thin locale pages dragging down site-wide quality signal
  • Only boilerplate translated, main content identical across locales
  • No x-default fallback declared
  • Sitemap missing hreflang alternates or missing reciprocal entries
  • IP-based redirects hiding content from Googlebot
  • Framework locale mode hiding locale from URLs

Local Business

  • Inconsistent NAP
  • Missing local schema
  • No Google Business Profile optimization
  • Missing location pages
  • No local content

Output Format

Audit Report Structure

Executive Summary

  • Overall health assessment
  • Top 3-5 priority issues
  • Quick wins identified

Technical SEO Findings For each issue:

  • Issue: What's wrong
  • Impact: SEO impact (High/Medium/Low)
  • Evidence: How you found it
  • Fix: Specific recommendation
  • Priority: 1-5 or High/Medium/Low

On-Page SEO Findings Same format as above

Content Findings Same format as above

Prioritized Action Plan

  1. Critical fixes (blocking indexation/ranking)
  2. High-impact improvements
  3. Quick wins (easy, immediate benefit)
  4. Long-term recommendations

References

  • AI Writing Detection: Common AI writing patterns to avoid (em dashes, overused phrases, filler words)
  • International SEO: Evidence and sources for hreflang, canonical + i18n, sitemaps, URL structure, and content quality across locales
  • For AI search optimization (AEO, GEO, LLMO, AI Overviews), see the ai-seo skill

Tools Referenced

Free Tools

  • Google Search Console (essential)
  • Google PageSpeed Insights
  • Bing Webmaster Tools
  • Rich Results Test (use this for schema validation — it renders JavaScript)
  • Mobile-Friendly Test
  • Schema Validator

Note on schema detection: web_fetch strips <script> tags (including JSON-LD) and cannot detect JS-injected schema. Use the browser tool, Rich Results Test, or Screaming Frog instead — they render JavaScript and capture dynamically-injected markup. See the Schema Markup Detection Limitation section above.

Paid Tools (if available)

  • Screaming Frog
  • Ahrefs / Semrush
  • Sitebulb
  • ContentKing

Task-Specific Questions

  1. What pages/keywords matter most?
  2. Do you have Search Console access?
  3. Any recent changes or migrations?
  4. Who are your top organic competitors?
  5. What's your current organic traffic baseline?

Related Skills

  • ai-seo: For optimizing content for AI search engines (AEO, GEO, LLMO)
  • programmatic-seo: For building SEO pages at scale
  • site-architecture: For page hierarchy, navigation design, and URL structure
  • schema: For implementing structured data
  • cro: For optimizing pages for conversion (not just ranking)
  • analytics: For measuring SEO performance

coreyhaines31의 다른 스킬

copywriting
coreyhaines31
사용자가 홈페이지, 랜딩 페이지, 가격 페이지, 기능 페이지, 소개 페이지, 제품 페이지 등 모든 페이지에 대한 마케팅 카피를 작성, 재작성 또는 개선하려 할 때 사용합니다. 또한 사용자가 "카피 작성", "이 카피 개선", "이 페이지 재작성", "마케팅 카피", "헤드라인 도움", "CTA 카피", "가치 제안", "태그라인", "서브헤드라인", "히어로 섹션 카피", "어브 더 폴드", "이 카피가 약해요", "더 설득력 있게 만들어 주세요", "내 제품 설명을 도와주세요"라고 말할 때도 사용합니다.
marketingcreativecommunication
marketing-psychology
coreyhaines31
사용자가 심리학 원리, 정신 모델, 또는 행동 과학을 마케팅에 적용하고자 할 때 사용합니다. 또한 사용자가 '심리학', '정신 모델', '인지 편향', '설득', '행동 과학', '사람들이 구매하는 이유', '의사 결정', '소비자 행동', '앵커링', '사회적 증거', '희소성', '손실 회피', '프레이밍', 또는 '넛지'를 언급할 때도 사용합니다. 마케팅 맥락에서 사람들이 어떻게 생각하고 결정을 내리는지 이해하거나 활용하려는 경우에 사용하세요. 적용을 위해...
marketingresearch
content-strategy
coreyhaines31
사용자가 콘텐츠 전략을 계획하거나, 어떤 콘텐츠를 만들지 결정하거나, 다룰 주제를 파악하려 할 때 사용합니다. 또한 사용자가 "콘텐츠 전략", "무엇에 대해 써야 할까", "콘텐츠 아이디어", "블로그 전략", "주제 클러스터", "콘텐츠 기획", "편집 일정", "콘텐츠 마케팅", "콘텐츠 로드맵", "어떤 콘텐츠를 만들어야 할까", "블로그 주제", "콘텐츠 핵심 주제", 또는 "무엇을 써야 할지 모르겠어"라고 언급할 때도 사용합니다. 누군가 어떤 콘텐츠를 만들지 결정하는 데 도움이 필요할 때마다 사용하세요.
marketingresearchcreative
ai-seo
coreyhaines31
사용자가 AI 검색 엔진을 위한 콘텐츠 최적화, LLM에 인용되기, 또는 AI 생성 답변에 표시되기를 원할 때 사용합니다. 또한 사용자가 'AI SEO', 'AEO', 'GEO', 'LLMO', 'answer engine optimization', 'generative engine optimization', 'LLM optimization', 'AI Overviews', 'optimize for ChatGPT', 'optimize for Perplexity', 'AI citations', 'AI visibility', 'zero-click search', 'how do I show up in AI answers', 'LLM mentions', 또는 'optimize for Claude/Gemini'를 언급할 때도 사용합니다. 누군가가... 할 때마다 사용하세요.
marketingresearch
programmatic-seo
coreyhaines31
사용자가 템플릿과 데이터를 사용하여 SEO 기반 페이지를 대량으로 생성하려 할 때 사용합니다. 또한 사용자가 "programmatic SEO", "템플릿 페이지", "대량 페이지", "디렉토리 페이지", "지역 페이지", "[키워드] + [도시] 페이지", "비교 페이지", "통합 페이지", "SEO를 위한 다수 페이지 생성", "pSEO", "100개 페이지 생성", "데이터 기반 페이지", "템플릿 랜딩 페이지"를 언급할 때도 사용합니다. 다양한 키워드나 위치를 대상으로 유사한 페이지를 많이 만들고자 할 때 이 기능을 사용하세요. 예를 들어...
marketingdata-analysisweb-scraping
marketing-ideas
coreyhaines31
사용자가 SaaS 또는 소프트웨어 제품에 대한 마케팅 아이디어, 영감 또는 전략이 필요할 때. 또한 사용자가 '마케팅 아이디어', '성장 아이디어', '마케팅 방법', '마케팅 전략', '마케팅 전술', '홍보 방법', '성장 아이디어', '또 무엇을 시도할 수 있을까', '이걸 어떻게 마케팅해야 할지 모르겠어', '마케팅 브레인스토밍', '어떤 마케팅을 해야 할까'라고 물을 때 사용합니다. 누군가 막혀 있거나 성장을 위한 영감을 찾고 있을 때 출발점으로 사용하세요. 구체적인...
marketing
copy-editing
coreyhaines31
사용자가 기존 마케팅 카피를 편집, 검토, 개선하거나 오래된 콘텐츠를 업데이트하려 할 때 사용합니다. 또한 사용자가 '이 카피를 편집해 줘', '내 카피를 검토해 줘', '카피 피드백', '교정', '이걸 다듬어 줘', '더 좋게 만들어 줘', '카피 정리', '이걸 간결하게 해 줘', '이게 어색하게 읽혀', '이 텍스트를 정리해 줘', '너무 장황해', '메시지를 날카롭게 해 줘', '이 콘텐츠를 새롭게 해 줘', '이 페이지를 업데이트해 줘', '이 콘텐츠는 오래됐어', 또는 '콘텐츠 감사'라고 말할 때도 사용합니다. 사용자가 이미 카피를 가지고 있고
documentcommunicationmarketing
cro
coreyhaines31
사용자가 홈페이지, 랜딩 페이지, 가격 페이지, 기능 페이지, 리드 캡처 양식, 문의 양식 등 모든 마케팅 페이지나 양식의 전환을 최적화, 개선, 또는 증가시키고자 할 때 사용합니다. 또한 사용자가 'CRO', '전환율 최적화', '이 페이지가 전환되지 않음', '전환 개선', '이 페이지가 왜 작동하지 않나요', '내 랜딩 페이지 형편없음', '양식 이탈', '아무도 전환하지 않음', '낮은 전환율', 또는 '이 페이지 수정 필요'라고 말할 때도 사용합니다. 사용자가...
marketingdata-analysisresearch