QA Skills MCP Server

43 QA and test-automation skills for Claude Code, Codex, Cursor, and any Agent Skills Standard runtime.

Documentation

QA Skills for AI Agents

QA and test automation skills for Claude Code, Codex, Cursor, Gemini CLI, VS Code, and other agents that support the Agent Skills Standard.

Website: qa-skills.com

npx skills add petrkindlmann/qa-skills

43 skills covering:

  • Playwright and Cypress automation
  • API, unit, and mobile testing
  • Test strategy and risk-based planning
  • CI/CD integration and test environments
  • Accessibility, security, and compliance
  • Visual regression and performance testing
  • AI-assisted test generation and bug triage
  • Chaos engineering and observability
  • Coverage analysis and QA dashboards

Based on patterns from QA automation work across 6+ production sites.

Skills Website License: MIT Agent Skills Standard Playwright


Quick start

Option 1: Install specific skills (recommended)

npx skills add petrkindlmann/qa-skills playwright-automation test-strategy

Option 2: Clone the full repo

git clone https://github.com/petrkindlmann/qa-skills.git .skills/qa-skills

Option 3: Add as a git submodule

git submodule add https://github.com/petrkindlmann/qa-skills.git .skills/qa-skills

Option 4: Manual download

Download individual skill folders from skills/ and place them in your project's .skills/ directory.


Usage

Ask your AI agent in natural language. The right skill activates automatically.

You saySkill activated
"Write Playwright tests for our checkout flow"playwright-automation
"Create a QA strategy for this project"test-strategy
"Generate tests from this PRD"ai-test-generation
"This bug keeps happening in prod, classify and triage it"ai-bug-triage
"Set up test reporting in GitHub Actions"ci-cd-integration + qa-metrics
"What should we test before this release?"release-readiness
"Run a visual regression check on the homepage"visual-testing
"Load test our API with 1000 concurrent users"performance-testing
"Check our app for OWASP Top 10 vulnerabilities"security-testing
"Set up synthetic monitoring for critical flows"synthetic-monitoring

Skill categories

43 skills across 10 categories. Each skill keeps a lean SKILL.md (heavy code offloaded to references/, loaded on demand) with cross-references to related skills.

Foundation (1)

qa-project-context — project context template that every other skill reads first

Strategy (4)

test-strategy · test-planning · risk-based-testing · exploratory-testing — QA strategy, sprint/release test plans, risk-based prioritization, session-based exploratory testing

Automation (8)

playwright-automation · cypress-automation · selector-drift-recovery · api-testing · unit-testing · mobile-testing · visual-testing · performance-testing — Playwright E2E, Cypress component/E2E, bulk post-refactor selector regeneration, REST/GraphQL, Jest/Vitest/pytest, Appium/Detox, visual regression, k6 load testing and Lighthouse CI

Specialized (4)

accessibility-testing · security-testing · cross-browser-testing · database-testing — WCAG compliance, OWASP Top 10, analytics-driven browser matrices, migration testing and schema validation

AI-augmented QA (4)

ai-test-generation · ai-bug-triage · test-reliability · ai-qa-review — LLM test generation, automated bug triage, flaky test management, test quality review

Infrastructure (5)

ci-cd-integration · test-environments · test-data-management · contract-testing · service-virtualization — CI/CD pipelines, environment strategy, data factories, Pact consumer-driven contracts, WireMock/MSW

Metrics (3)

qa-metrics · qa-dashboard · coverage-analysis — quality gates and KPIs, Allure/Grafana/ReportPortal dashboards, coverage-as-ratchet in CI

Process (6)

shift-left-testing · qa-project-bootstrap · release-readiness · quality-postmortem · compliance-testing · qa-report-humanizer — shift-left patterns, QA onboarding, go/no-go checklists, blameless postmortems, GDPR/CMP compliance, QA report humanization

Production and observability (3)

testing-in-production · synthetic-monitoring · observability-driven-testing — feature flag validation, scheduled synthetic probes, trace-based test assertions

Knowledge and migration (3)

ai-system-testing · chaos-engineering · test-migration — LLM/AI feature testing, controlled fault injection, framework migration guides (Selenium/Cypress/Jest to modern stacks)


Full skills table

SkillDescriptionCategory
qa-project-contextProject context template covering tech stack, test frameworks, CI/CD, environments, quality goals. Every other skill reads this first.Foundation
test-strategyQA strategy creation with risk-based prioritization, test pyramid design, entry/exit criteria, and tool selection rationale.Strategy
test-planningSprint and release test plans. Feature decomposition, requirements-to-test mapping, effort estimation, resource allocation.Strategy
risk-based-testingRisk assessment matrices, priority-based test selection, impact/likelihood analysis, regression risk scoring.Strategy
exploratory-testingSession-Based Test Management (SBTM), charter writing, heuristic-based exploration (HICCUPS, FEW HICCUPS), debrief templates.Strategy
playwright-automationPlaywright E2E testing. Page Object Model, fixtures, parallel execution, API mocking, visual comparisons, CI integration.Automation
cypress-automationCypress test suites with component testing, E2E testing, custom commands, cy.intercept, Cypress Cloud, and TypeScript support.Automation
selector-drift-recoveryBulk-regenerate broken test selectors after a UI refactor. Snapshot old vs new DOM, map locators with confidence scores, ship a single PR.Automation
api-testingREST and GraphQL testing with schema validation, contract testing patterns, auth flows, and response assertions.Automation
unit-testingJest, Vitest, and pytest patterns. Mocking strategies, coverage thresholds, snapshot testing, test doubles taxonomy.Automation
mobile-testingMobile testing with Appium 2.0 and Detox for React Native. Device farms, gesture simulation, deep link testing, push notifications.Automation
visual-testingVisual regression testing with Playwright screenshots, Chromatic, Percy, and Argos CI. Baseline management, diff thresholds, dynamic content masking.Automation
performance-testingk6 load/stress/soak scripts, Lighthouse CI for Web Vitals, performance budgets as CI gates.Automation
accessibility-testingWCAG 2.1 compliance testing, axe-core integration, screen reader testing, keyboard navigation, color contrast validation.Specialized
security-testingOWASP Top 10 vulnerability testing. ZAP integration, dependency scanning (Snyk/Dependabot), SAST with ESLint security plugins.Specialized
cross-browser-testingAnalytics-driven browser test matrices, BrowserStack/Sauce Labs configuration, Playwright browser channels, common rendering issues.Specialized
database-testingDatabase integrity validation, migration testing (forward/backward), schema constraints, seed data management, query performance.Specialized
ai-test-generationGenerate tests from specs, PRDs, and user stories using a staged pipeline with guardrails. Coverage matrix before code.AI-QA
ai-bug-triageClassify bugs by severity/component/root cause. Deduplicate issues, analyze CI failures, generate tickets.AI-QA
test-reliabilityLocator resilience, flaky test classification by root cause, quarantine management, confidence-scored auto-repair.AI-QA
ai-qa-reviewTest quality review across five smell dimensions. Coverage gap detection, testability analysis, anti-pattern detection.AI-QA
ci-cd-integrationGitHub Actions and GitLab CI pipeline templates. Parallelism, artifact management, flaky test quarantine, test result publishing.Infrastructure
test-environmentsEnvironment strategy for dev, staging, preview, and production. Docker Compose, seed data, environment parity, cleanup.Infrastructure
test-data-managementTest data with factories, fixtures, synthetic data generation, database seeding, data cleanup, environment isolation.Infrastructure
contract-testingConsumer-driven contract testing with Pact.js. Consumer tests, provider verification, Pact Broker, can-i-deploy gates.Infrastructure
service-virtualizationDependency isolation decision framework. Mocks, stubs, fakes, record-replay, WireMock, MSW (Mock Service Worker).Infrastructure
qa-metricsQA metrics with formulas: coverage %, flakiness rate, defect escape rate, MTTR, test execution trends, quality gates.Metrics
qa-dashboardQA dashboards with Allure Report, Grafana, and ReportPortal. Test execution visualization, trend analysis, stakeholder reports.Metrics
coverage-analysisCoverage measurement with Istanbul/V8/c8/coverage.py. Gap analysis, coverage-as-ratchet in CI, meaningful vs vanity coverage.Metrics
shift-left-testingMove quality earlier. Dev/QA pairing, Three Amigos, TDD facilitation, PR review checklists, pre-merge quality gates.Process
qa-project-bootstrapOnboard a QA engineer to an existing codebase. First 30 days checklist, test architecture audit, framework walkthrough, mentorship patterns.Process
release-readinessGo/no-go checklists, smoke test suite design, rollback criteria, staged rollout validation, release sign-off.Process
quality-postmortemBlameless postmortems for escaped defects. Bug pattern analysis, 5 Whys root cause analysis, test gap identification.Process
compliance-testingRegulatory compliance testing. GDPR/CMP consent verification, Better Ads Standards, cookie auditing, privacy policy validation.Process
qa-report-humanizerRemove AI patterns from QA reports, bug reports, test summaries, and status updates. Makes QA writing sound like a real engineer wrote it.Process
testing-in-productionProduction validation with feature flags, progressive rollouts, canary analysis, guardrail metrics, production smoke tests.Production
synthetic-monitoringPost-deploy validation via scheduled synthetic tests. Probe design for critical user journeys, alert thresholds, SLA tracking.Production
observability-driven-testingUse traces, logs, and telemetry as test evidence. OpenTelemetry integration, trace-based assertions, log-informed test design.Production
ai-system-testingTest AI features. LLM prompt regression, tool call validation, nondeterministic output evaluation, hallucination risk assessment.Knowledge
chaos-engineeringControlled fault injection. Hypothesis-driven chaos experiments, network/service/infrastructure failure injection, blast radius control.Knowledge
test-migrationIncremental test suite migration. Selenium to Playwright, Cypress to Playwright, Jest to Vitest, Mocha to Jest, Protractor to Playwright.Knowledge

AI-augmented QA

Most QA skills repos stop at framework tutorials. This one also covers how AI agents can help with the testing itself.

SkillWhat it does
ai-test-generationGenerate test cases from PRDs, specs, and user stories. Builds a coverage matrix before writing code.
ai-bug-triageClassify bugs by severity/component/root cause, deduplicate issues, analyze CI failures
test-reliabilityPer-test runtime healing: detect broken locators, score selector stability, quarantine flaky tests, classify root causes
selector-drift-recoveryBulk offline regeneration: snapshot old vs new DOM after a UI refactor, ship one PR with per-change evidence
ai-qa-reviewTest quality review. Coverage gap detection, test smell identification, testability analysis

Compatibility

Works with any tool that supports the Agent Skills Standard.

AgentSupportInstall method
Claude CodeNativenpx skills add or clone
OpenAI CodexNativeClone or submodule
CursorNativeClone to .cursor/skills
Gemini CLINativeClone or submodule
VS Code CopilotVia instructionsClone and reference in settings
OpenCodeNativeClone or submodule
WindsurfNativeClone or submodule
Any Agent Skills-compatible toolStandardClone or submodule

Tools registry

Skills reference specific QA tools for implementation. The Tools Registry lists all supported tools with capabilities, MCP server availability, and integration guides.

Covered: test frameworks, reporting, visual testing, CI/CD, project management, security scanning, observability.


Project context

Every skill checks for .agents/qa-project-context.md before asking discovery questions. This file captures your project's QA setup: tech stack, test frameworks, CI/CD, environments, quality goals, risk areas, team structure.

Create it once using the qa-project-context skill. All other skills read it automatically.


Contributing

See CONTRIBUTING.md.

Star to follow new skills and updates.


License

MIT. See LICENSE.


This project is not affiliated with, endorsed by, or sponsored by Microsoft, Cypress, Grafana Labs, BrowserStack, Sauce Labs, or any other referenced vendor. Product names and brands are the property of their respective owners.