azure-diagnostics
Author: Azure
Debug and troubleshoot production issues on Azure. Covers Container Apps and Function Apps diagnostics, log analysis with KQL, health checks, and common troubleshooting.
npx skills add https://github.com/microsoft/GitHub-Copilot-for-Azure --skill azure-diagnostics
Azure Diagnostics
AUTHORITATIVE GUIDANCE — MANDATORY COMPLIANCE
This document is the official source for debugging and troubleshooting Azure production issues. Follow these instructions to diagnose and resolve common Azure service problems systematically.
Triggers
Activate this skill when the user wants to:
- Debug or troubleshoot production issues
- Diagnose errors in Azure services
- Analyze application logs or metrics
- Fix image pull, cold start, or health probe issues
- Investigate why Azure resources are failing
- Find root cause of application errors
- Troubleshoot App Service issues (high CPU, deployment failures, crashes, slow responses, TLS/custom domains)
- Respond to prompts like "troubleshoot app service", "app service high CPU", or "app service deployment failure"
- Troubleshoot Azure Function Apps (invocation failures, timeouts, binding errors)
- Find the App Insights or Log Analytics workspace linked to a Function App
- Troubleshoot AKS clusters, nodes, pods, ingress, or Kubernetes networking issues
- Troubleshoot Azure Messaging SDK issues (Event Hubs, Service Bus connection failures, AMQP errors, message lock issues)
Rules
- Start with systematic diagnosis flow
- Use AppLens (MCP) for AI-powered diagnostics when available
- Check resource health before deep-diving into logs
- Select appropriate troubleshooting guide based on service type
- Document findings and attempted remediation steps
- Route AKS incidents to the dedicated AKS troubleshooting document
Quick Diagnosis Flow
- Identify symptoms - What's failing?
- Check resource health - Is Azure healthy?
- Review logs - What do logs show?
- Analyze metrics - Performance patterns?
- Investigate recent changes - What changed?
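The two steps of this flow that map directly onto CLI commands in this document (checking the resource and reviewing recent changes) can be sketched as a small script. This is a minimal sketch, not an official tool: the `run` wrapper, the `DRY_RUN` variable, and the `quick_diagnosis` function name are hypothetical, and the remaining steps (symptoms, logs, metrics) are service-specific and left to the guides below.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical wrapper around the two CLI commands listed in this document.
# $1 = full Azure resource ID, $2 = resource group name
quick_diagnosis() {
  local resource_id="$1" resource_group="$2"

  echo "Check resource health"
  run az resource show --ids "$resource_id"

  echo "Investigate recent changes"
  run az monitor activity-log list -g "$resource_group" --max-events 20
}
```

Setting `DRY_RUN=1` before calling `quick_diagnosis RESOURCE_ID RG` prints the commands without contacting Azure, which is useful for reviewing them before a live run.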
Troubleshooting Guides by Service
| Service | Common Issues | Reference |
|---|---|---|
| Container Apps | Image pull failures, cold starts, health probes, port mismatches | container-apps/ |
| App Service | High CPU, deployment failures, crashes, slow responses, TLS/custom domains | app-service/ |
| Function Apps | App details, invocation failures, timeouts, binding errors, cold starts, missing app settings | functions/ |
| AKS | Cluster access, nodes, kube-system, scheduling, crash loops, ingress, DNS, upgrades | AKS Troubleshooting |
| Messaging | Event Hubs & Service Bus SDK errors, AMQP failures, message lock, connectivity | Messaging Troubleshooting |
Routing
- Keep Container Apps and Function Apps diagnostics in this parent skill.
- Route active AKS incidents, AKS-specific intake, evidence gathering, and remediation guidance to AKS Troubleshooting.
- Route Azure Messaging SDK troubleshooting (Event Hubs, Service Bus) to Messaging Troubleshooting.
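The routing rules and the table above can be expressed as a lookup from ARM resource type to guide. The sketch below is illustrative only: `route_guide` is a hypothetical function, and the mapping of resource types to guides is an assumption based on the usual ARM type names (App Service and Function Apps share `Microsoft.Web/sites` and are distinguished here by the `kind` property).

```shell
#!/usr/bin/env bash
# Hypothetical helper: map an ARM resource type (and optional kind) to a guide.
# $1 = resource type, e.g. Microsoft.Web/sites
# $2 = resource kind (optional), e.g. "functionapp,linux"
route_guide() {
  local rtype kind
  rtype="$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
  kind="$(printf '%s' "${2:-}" | tr '[:upper:]' '[:lower:]')"

  case "$rtype" in
    microsoft.app/containerapps)
      echo "container-apps/" ;;
    microsoft.web/sites)
      # Function Apps are sites whose kind contains "functionapp".
      case "$kind" in
        *functionapp*) echo "functions/" ;;
        *)             echo "app-service/" ;;
      esac ;;
    microsoft.containerservice/managedclusters)
      echo "AKS Troubleshooting" ;;
    microsoft.eventhub/namespaces|microsoft.servicebus/namespaces)
      echo "Messaging Troubleshooting" ;;
    *)
      echo "no matching guide for: $1" >&2
      return 1 ;;
  esac
}
```

The type and kind of a resource can be read from the output of `az resource show --ids RESOURCE_ID`, then passed to the function, for example `route_guide Microsoft.Web/sites "functionapp,linux"`.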
Quick Reference
Common Diagnostic Commands
# Show resource details (properties such as provisioning state; not the Resource Health availability status)
az resource show --ids RESOURCE_ID
# View activity log
az monitor activity-log list -g RG --max-events 20
# Container Apps logs
az containerapp logs show --name APP -g RG --follow
# Function App logs (query App Insights traces)
az monitor app-insights query --apps APP-INSIGHTS -g RG \
  --analytics-query "traces | where timestamp > ago(1h) | order by timestamp desc | take 50"
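The App Insights trace query above can be narrowed by time window and severity. The following is a minimal sketch, assuming the standard `severityLevel` column of the `traces` table (0 = Verbose through 4 = Critical); `build_traces_query` is a hypothetical helper name, not part of the Azure CLI.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build a KQL query over App Insights traces.
# $1 = time window (default 1h), $2 = minimum severityLevel (default 0),
# $3 = maximum number of rows (default 50)
build_traces_query() {
  local window="${1:-1h}" min_severity="${2:-0}" limit="${3:-50}"
  printf 'traces | where timestamp > ago(%s) | where severityLevel >= %s | order by timestamp desc | take %s' \
    "$window" "$min_severity" "$limit"
}

# Example usage (requires a signed-in Azure CLI; not executed here):
# az monitor app-insights query --apps APP-INSIGHTS -g RG \
#   --analytics-query "$(build_traces_query 30m 3 20)"
```

Filtering to `severityLevel >= 3` keeps only errors and critical traces, which usually shortens the output enough to spot a failing invocation quickly.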
AppLens (MCP Tools)
For AI-powered diagnostics, use:
mcp_azure_mcp_applens
  intent: "diagnose issues with <resource-name>"
  command: "diagnose"
  parameters:
    resourceId: "<resource-id>"
Provides:
- Automated issue detection
- Root cause analysis
- Remediation recommendations
Azure Monitor (MCP Tools)
For querying logs and metrics:
mcp_azure_mcp_monitor
  intent: "query logs for <resource-name>"
  command: "logs_query"
  parameters:
    workspaceId: "<workspace-id>"
    query: "<KQL-query>"
See kql-queries.md for common diagnostic queries.
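When the MCP tool is not available, a similar query can be run from the CLI with `az monitor log-analytics query`. The sketch below builds a KQL query for Container Apps console logs. It assumes the environment sends logs to a Log Analytics workspace and that the table is `ContainerAppConsoleLogs_CL` with `ContainerAppName_s` and `Log_s` columns; verify these names against your workspace schema. `container_app_logs_query` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build a KQL query for Container Apps console logs.
# Table and column names are assumptions; check your workspace schema.
# $1 = container app name, $2 = time window (default 1h)
container_app_logs_query() {
  local app="$1" window="${2:-1h}"
  printf "ContainerAppConsoleLogs_CL | where TimeGenerated > ago(%s) | where ContainerAppName_s == '%s' | project TimeGenerated, Log_s | order by TimeGenerated desc | take 50" \
    "$window" "$app"
}

# Example usage (requires a signed-in Azure CLI; not executed here):
# az monitor log-analytics query -w WORKSPACE_ID \
#   --analytics-query "$(container_app_logs_query APP 30m)"
```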
Check Azure Resource Health
Using MCP
mcp_azure_mcp_resourcehealth
  intent: "check health status of <resource-name>"
  command: "get"
  parameters:
    resourceId: "<resource-id>"
Using CLI
# Show the resource's properties (provisioning state, configuration); this is not the Resource Health availability status
az resource show --ids RESOURCE_ID
# Check recent activity
az monitor activity-log list -g RG --max-events 20
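`az resource show` returns the resource's properties rather than its Resource Health availability status. One way to read the availability status from the CLI is to call the Resource Health REST API through `az rest`. This is a sketch under assumptions: `resource_health_url` is a hypothetical helper, and the default `api-version` value used here should be checked against the current Resource Health REST reference.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build the Resource Health "current availability status"
# URL for a resource. The default api-version is an assumption; verify it.
# $1 = full Azure resource ID (starts with /subscriptions/...)
# $2 = api-version (optional)
resource_health_url() {
  local resource_id="$1" api_version="${2:-2020-05-01}"
  printf 'https://management.azure.com%s/providers/Microsoft.ResourceHealth/availabilityStatuses/current?api-version=%s' \
    "$resource_id" "$api_version"
}

# Example usage (requires a signed-in Azure CLI; not executed here):
# az rest --method get --url "$(resource_health_url "$RESOURCE_ID")" \
#   --query "properties.availabilityState" -o tsv
```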
References