This skill defines actionable observability standards for deployed services. It guides instrumenting metrics (RED/USE), implementing distributed tracing with OpenTelemetry, exposing liveness/readiness checks, and defining SLIs/SLOs and dashboard panels. Use it to design or review operational observability for services in a microservices or monolith environment.
Use this skill when creating or reviewing service instrumentation, building dashboards, or setting SLOs. It's appropriate during design, code review, and oncall runbooks to ensure metrics, traces and health checks are implemented coherently.
Best used by agents with code-review and DevOps capabilities (agents that can read repo files and suggest instrumentation), and those that can run or suggest OpenTelemetry/prometheus/Grafana configurations.
This skill has not been reviewed by our automated audit pipeline yet.