SKILL.md packages that extend Claude Code, Cursor, Copilot, and other AI agents.
Tags

orchestkit
Query local, privacy-safe cross-project analytics to report on agent, skill, hook, and team performance; replay sessions and estimate token costs.

opentelemetry-skill
Expert OpenTelemetry observability skill for designing collectors, pipelines, sampling, cardinality management, security, and production-ready deployment patter

datus-agent
Interactively define MetricFlow metrics from natural-language business descriptions; proposes, validates, and dry-runs metric YAML for semantic modeling.

ai-playground
Scans a repository, discovers and runs tests, computes coverage, evaluates test quality, and generates a metrics report website and JSON output.

claude-plugins
Evaluation framework and tools for systematically measuring LLM performance using automated metrics, human judgment, and A/B testing.

skills
Practical, cross‑language playbook to design and scale SaaS growth (PLG vs SLG), pricing, retention, and key metrics (ARR, NRR, CAC, LTV) with benchmarks and ca

swarma
Framework for running high-velocity growth experiments with agent teams that generate hypotheses, run tests, observe signals, and build a validated playbook.

awesome-copilot
Create, run, and analyze Arize experiments to evaluate and compare model performance using the ax CLI.

vitaclaw
Analyze and track body composition metrics (body fat, muscle mass, visceral fat, BMI) and provide evidence-based recommendations and trend analysis.

maverick
Practical observability standards: metrics, tracing, health checks, SLIs/SLOs and dashboards for production services.

opendatalab
Run, validate, and parse OmniDocBench document parsing evaluations with Docker/conda workflows and result parsing.

opencode-skills-collection
Diagnose and optimize Agent Skills (SKILL.md) using session transcripts and static analysis to improve triggers, workflows, and token efficiency.

agent-almanac
Set up MLflow experiment tracking: server, autologging, artifact storage, run comparison and lifecycle management for reproducible ML workflows.

dotfiles-claude
Practical guide to quantitative code-quality metrics (cyclomatic, cognitive, Halstead, maintainability index) with thresholds, formulas and measurement commands

ai-first-sdlc-practices
Generates a markdown dashboard of knowledge-base health: inventory, layer/domain distribution, recent activity, and staleness metrics.