
from awesome-copilot · 34,827
Create, run, and analyze Arize experiments to evaluate and compare model performance using the ax CLI.
Provides step-by-step guidance and CLI workflows for creating, exporting, running, and comparing Arize experiments. It covers dataset export, running inference to produce runs, exporting results, and comparing evaluation metrics to benchmark and A/B test models. Includes clear instructions for using the ax CLI to list/get/export experiments and templates for piping experiment exports into inference scripts.
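As a rough illustration of piping exported examples into an inference script, the sketch below turns JSON Lines examples into run records. The field names (`id`, `input`, `example_id`, `output`) and the `echo_model` stand-in are hypothetical; they are not taken from the ax CLI's actual export schema, which should be checked against a real export before adapting this.

```python
import json
from typing import Callable, Dict, Iterable, List


def run_inference(lines: Iterable[str], predict: Callable[[str], str]) -> List[Dict[str, str]]:
    """Convert JSON Lines examples into run records, one per non-blank line."""
    runs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        example = json.loads(line)
        # "id" and "input" are assumed field names, not the real export schema.
        runs.append({"example_id": example["id"], "output": predict(example["input"])})
    return runs


def echo_model(prompt: str) -> str:
    """Stand-in for a provider SDK call; it only upper-cases the prompt."""
    return prompt.upper()


# In-memory demo; pass sys.stdin instead of this list to read piped input.
demo_lines = ['{"id": "ex-1", "input": "hello"}', "", '{"id": "ex-2", "input": "world"}']
demo_runs = run_inference(demo_lines, echo_model)
for run in demo_runs:
    print(json.dumps(run))
```

Keeping the model call behind a plain `predict` callable means the same loop can wrap any of the provider SDKs without changing how examples are read.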
Use this skill when you need to evaluate model performance with Arize: creating experiments, exporting runs, running bulk inference over dataset examples, comparing two experiments, or extracting metrics for analysis. Trigger when the user mentions experiments, benchmarks, A/B testing models, model evaluation, exporting runs, or using the ax CLI.
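Comparing two experiments usually comes down to aggregating a metric per experiment and looking at the difference. A minimal sketch follows, assuming each exported run is a dictionary carrying a numeric `score` field; that field name and the record shape are assumptions for illustration, not the documented export format.

```python
from statistics import mean
from typing import Dict, List


def mean_score(runs: List[dict], metric: str = "score") -> float:
    """Average one metric over the runs that actually report it."""
    values = [run[metric] for run in runs if run.get(metric) is not None]
    if not values:
        raise ValueError(f"no runs contain the metric {metric!r}")
    return mean(values)


def compare_experiments(baseline: List[dict], candidate: List[dict],
                        metric: str = "score") -> Dict[str, float]:
    """Return both means and the candidate-minus-baseline difference."""
    base = mean_score(baseline, metric)
    cand = mean_score(candidate, metric)
    return {"baseline": base, "candidate": cand, "delta": cand - base}


# Hypothetical run records standing in for two exported experiments.
baseline_runs = [{"score": 0.5}, {"score": 1.0}]
candidate_runs = [{"score": 1.0}, {"score": 1.0}]
summary = compare_experiments(baseline_runs, candidate_runs)
print(summary)
```

A difference in means says nothing about significance on its own; with few runs, treat the delta as a hint and not as a verdict.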
Works with agents that can run shell commands and invoke provider SDKs (OpenAI, Anthropic, Google Gemini, custom OpenAI-compatible proxies).
The Arize Experiment skill provides a comprehensive CLI-driven workflow for creating, running, and comparing ML model experiments via the ax CLI. Its SKILL.md is well documented, with clear command examples, flag tables, troubleshooting guides, and anti-fabrication safeguards. It bundles no scripts to execute. Its security posture is strong: it explicitly prohibits credential exfiltration and output fabrication.
Part of the awesome-copilot repo under plugins/arize-ax. The documentation is well maintained and security-conscious, with explicit warnings against fabricating outputs and reading .env files. No scripts are included; the skill is a purely instructional SKILL.md.
Copilot Instructions Blueprint Generator
Generates a technology-agnostic blueprint for creating copilot-instructions.md files that align Copilot output with a project's exact architecture, versions, an
quality-playbook
Run a complete quality engineering audit on any codebase. Derives behavioral requirements from the code, generates spec-traced functional tests, runs a three...
FlowStudio Power Automate Builder
Build, scaffold, deploy, and verify Power Automate cloud flows programmatically via a FlowStudio MCP server; handles connection discovery, definition building,