SKILL.md packages that extend Claude Code, Cursor, Copilot, and other AI agents.
Tags
openjudge
Tools and patterns to build automated evaluation pipelines for LLMs: graders, runners, aggregators, and analysis utilities for comparing model outputs and scori