SKILL.md packages that extend Claude Code, Cursor, Copilot, and other AI agents.
Tags
argentos-core
Comprehensive guide for fine-tuning LLMs using TRL, covering SFT, DPO, PPO, and GRPO for human preference alignment.