Evolve — Goal-Driven Improvement Loop

Name: Evolve — Goal-Driven Improvement Loop
Rating: 81 (1 reviews)
Author: boshu2

Trust Score 81/100

Autonomous improvement loop that selects high-value work, runs a full research-plan-implement (/rpi) cycle, validates changes, and repeats to compound repo impr

triggers:evolveautonomous improvementrun queuepostmortem and continueimprove everything

GitHub SKILL.md

What it does

Evolve provides a fully autonomous, operator-driven improvement loop: it selects the highest-value work (pinned queues, harvested findings, failing goals, testing gaps), runs a research/planning/implement/validate (/rpi) cycle, applies a regression gate, and repeats until a stop condition. It's designed to iteratively harden and improve codebases by automating selection, execution, and validation.

When to use it

Use Evolve when you want continuous, supervised autonomous development: landing bug fixes, tightening validation, generating tests, performing audits, or processing a roadmap (.agents/evolve/roadmap). It is suitable for day-time operator runs where code changes are expected and validated each cycle.

What's included

Scripts: included integration with /rpi lifecycle and repository-level scripts (has_scripts=true)
References: comprehensive docs and references folder covering fitness scoring, pinned-queue patterns, and teardown (has_references=true)
Instructions: detailed step-by-step execution of selection ladder, execution, regression gate, and teardown; guidance on flags like --compile, --dry-run, --beads-only.

Compatible agents

Best fit for agent platforms that support code changes and lifecycle tooling (AgentOps CLI, RPI engine). Commonly used with Claude Code, Codex, Cursor and similar operator-capable agents.

Audit Summary

Evolve is a goal-driven autonomous improvement loop skill for the AgentOps ecosystem. It orchestrates a full research-plan-implement cycle per iteration with fitness scoring, regression gates, and work selection ladders. The SKILL.md is exceptionally detailed with 7 execution steps, 10+ reference docs, and comprehensive flag support. The bundled validate.sh script failed entirely in the sandbox because it resolves SKILL_DIR relative to the script location but runs in an isolated temp directory where the skill files don't exist — a path-handling bug that would also affect real-world execution outside the skill's own repo.

Watch Out

validate.sh path resolution breaks outside the skill's own repo directory
Requires AgentOps CLI (ao), /rpi skill, /dream skill, and beads system — heavy ecosystem dependency
No guard against running on repos without GOALS.yaml or .agents/ structure

Notes

Well-architected skill with thorough documentation and clear operational contracts. The validate.sh script has a fundamental path resolution issue — it uses dirname-based SKILL_DIR resolution which breaks when the script is copied/run from a different location. Shell injection risk in validate.sh via unquoted variable expansion in bash -c strings. The skill is tightly coupled to the AgentOps ecosystem (ao CLI, beads, /rpi) which limits standalone usability significantly.

Information

Repository: boshu2

Trust Score

Overall81

Security88

Code Quality72

Architecture78

Usefulness55

More from boshu2

RPI — Full Lifecycle Orchestrator

Orchestrates a three-phase RPI lifecycle (discovery → implementation → validation) for autonomous agent workflows, including complexity classification, routing,

Beads BV — Graph-Aware Triage

Graph-based backlog triage using bv/br: rank priorities, find bottlenecks, and produce machine-readable recommendations for agents (robot mode).

cass Session Search

Mine past agent sessions for working prompts, decisions, rituals and recovery patterns (operational tooling).

PR Preparation & Validation

Systematically prepares pull requests by validating tests, analyzing commit patterns, and generating high-quality, structured PR bodies.

Operationalize

Distill research and learnings into durable automation rules.

Related Skills

Development Worktree

Create an isolated git worktree for feature work, auto-run project setup, and verify a clean test baseline before development.

Readwise Reader Document Management

Manage Readwise Reader documents: list, save, search, move, tag, highlight, export and bulk-edit via official and custom CLIs.

Bounty Hunter — Atlas

Persona skill: 'Atlas' — a profit-focused developer persona for discovering, evaluating and executing paid bounties or freelance tasks with ROI-aware workflows.

Junshi — Research Advisor

Daily strategic research advisor that scans arXiv/venues, digests papers, and proposes bold, ranked research ideas tailored to the user's profile.

Full Stack Builder

End-to-end builder that scaffolds, implements, tests, and optionally deploys web and API applications from a natural-language specification.

ezBookkeeping API Tools

Command-line API tools for ezBookkeeping: record and query transactions, retrieve accounts/categories/tags, and fetch exchange rates for self-hosted personal fi

Feishu Voice Sender

Convert MP3s and send them as native Feishu voice messages (playable voice clips) to users or groups.

Claw Bench

Benchmarking skill that guides an agent through a structured suite of capability tests and reporting steps for leaderboard submission.

Back to Skills