Back to Skills

Relax RL Dev & Debug

Name: Relax RL Dev & Debug
Author: deepexperience

from deepexperience

Develop and debug the Relax reinforcement learning project on remote Ray clusters.

triggers:trainingdebug trainingRay jobsubmit Ray jobfixing training errorsrelax/ directory

Relax RL Development & Debugging

This skill provides a comprehensive workflow for iterating on the Relax reinforcement learning project, from local code modification to remote validation on Ray clusters.

Key Capabilities:

Minimum-Change Development: Guidance on applying the smallest possible diffs to preserve code style and stability.
Remote Training Validation: Detailed steps for submitting training jobs to Ray clusters using scripts/entrypoint/ray-job.sh.
Job Monitoring & Filtering: Advanced ray job logs usage with mandatory noise filters to save tokens and surface critical errors (CUDA OOM, tracebacks).
Cluster Management: Instructions for preparing TorchJob clusters (single and multi-node) and cleaning up stale Ray Serve applications.
GenRM Validation: Dedicated workflows for validating GenRM (LLM-as-judge) configurations.

Not yet audited

This skill has not been reviewed by our automated audit pipeline yet.

Information

Repository: deepexperience

Related Skills

Synalinks Framework

Keras-inspired framework for building structured, neuro-symbolic LLM programs with DataModel schemas, modular Programs, and training/optimization tools.

Runtime Communication (research_mvp)

Rules and workflows for messaging, delegation, and task coordination in the research_mvp local multi-agent runtime (leader, researcher, trainer).

Deliberate Practice

Guided framework for accelerating skill acquisition using focused practice, immediate feedback, and progressive challenge—useful for learning technical skills,

Relax: Development & Remote Training Debugging

Tools and procedures to develop the Relax project and validate changes by submitting and monitoring remote Ray training jobs (non-blocking, debug-friendly).

CKB Standalone Debugger

Repository-specific skill for offline CKB contract debugging: run mock transactions, GDB sessions, flamegraph/coverage analysis, and WASM-compatible runs for co

KnowBe4 (Membrane)

Integrate with KnowBe4 via the Membrane CLI to manage users, phishing and training campaigns, groups, reports, and account settings.

Raymon — Ray-style log store & MCP server

Lightweight searchable Ray-compatible log server (Raymon) exposing an MCP interface for agents to search, list, and fetch full log entries for debugging and tri

Learning Opportunities

Adds short interactive learning exercises during AI-assisted coding to help developers understand design decisions and new code before merging.

MCP App Store

Relax RL Dev & Debug

Relax RL Development & Debugging

Key Capabilities:

Tags

Not yet audited

Information

Related Skills

Relax RL Dev & Debug

Relax RL Development & Debugging

Key Capabilities:

Tags

Not yet audited

Information

Related Skills