AKO4ALL — Agentic Kernel Optimization

Automated loop that profiles, iterates, benchmarks and commits GPU kernel optimizations across CUDA/Triton/TileLang/Python/C++ to achieve measurable speedups.

Triggers: optimize this kernel, AKO, AKO4ALL, ncu, benchmark kernel, speed up CUDA, kernel optimization

What it does

AKO4ALL runs an agentic optimize→benchmark→log→commit workflow for GPU kernels. Given a kernel (CUDA, Triton, TileLang, C++, Python), it bootstraps a workspace, profiles the code, runs iterative micro-optimizations, verifies correctness, and records each iteration. The goal is measurable runtime improvement over a provided reference while preserving correctness and reproducible commits.
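The "verify correctness, then measure against a reference" step can be sketched roughly as below. This is a minimal illustration, not AKO4ALL's actual harness: the function names `median_runtime` and `evaluate`, the tolerance values, and the warmup/iteration counts are all assumptions, and the stand-in callables are plain Python functions rather than GPU kernels (a real GPU benchmark would also need to synchronize the device before reading the clock).

```python
import math
import statistics
import time


def median_runtime(fn, args, warmup=3, iters=20):
    """Return the median wall-clock time of fn(*args) in seconds."""
    for _ in range(warmup):  # warm caches before timing
        fn(*args)
    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        fn(*args)
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def evaluate(reference, candidate, args, rel_tol=1e-6):
    """Check the candidate against the reference first, then time both.

    Hypothetical helper: both callables are assumed to return a flat
    sequence of floats.
    """
    expected = reference(*args)
    actual = candidate(*args)
    correct = len(expected) == len(actual) and all(
        math.isclose(e, a, rel_tol=rel_tol, abs_tol=1e-9)
        for e, a in zip(expected, actual)
    )
    if not correct:
        # An incorrect candidate is rejected regardless of how fast it is.
        return {"correct": False, "speedup": None}
    ref_t = median_runtime(reference, args)
    cand_t = median_runtime(candidate, args)
    speedup = ref_t / cand_t if cand_t > 0 else float("inf")
    return {"correct": True, "speedup": speedup}
```

Checking correctness before timing means a wrong candidate never gets a speedup number attached to it, which mirrors the stated goal of improving runtime while preserving correctness.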

When to use it

Use AKO4ALL when you have an existing GPU kernel you want to speed up and measure: 'optimize this kernel', 'run AKO on my CUDA kernel', or when you need repeatable iteration with profiling (ncu) and git-backed experiment logging. It is not for writing a kernel from scratch or for generic non-kernel performance advice.

What's included

Scripts: a scaffolded bench-wrapper and scripts/bench.sh patterns (bench/kernelbench integration), used when present in the repo; the skill also gives guidance on rendering and running bench commands.
References: guidance on ncu profiling and KernelBench-style inputs; the skill explains how HINTS.md holds persistent directives and how ITERATIONS.md serves as the experiment log.
Instructions: a detailed procedural workflow covering workspace inventory, branch/solution initialization, bench command generation, the iteration protocol (bench → ITERATIONS.md entry → commit), profiling guidance, stall handling, and finalization steps.
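The iteration protocol above (bench → ITERATIONS.md entry → commit) could look something like the following sketch. The entry layout, field names, and commit message format are hypothetical; the listing does not specify what an ITERATIONS.md entry actually contains, so treat this only as one plausible shape.

```python
from pathlib import Path


def format_entry(iteration, change, runtime_ms, baseline_ms, correct):
    """Render one log entry (hypothetical layout, not the skill's own)."""
    speedup = baseline_ms / runtime_ms
    return (
        f"## Iteration {iteration}\n"
        f"- change: {change}\n"
        f"- runtime_ms: {runtime_ms:.3f}\n"
        f"- speedup_vs_reference: {speedup:.2f}x\n"
        f"- correct: {'yes' if correct else 'no'}\n\n"
    )


def log_iteration(log_path, entry):
    """Append the entry so earlier iterations are never overwritten."""
    with Path(log_path).open("a", encoding="utf-8") as f:
        f.write(entry)


def commit_command(iteration, change):
    """Build (but do not run) a git commit command for this iteration.

    Note: `git commit -a` only stages files git already tracks.
    """
    return ["git", "commit", "-a", "-m", f"iter {iteration}: {change}"]
```

A caller would run the benchmark, pass the measured numbers to `format_entry`, append with `log_iteration("ITERATIONS.md", entry)`, and then hand the list from `commit_command` to `subprocess.run`. Logging before committing keeps the log entry and the code change in the same commit.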

Compatible agents

Best used by agents with code execution and shell capabilities (Copilot/Codex-style or CLI-capable agents that can run Python, shell, and profiling tools). The skill expects the environment to run benchmarks, invoke ncu, and commit to git.
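Since the skill expects benchmarks, ncu, and git to be runnable, an agent might do a quick preflight check before starting. The sketch below is an assumption about how such a check could be written, not part of the skill; the tool list and the names `find_tools` and `missing_tools` are illustrative.

```python
import shutil

# Tools the listing says the environment must provide; the exact
# executable names here are an assumption.
REQUIRED_TOOLS = ("git", "ncu", "python3")


def find_tools(tools=REQUIRED_TOOLS):
    """Map each tool name to its resolved path, or None if not on PATH."""
    return {tool: shutil.which(tool) for tool in tools}


def missing_tools(found):
    """Return the sorted names of tools that were not found."""
    return sorted(name for name, path in found.items() if path is None)


if __name__ == "__main__":
    missing = missing_tools(find_tools())
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools found.")
```

Finding `ncu` on PATH does not guarantee that profiling will work, since GPU performance counters can require elevated permissions on some systems.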

Not yet audited

This skill has not been reviewed by our automated audit pipeline yet.

Information

Repository: ako4all
Stars: 262
