Name: Runpod Flash Deployment
Author: runpod

What it does

Runpod Flash provides a high-velocity development cycle for AI workloads. It allows developers to write code locally and execute it on remote Runpod GPUs or CPUs via flash dev, featuring hot-reloading that syncs function bodies instantly. Once stable, flash deploy ships the workload as a stable serverless endpoint.

When to use it

Use this skill when you need to deploy Python-based AI functions, manage GPU resources (from RTX 4090s to H100s), or set up load-balanced serverless APIs for ML models without the overhead of manual Docker management.

What's included

Instructions: Detailed CLI guide for authentication, project initialization, and environment management. It includes a comprehensive breakdown of the Endpoint constructor, GPU/CPU instance types, and specific "Gotchas" regarding cloudpickle and module imports.

Compatible agents

Agents with shell access and Python capabilities (e.g., Claude Code, Codex, or any ACP harness) that can drive a long-running background process and interact with it via HTTP.

What it does

When to use it

What's included

Instructions: Detailed CLI guide for authentication, project initialization, and environment management. It includes a comprehensive breakdown of the Endpoint constructor, GPU/CPU instance types, and specific "Gotchas" regarding cloudpickle and module imports.

Compatible agents

Agents with shell access and Python capabilities (e.g., Claude Code, Codex, or any ACP harness) that can drive a long-running background process and interact with it via HTTP.

Runpod Flash Deployment

What it does

When to use it

What's included

Compatible agents

Tags

Not yet audited

Information

Related Skills

Runpod Flash Deployment

What it does

When to use it

What's included

Compatible agents

Tags

Not yet audited

Information

Related Skills