
from vllm-omni-skills50
Generate videos (text→video, image→video, text+image→video) using vLLM-Omni and Wan2.2-style diffusion models, with guidance on parameters and performance trade-offs.
This skill documents how to use vLLM-Omni with Wan2.2 and related models to generate videos in three modes: text-to-video (T2V), image-to-video (I2V), and text+image-to-video (TI2V). It provides quick-start code samples for offline and API usage, model IDs, recommended VRAM, common generation parameters (steps, guidance scale, frames, fps), and troubleshooting tips for memory and performance.
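The skill's own quick-start samples live in its references; as a rough sketch of what the offline T2V path looks like, here is a minimal example using Hugging Face diffusers' WanPipeline with a Wan-family checkpoint. This is a stand-in for illustration, not the skill's documented vLLM-Omni code: the model ID, resolution, and parameter values below are assumptions, so substitute the IDs and recommended settings from references/wan-models.md.

```python
import torch
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.utils import export_to_video

# Assumed Wan-family checkpoint; substitute the model ID from references/wan-models.md.
model_id = "Wan-AI/Wan2.1-T2V-1.3B-Diffusers"

# The Wan VAE is typically kept in float32 for stability; the transformer runs in bfloat16.
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
pipe.to("cuda")

# Example values only: steps, guidance scale, frames, and fps are the
# generation knobs this skill documents.
frames = pipe(
    prompt="A red fox running through fresh snow, cinematic lighting",
    negative_prompt="blurry, low quality",
    height=480,
    width=832,
    num_frames=81,            # roughly 5 s of video at 16 fps
    num_inference_steps=30,
    guidance_scale=5.0,
).frames[0]

export_to_video(frames, "fox.mp4", fps=16)
```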
Use it when you need programmatic video generation from prompts or reference images, when experimenting with diffusion transformer models for motion, or when building pipelines that convert text/image inputs into short video outputs. Suitable for researchers and engineers with GPU resources (24–48GB VRAM for larger models).
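When generation is served behind an HTTP API instead of run offline, a client call might look like the sketch below. The endpoint path, port, model name, payload fields, and response shape are all hypothetical placeholders, assuming an OpenAI-style JSON endpoint; check the skill's API quick-start for the real contract before relying on any of these names.

```python
import base64
import requests

# Hypothetical endpoint and payload shape -- not a documented vLLM-Omni contract.
ENDPOINT = "http://localhost:8000/v1/video/generations"

payload = {
    "model": "wan2.2-t2v",          # placeholder model name
    "prompt": "A sailboat crossing a calm bay at sunset",
    "num_frames": 81,
    "fps": 16,
    "num_inference_steps": 30,
    "guidance_scale": 5.0,
}

resp = requests.post(ENDPOINT, json=payload, timeout=600)
resp.raise_for_status()

# Assumed response shape: base64-encoded MP4 bytes under data[0]["b64_video"].
video_b64 = resp.json()["data"][0]["b64_video"]
with open("sailboat.mp4", "wb") as f:
    f.write(base64.b64decode(video_b64))
```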
references/wan-models.md (has_references=true)
Inferred audience: code and ML-focused agents (Copilot, Codex, Claude-Code) and orchestration tooling that can run vLLM or serve models behind APIs.
This skill has not been reviewed by our automated audit pipeline yet.