Agent Skills

SKILL.md packages that extend Claude Code, Cursor, Copilot, and other AI agents.

FFmpeg Guide

dotfiles

Comprehensive FFmpeg reference for encoding, converting, streaming, filtering, and analyzing audio/video — command examples, common patterns, and troubleshootin

ffmpeg, video, audio

117

8 triggers

FFmpeg Cheatsheet

ffmpeg-cheatsheet

A practical FFmpeg/FFprobe command handbook for video/audio conversion, trimming, resizing, overlays, subtitles, thumbnails, GIFs, and media automation workflow

ffmpeg, video, audio

1,705

7 triggers

Audiowaveform Helper

TerminalSkills Skills Library

Generate PNG/SVG waveform images and JSON or binary peak data from audio files for web players and social previews, with batch processing tips and integration e

audio, waveform, audiowaveform

7 triggers

BluOS CLI (blu)

eliza

Control Bluesound and NAD speakers: discover devices, play/stop, group/ungroup, and set volume from the CLI.

audio, bluesound, nad

18,165

7 triggers

Runway — Integrate Audio

skills

Guides server-side integration of Runway audio APIs: TTS, sound effects, voice isolation, dubbing and speech-to-speech conversion.

audio, tts, runway

5 triggers

Speak Security Basics

claude-code-plugins-plus-skills

Security best practices for integrating Speak: API key management, audio data privacy, student data protection, and COPPA/FERPA compliance for production deploy

security, api, privacy

2,382

7 triggers

Speak - Text-to-Speech (Kokoro / Noiz)

babysor

Convert text (or SRT timelines) into speech audio using local Kokoro or Noiz cloud backends, with voice cloning and timeline-aligned rendering.

tts, text-to-speech, audio

7 triggers

Audio Transcription (Whisper)

dotfiles

Step-by-step instructions to transcribe audio files (podcasts, MP3s, interviews) locally using OpenAI Whisper CLI, with model recommendations and output formats

transcription, whisper, audio

947

5 triggers

Xiaoyuzhou Podcast Transcription

xiaoyuzhou-transcription-skill

Automates transcription of Xiaoyuzhou podcast episodes, producing speaker-segmented verbatim transcripts, 5–8 key-point summaries, and 8–10 Q&A pairs; integrate

transcription, podcast, asr

7 triggers

Voice Memo Organizer

voice-memo-organizer

Locate, transcribe (local whisper.cpp), summarize and index Apple Voice Memos into a searchable archive with titles, themes and key quotes.

audio, transcription, macos

5 triggers

Adhan Player

mkhlab

Play recorded or online Adhan (Islamic call to prayer) audio with selectable reciters and simple playback controls for various platforms.

audio, adhan, religion

5 triggers

ListenHub — Podcast / TTS / Explainer

skills

Create podcasts, explainer videos, TTS, and AI images using ListenHub scripts; run the provided shell scripts to generate, check status, and download outputs.

audio, podcast, tts

4,473

7 triggers

Sleep Story — Calming Narrative

aaas-vault

Create calming, slow-paced sleep stories and templates for bedtime audio: sensory-rich, low-stakes narratives designed to help listeners drift off.

creative-writing, audio, sleep

5 triggers

Local Media Transcription

agent-skills

Transcribe local audio/video files (MP4/M4A/MP3/WAV) to text and optionally produce meeting minutes, speaker-separated transcripts, action items, and PPT-ready

transcription, media, whisper

5 triggers

Lark CLI — Minutes

lark-cli

Access Lark/Feishu meeting recordings: retrieve metadata, export transcripts (txt/SRT), and obtain temporary media download URLs via the lark CLI.

lark, feishu, cli

4 triggers

Godot Assets Pipeline

godotprompter

Guidance for importing and managing assets in Godot 4.3+ — image compression, 3D scene import, audio formats, resource types, and import settings.

godot, assets, import

180

7 triggers

Qwen3 TTS — Text-to-Speech & Voice Cloning

qwen3_tts_rs

Generate speech audio from text or clone voices from reference audio using Qwen3 TTS binaries and models; supports multiple named speakers, English and Chinese.

tts, speech, voice-cloning

219

6 triggers

Granola — Common Errors & Fixes

claude-code-plugins-plus-skills

Diagnose and fix common Granola issues (audio capture, transcription, calendar sync, and integrations) with platform-specific remediation steps.

saas, audio, troubleshooting

2,382

5 triggers

Sound Designer

wayland

Guidance and workflows for Foley, SFX creation, ambiences, game audio (Wwise/FMOD), field recording, and sound library management for film, games, and interacti

audio, sound-design, foley

509

9 triggers

Godot WeChat Minigame Adapter

godot-minigame

Apply a version-locked adapter and patch bundle to port an official Godot checkout to WeChat Mini Game, handling WXMEMFS, wasm packaging, audio fixes, and runti

godot, wechat, minigame

7 triggers

Runway Generate Audio

skills

Generate TTS, sound effects, voice isolation, dubbing, and speech-to-speech conversions via Runway's API using provided runnable scripts.

audio, tts, sfx

5 triggers