SKILL.md packages that extend Claude Code, Cursor, Copilot, and other AI agents.
Tags

dotfiles
Comprehensive FFmpeg reference for encoding, converting, streaming, filtering, and analyzing audio/video — command examples, common patterns, and troubleshootin

ffmpeg-cheatsheet
A practical FFmpeg/FFprobe command handbook for video/audio conversion, trimming, resizing, overlays, subtitles, thumbnails, GIFs, and media automation workflow

TerminalSkills Skills Library
Generate PNG/SVG waveform images and JSON or binary peak data from audio files for web players and social previews, with batch processing tips and integration e

eliza
Control Bluesound and NAD speakers: discover devices, play/stop, group/ungroup, and set volume from the CLI.

skills
Guides server-side integration of Runway audio APIs: TTS, sound effects, voice isolation, dubbing and speech-to-speech conversion.

claude-code-plugins-plus-skills
Security best practices for integrating Speak: API key management, audio data privacy, student data protection, and COPPA/FERPA compliance for production deploy

babysor
Convert text (or SRT timelines) into speech audio using local Kokoro or Noiz cloud backends, with voice cloning and timeline-aligned rendering.

dotfiles
Step-by-step instructions to transcribe audio files (podcasts, MP3s, interviews) locally using OpenAI Whisper CLI, with model recommendations and output formats

xiaoyuzhou-transcription-skill
Automates transcription of Xiaoyuzhou podcast episodes, producing speaker-segmented verbatim transcripts, 5–8 key-point summaries, and 8–10 Q&A pairs; integrate

voice-memo-organizer
Locate, transcribe (local whisper.cpp), summarize and index Apple Voice Memos into a searchable archive with titles, themes and key quotes.

mkhlab
Play recorded or online Adhan (Islamic call to prayer) audio with selectable reciters and simple playback controls for various platforms.

skills
Create podcasts, explainer videos, TTS, and AI images using ListenHub scripts; run the provided shell scripts to generate, check status, and download outputs.

aaas-vault
Create calming, slow-paced sleep stories and templates for bedtime audio: sensory-rich, low-stakes narratives designed to help listeners drift off.

agent-skills
Transcribe local audio/video files (MP4/M4A/MP3/WAV) to text and optionally produce meeting minutes, speaker-separated transcripts, action items, and PPT-ready

lark-cli
Access Lark/Feishu meeting recordings: retrieve metadata, export transcripts (txt/SRT), and obtain temporary media download URLs via the lark CLI.

godotprompter
Guidance for importing and managing assets in Godot 4.3+ — image compression, 3D scene import, audio formats, resource types, and import settings.

qwen3_tts_rs
Generate speech audio from text or clone voices from reference audio using Qwen3 TTS binaries and models; supports multiple named speakers, English and Chinese.

claude-code-plugins-plus-skills
Diagnose and fix common Granola issues (audio capture, transcription, calendar sync, and integrations) with platform-specific remediation steps.

wayland
Guidance and workflows for Foley, SFX creation, ambiences, game audio (Wwise/FMOD), field recording, and sound library management for film, games, and interacti

godot-minigame
Apply a version-locked adapter and patch bundle to port an official Godot checkout to WeChat Mini Game, handling WXMEMFS, wasm packaging, audio fixes, and runti

skills
Generate TTS, sound effects, voice isolation, dubbing, and speech-to-speech conversions via Runway's API using provided runnable scripts.