
from babysor
Convert text (or SRT timelines) into speech audio using local Kokoro or Noiz cloud backends, with voice cloning and timeline-aligned rendering.
Convert any text into speech audio. Supports two backends (Kokoro local, Noiz cloud), two modes (simple or timeline-accurate), and per-segment voice control.
The speak skill provides text-to-speech functionality via Kokoro (local) and Noiz (cloud) backends, with support for simple mode and timeline-aligned SRT rendering for dubbing. No bundled scripts were present to test. The SKILL.md is well-structured with clear examples, triggers, and a comparison table, but references scripts (tts.sh) that aren't included in the audit payload.
No scripts bundled for execution testing. SKILL.md references skills/speak/scripts/tts.sh which appears to be a real script but wasn't provided in the audit payload. Clean security profile with no concerning patterns.