Video Podcast Maker

Trust Score 90/100

Automates end-to-end production of long-form video podcasts using TTS, Remotion, and FFmpeg—supports multi-language output and Bilibili/YouTube publishing workf

triggers: make a video, video podcast, remotion, generate video, design learning, create video podcast, render 4k


What it does

Automates an end-to-end pipeline that turns a topic into a production-ready video podcast. The skill handles research, scriptwriting, TTS audio generation, Remotion composition and studio preview, thumbnail generation, and final MP4 rendering with background music. It includes a design-learning subsystem that extracts visual patterns from reference videos or images and applies them as style profiles to new compositions. The workflow is optimized for horizontal Bilibili-style knowledge videos but can generate vertical highlights for shorts.

When to use it

Use this skill when you want a coding agent to produce a complete video from a topic prompt ('Make a video podcast about X') with minimal manual steps, or when you need design-consistent templates learned from reference videos. It suits batch content production, repackaging articles into video, rapid prototyping of visual styles from references, and generating publish-ready 4K/1080p outputs.

What's included

Scripts: design learning and CLI wrappers for Remotion/FFmpeg (workflow files referenced in the repo).
References: detailed workflow-steps, design-guide, and troubleshooting docs are available in the repository for on-demand loading.
Instructions: step-by-step pipeline (topic research → script → TTS → Remotion studio preview → optional 4K render). The skill enforces mandatory studio preview before final 4K rendering and includes technical rules for thumbnails, safe zones, and audio sync.
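
The preview-before-render rule described above can be pictured as a small ordering check. This is only a sketch under stated assumptions: the stage names mirror the pipeline listed above, but the `Pipeline` class, its fields, and its methods are hypothetical and are not taken from the skill's scripts.

```python
from dataclasses import dataclass, field

# Stage names follow the pipeline described above; everything else is hypothetical.
STAGES = ["research", "script", "tts", "studio_preview", "render_4k"]


@dataclass
class Pipeline:
    completed: list = field(default_factory=list)
    preview_approved: bool = False  # set once the user has reviewed the studio preview

    def next_stage(self):
        """Return the next stage to run, or None when every stage is done."""
        if len(self.completed) < len(STAGES):
            return STAGES[len(self.completed)]
        return None

    def complete(self, stage):
        """Mark a stage as done, enforcing order and the preview gate."""
        if stage != self.next_stage():
            raise RuntimeError(f"expected {self.next_stage()!r}, got {stage!r}")
        if stage == "render_4k" and not self.preview_approved:
            raise RuntimeError("studio preview must be approved before the 4K render")
        self.completed.append(stage)
```

Because the 4K render is optional, a run may simply stop after `studio_preview`. The gate only matters when the final render is requested.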

Compatible agents

Best fit for coding agents and Claude Code-style tooling that can run Node/Remotion, Python/FFmpeg, and TTS backends. Works with agents that can call shell tools, manage files, and launch Remotion Studio for user review.

Audit Summary

Video Podcast Maker is a comprehensive end-to-end pipeline for producing 4K video podcasts from a topic, covering research, scripting, TTS, Remotion composition, rendering, and Bilibili/YouTube publishing. It features 13 scripts with a CLI dispatcher, a standardized JSON output envelope, and multi-backend TTS support (Edge/Azure/Doubao/ElevenLabs/OpenAI/Google). The scripts are well written with proper error handling, but they depend on sibling local modules (tts/, cli_envelope.py) that cannot be resolved when a script is run in isolation. No security concerns were found: there are no hardcoded credentials and no destructive commands, and the update check is consent-based.

Watch Out

Scripts must be run from within the skill directory, because they use sys.path.insert(0, dirname(__file__)) for local module imports
Requires AZURE_SPEECH_KEY env var for Azure TTS backend (edge-tts works without auth)
Depends on remotion-best-practices skill being loaded first
Requires node, ffmpeg, python3, npx as external binaries
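
A preflight check for the requirements listed above might look like the following sketch. The binary names and the `AZURE_SPEECH_KEY` variable come from the list. The `preflight` function, its parameters, and the backend labels `"edge"` and `"azure"` are assumptions made for illustration and may not match how the skill names its backends.

```python
import os
import shutil

# External binaries named in the list above.
REQUIRED_BINARIES = ("node", "ffmpeg", "python3", "npx")


def preflight(tts_backend="edge", env=None, which=shutil.which):
    """Return a list of human-readable problems; an empty list means ready.

    `tts_backend` labels ("edge", "azure") are hypothetical. `env` and `which`
    are injectable so the check can be exercised without touching the system.
    """
    env = os.environ if env is None else env
    problems = [
        f"missing binary: {name}"
        for name in REQUIRED_BINARIES
        if which(name) is None
    ]
    # Only the Azure backend needs a key; edge-tts works without auth.
    if tts_backend == "azure" and not env.get("AZURE_SPEECH_KEY"):
        problems.append(
            "AZURE_SPEECH_KEY is not set (required for the Azure TTS backend)"
        )
    return problems
```

Calling `preflight()` with no arguments checks the real `PATH` and environment. Passing `env` and `which` explicitly keeps the check testable.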

Missing Dependencies

tts (local module package)
cli_envelope (local module, needed by most scripts)
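
To illustrate why these modules resolve only inside the skill directory, here is a minimal sketch of the path-insertion pattern the audit mentions, plus a check for unresolved names. The helper names `add_script_dir` and `missing_local_modules` are hypothetical, and the use of `abspath` is an added assumption rather than the scripts' exact code.

```python
import importlib
import importlib.util
import os
import sys


def add_script_dir(script_file):
    """Put the directory containing `script_file` first on sys.path.

    Mirrors the sys.path.insert(0, dirname(__file__)) pattern: sibling
    modules resolve only when they actually sit beside the script.
    """
    script_dir = os.path.dirname(os.path.abspath(script_file))
    if script_dir not in sys.path:
        sys.path.insert(0, script_dir)
    return script_dir


def missing_local_modules(names):
    """Return the top-level module names that cannot currently be resolved."""
    importlib.invalidate_caches()  # pick up files created after startup
    return [name for name in names if importlib.util.find_spec(name) is None]
```

A script would typically call `add_script_dir(__file__)` at the top and then `missing_local_modules(["tts", "cli_envelope"])` to report which siblings are absent before importing them.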

Notes

One of the most polished and comprehensive skills audited. It has a production-quality CLI envelope with error codes, request IDs, and latency tracking. The skill follows the AgentSkill spec closely, with progressive disclosure via references/, dependency declaration, and a clear 15-step workflow. The only notable gap is that most scripts can't run in isolation due to local module dependencies, but this is expected behavior for a skill designed to operate within its directory context.
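
For readers unfamiliar with the idea, a CLI envelope with error codes, request IDs, and latency tracking could be sketched roughly as follows. The field names (`ok`, `data`, `error`, `command`, `request_id`, `latency_ms`) and the `run_with_envelope` helper are assumptions for illustration. The skill's actual `cli_envelope` module may use a different schema.

```python
import json
import time
import uuid


def run_with_envelope(command, fn):
    """Run `fn` and wrap its result or failure in a uniform JSON envelope.

    All field names here are hypothetical, not the skill's real schema.
    """
    request_id = uuid.uuid4().hex
    started = time.perf_counter()
    try:
        envelope = {"ok": True, "data": fn(), "error": None}
    except Exception as exc:
        # Use the exception class name as a simple stand-in for an error code.
        envelope = {
            "ok": False,
            "data": None,
            "error": {"code": type(exc).__name__, "message": str(exc)},
        }
    envelope.update(
        command=command,
        request_id=request_id,
        latency_ms=round((time.perf_counter() - started) * 1000, 2),
    )
    return json.dumps(envelope, ensure_ascii=False)
```

Keeping success and failure in the same shape lets a calling agent parse every script's output the same way and branch on a single `ok` flag.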

Information

Repository: video-podcast-maker
Stars: 501

Trust Score

Overall: 90
Security: 95
Code Quality: 82
Architecture: 88
Usefulness: 88

Related Skills

Development Worktree

Create an isolated git worktree for feature work, auto-run project setup, and verify a clean test baseline before development.

Readwise Reader Document Management

Manage Readwise Reader documents: list, save, search, move, tag, highlight, export and bulk-edit via official and custom CLIs.

Bounty Hunter — Atlas

Persona skill: 'Atlas' — a profit-focused developer persona for discovering, evaluating and executing paid bounties or freelance tasks with ROI-aware workflows.

Junshi — Research Advisor

Daily strategic research advisor that scans arXiv/venues, digests papers, and proposes bold, ranked research ideas tailored to the user's profile.

Full Stack Builder

End-to-end builder that scaffolds, implements, tests, and optionally deploys web and API applications from a natural-language specification.

ezBookkeeping API Tools

Command-line API tools for ezBookkeeping: record and query transactions, retrieve accounts/categories/tags, and fetch exchange rates for self-hosted personal fi

Feishu Voice Sender

Convert MP3s and send them as native Feishu voice messages (playable voice clips) to users or groups.

Claw Bench

Benchmarking skill that guides an agent through a structured suite of capability tests and reporting steps for leaderboard submission.
