
from media-extract-skill16
Extracts and analyses media (YouTube, web articles, local audio/video, transcripts). Produces structured summaries, chapters, quotes, code examples and visual a
Media Extract is a utility skill that ingests content from YouTube, web pages, local files (video, audio, PDF, text) or pasted transcripts and returns structured, parseable outputs: summaries, chapter timestamps, golden nuggets, quotes, command/code examples, and visual analysis via Gemini. It can download videos using yt-dlp, clean meeting transcripts (remove timestamps and filler words), and enrich outputs with metadata (channel, publish date, duration, views, likes, engagement). The skill focuses on consistent, machine-friendly formats so downstream skills can consume the results programmatically.
Use this skill when a user shares a YouTube link, posts an article URL, uploads or points to a local media file, pastes a transcript, or asks explicitly to "clean this transcript", "remove timestamps", or to "analyze this video/article". Also useful for batch processing playlists or folders of videos and for extracting code shown visually on-screen.
Best suited for agents with file and web access and Gemini integration (e.g., Claude with file+tooling, agents using Gemini visual models, or other assistant runtimes that can run local scripts and call external APIs).
This skill has not been reviewed by our automated audit pipeline yet.