
from VideoDB72
Agent skill for ingesting, indexing, searching and programmatically editing video and desktop capture; returns playable streams, timestamps, clips, and transcod
VideoDB provides perception, indexing, and action capabilities for video, live streams (RTSP), and desktop capture. It can ingest files or streams, transcode/normalize media, extract spoken transcripts and visual scene indexes, search for moments with timestamps and playable evidence, and compose timeline edits (subtitles, overlays, dubbing) programmatically.
Use VideoDB when agents must understand video content, produce searchable timelines, generate clips, or perform server-side edits and transcodes. Typical scenarios: monitoring live feeds with alerts, producing session summaries from desktop recordings, automated subtitle generation and burn-in, and reformatting videos for social platforms.
scripts/ helpers and a WebSocket listener for desktop capture are present.reference/ directory documents indexing, editor workflows, streaming and capture.Designed for integration with agent runtimes that can run Python or call APIs; compatible with Claude/Codex/Gemini-backed agents or any agent that can call the VideoDB Python SDK or HTTP API.
VideoDB skill provides comprehensive video perception, indexing, search, and editing capabilities via the VideoDB Python SDK. The SKILL.md is exceptionally well-documented with clear sections, canonical prompts, error handling patterns, and a pitfalls table. The bundled ws_listener.py script is well-written with proper signal handling, auto-reconnect, and exponential backoff, but failed to run due to missing videodb package dependency.
videodbClean security profile — no hardcoded credentials, no shell injection risks, no suspicious network calls. Explicitly instructs agents NOT to handle API keys. Well-structured with good progressive disclosure via reference/ directory. The only script failed due to missing SDK dependency, which is expected in an audit environment.