This PDF skill equips an agent to operate on PDF documents reliably: extract text and tables, merge or split files, rotate and watermark pages, and run OCR on scanned documents. It emphasizes reproducible scripted operations and verifies outputs (page counts, extraction quality) so downstream analysis or manuscript workflows remain auditable.
Use this skill whenever a user asks the agent to interact with PDF artifacts — for example: extracting tables from a research paper, merging multiple reports into a single deliverable, splitting large scans into chapters, applying watermarks for distribution, or OCRing scanned pages for searchable text. It’s suited to reproducible research pipelines and document-prep tasks.
Best for agents with Python runtime access and file I/O (e.g., Claude Code, Copilot/Code assistants, or any agent able to run pypdf scripts). Suitable for paired-research assistants and workflow automation agents.
This skill has not been reviewed by our automated audit pipeline yet.