This skill equips an agent to handle common PDF tasks using Python libraries and command-line tools: extract text and tables (pdfplumber, pypdf), merge and split PDFs, rotate pages, add watermarks, perform OCR on scanned documents (pytesseract + pdf2image), extract images, fill forms, and create PDFs (reportlab). It also documents command-line utilities (pdftotext, qpdf, pdftk) and provides runnable code snippets.
Use when the user mentions a .pdf file or asks to read, extract, transform, or produce PDFs. Examples include converting scanned documents to searchable text, extracting tabular data into spreadsheets, merging reports, adding watermarks or passwords, and programmatically generating reports.
Compatible with agents that can run Python or command-line tools and have access to the filesystem for reading/writing PDF files.
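As a rough illustration of the split task listed above, the following sketch copies a selection of pages into a new file with pypdf. The helper names `parse_page_ranges` and `extract_pages` and the `"1-3,5"` range syntax are assumptions made for this example; they are not taken from the skill's own scripts.

```python
from pathlib import Path


def parse_page_ranges(spec: str, total_pages: int) -> list[int]:
    """Turn a 1-based spec such as "1-3,5" into sorted zero-based page indices.

    The spec syntax is a hypothetical convention chosen for this sketch.
    """
    indices = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, _, end_text = part.partition("-")
        start = int(start_text)
        end = int(end_text) if end_text else start
        if start < 1 or end > total_pages or start > end:
            raise ValueError(f"page range {part!r} is outside 1..{total_pages}")
        indices.update(range(start - 1, end))
    return sorted(indices)


def extract_pages(src: Path, dst: Path, spec: str) -> int:
    """Copy the pages named by spec from src into a new PDF at dst."""
    # Imported lazily so the range helper stays usable without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src)
    writer = PdfWriter()
    indices = parse_page_ranges(spec, len(reader.pages))
    for index in indices:
        writer.add_page(reader.pages[index])
    with open(dst, "wb") as handle:
        writer.write(handle)
    return len(indices)
```

Calling `extract_pages(Path("report.pdf"), Path("excerpt.pdf"), "1-3,5")` would write pages 1, 2, 3, and 5 of a hypothetical `report.pdf` to `excerpt.pdf`. Merging follows the same pattern: add pages from several readers to one writer before writing it out.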
Comprehensive PDF processing skill covering text extraction, merging, splitting, OCR, form filling, and watermarking. Nine scripts are provided; only the bounding-box tests ran cleanly. Six scripts failed because of missing Python dependencies (pypdf, pdfplumber, pdf2image, Pillow), and two required command-line arguments. SKILL.md is well-structured with clear instructions and progressive disclosure to REFERENCE.md and FORMS.md.
Python dependencies: pypdf, pdf2image, pdfplumber, Pillow
No security concerns. The monkey-patching in fill_fillable_fields.py is a workaround for a pypdf issue; it is functional but fragile. The skill is well documented with practical examples. The proprietary license is worth noting.
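Because most of the script failures reported above came from missing Python packages, a small preflight check can surface them before any script runs. This is a minimal sketch using only the standard library; the names `REQUIRED_MODULES` and `missing_dependencies` are hypothetical and not part of the skill. The one non-obvious mapping is that Pillow is imported as `PIL`.

```python
import importlib.util

# Distribution name -> importable module name. Pillow installs as "PIL";
# the other three import under their own names.
REQUIRED_MODULES = {
    "pypdf": "pypdf",
    "pdfplumber": "pdfplumber",
    "pdf2image": "pdf2image",
    "Pillow": "PIL",
}


def missing_dependencies(required=None):
    """Return the distribution names whose module cannot be found."""
    if required is None:
        required = REQUIRED_MODULES
    return [
        dist
        for dist, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing packages; try: pip install " + " ".join(missing))
    else:
        print("All PDF dependencies are importable.")
```

The check only confirms that each module can be located. It does not verify versions, and it does not cover the external programs that pdf2image and pytesseract rely on.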
Gmail Automation via Rube MCP
Automate Gmail actions (send, reply, search, labels, drafts, attachments) through a Rube MCP Gmail toolkit with best-practice tool sequences and pitfalls noted.
Nano Banana 2 — Gemini 3.1 Flash Image Preview
Run Google Gemini 3.1 Flash Image Preview via inference.sh CLI: text-to-image, image editing, multi-image input, and Google Search grounding.
XLSX / Spreadsheet Skill
Handle creation, editing, cleaning, and conversion of spreadsheet files (.xlsx, .xlsm, .csv, .tsv) with robust formulas, formatting, and recalculation workflows.
Baidu Search
Enable agents to perform real-time Chinese web and image searches via Baidu Qianfan API, including time filtering and relevance/authority signals.
PPTX — Presentation (.pptx) Skill
Create, read, edit, and QA .pptx presentations: extract text, generate thumbnails, edit templates, and convert slides for visual QA and exports.