This skill equips an agent to handle common PDF tasks using Python libraries and command-line tools: extract text and tables (pdfplumber, pypdf), merge and split PDFs, rotate pages, add watermarks, perform OCR on scanned documents (pytesseract + pdf2image), extract images, fill forms, and create PDFs (reportlab). It also documents command-line utilities (pdftotext, qpdf, pdftk) and provides runnable code snippets.
Use when the user mentions a .pdf file or asks to read, extract, transform, or produce PDFs—examples include converting scanned documents to searchable text, extracting tabular data into spreadsheets, merging reports, adding watermarks or passwords, and programmatically generating reports.
Compatible with agents that can run Python or command-line tools and have access to the filesystem for reading/writing PDF files.
This skill has not been reviewed by our automated audit pipeline yet.
Gmail Automation via Rube MCP
Automate Gmail actions (send, reply, search, labels, drafts, attachments) through a Rube MCP Gmail toolkit with best-practice tool sequences and pitfalls noted.
Nano Banana 2 — Gemini 3.1 Flash Image Preview
Run Google Gemini 3.1 Flash Image Preview via inference.sh CLI: text-to-image, image editing, multi-image input, and Google Search grounding.