PDF

Trust Score 64/100

Handle PDF tasks: extract text/tables, merge/split, rotate, watermark, and OCR scanned documents with reproducible rules and verifiable outputs.

triggers:extract text from pdfmerge pdfssplit pdfocr scanned documentrotate pageswatermark pdf

GitHub SKILL.md

What it does

This PDF skill equips an agent to operate on PDF documents reliably: extract text and tables, merge or split files, rotate and watermark pages, and run OCR on scanned documents. It emphasizes reproducible scripted operations and verifies outputs (page counts, extraction quality) so downstream analysis or manuscript workflows remain auditable.

When to use it

Use this skill whenever a user asks the agent to interact with PDF artifacts — for example: extracting tables from a research paper, merging multiple reports into a single deliverable, splitting large scans into chapters, applying watermarks for distribution, or OCRing scanned pages for searchable text. It’s suited to reproducible research pipelines and document-prep tasks.

What's included

Scripts: none bundled in this SKILL.md (has_scripts=false).
References: none included (has_references=false).
Instructions: working rules that prioritize reproducibility, preservation of page order and metadata, validation of output counts, and provenance recording for transformations. A small code example shows how to inspect page count using pypdf.

Compatible agents

Best for agents with Python runtime access and file I/O (e.g., Claude Code, Copilot/Code assistants, or any agent able to run pypdf scripts). Suitable for paired-research assistants and workflow automation agents.

Audit Summary

This skill is a thin wrapper around Anthropic's official PDF skill, providing minimal instructions — just a brief SKILL.md with triggers, working rules, and a tiny pypdf code snippet. No scripts are bundled, no output contracts defined, and the actual implementation is left entirely to the agent. The skill is essentially a pointer to another repo's skill with a few bullet points.

Watch Out

No actual scripts or tooling — agent must implement PDF operations from scratch each time
Just a stub referencing Anthropic's official PDF skill, not a standalone skill

Notes

Derived/forked from Anthropic's official skills repo. Minimal content — mostly a pointer with brief working rules. No security concerns since there's no executable code. Low usefulness because it provides almost no automation; the agent could follow the same rules without this skill.

Information

Repository: sciclaw
Stars: 72

Trust Score

Overall64

Security95

Code Quality35

Architecture30

Usefulness40

Related Skills

Development Worktree

Create an isolated git worktree for feature work, auto-run project setup, and verify a clean test baseline before development.

Readwise Reader Document Management

Manage Readwise Reader documents: list, save, search, move, tag, highlight, export and bulk-edit via official and custom CLIs.

Bounty Hunter — Atlas

Persona skill: 'Atlas' — a profit-focused developer persona for discovering, evaluating and executing paid bounties or freelance tasks with ROI-aware workflows.

Junshi — Research Advisor

Daily strategic research advisor that scans arXiv/venues, digests papers, and proposes bold, ranked research ideas tailored to the user's profile.

Full Stack Builder

End-to-end builder that scaffolds, implements, tests, and optionally deploys web and API applications from a natural-language specification.

ezBookkeeping API Tools

Command-line API tools for ezBookkeeping: record and query transactions, retrieve accounts/categories/tags, and fetch exchange rates for self-hosted personal fi

Feishu Voice Sender

Convert MP3s and send them as native Feishu voice messages (playable voice clips) to users or groups.

Claw Bench

Benchmarking skill that guides an agent through a structured suite of capability tests and reporting steps for leaderboard submission.

Back to Skills

PDF