
from literature-harvest60
Reproducible local workflow to search PubMed/Crossref/OpenAlex, build candidate tables, download legally accessible PDFs/HTML, and deduplicate harvested literat
This skill packages a reproducible literature-harvesting pipeline for keyword-driven research. It automates searching scholarly APIs (PubMed/PMC, Europe PMC, Crossref, OpenAlex), compiles a candidate list, downloads legally accessible full texts (preferring PDFs, saving HTML/XML when necessary), performs a second-pass HTML-to-PDF chase, and deduplicates results into a clean run folder with a manifest.
Use when a researcher or agent needs to gather a large set of domain-specific papers for review, systematic searches, or downstream analysis (text-mining, citation mapping). It's useful for reproducible projects where provenance, deduplication, and manifesting are required.
literature_harvest/scripts/ stack (search and download helpers, run/continue scripts).references/ per the skill instructions.Best used by agent workflows that can run local scripts and manage filesystem outputs (research-assistant agents, data-engineering assistants, or CLI-capable agents).
This skill has not been reviewed by our automated audit pipeline yet.