
A guided CLI workflow that extracts text from academic PDFs (PyMuPDF + Tesseract), generates structured Obsidian notes, and creates JSON Canvas critical-thinkin
PhD Deep Read packages a four-stage pipeline for turning academic PDFs into richly structured literature notes and critical-thinking canvases. It uses a Text-First decision tree (PyMuPDF for searchable pages with Tesseract OCR fallback) to extract text and images, then generates Obsidian-friendly markdown with YAML frontmatter and Dataview callouts. The skill also produces JSON Canvas files for deep analysis and includes verification steps to ensure output consistency.
Use this skill when processing individual or batches of academic PDFs for literature reviews, generating reproducible notes for Obsidian, or when you need structured synthesis and critique (assumptions, evidence assessment, future directions). It's appropriate for researchers, graduate students, and knowledge workers preparing reading corpora.
Works with agents that can run or orchestrate CLI/python tools (Claude Code, assistant shells, or local CLI wrappers). Best when the environment provides PyMuPDF and Tesseract for OCR and when the agent can read/write files for Obsidian integration.
Cette compétence n'a pas encore été examinée par notre pipeline d'audit automatisé.