Pdf Extract

Extract text from PDF files for LLM processing

התקנה
$clawhub install pdf-extract

PDF Extract

Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.

Commands

# Extract all text from a PDF
pdf-extract "document.pdf"

# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5

Install

sudo dnf install poppler-utils