pdf.utils

Module Contents

Functions

extract_pdf_info(→ tuple[int, str])

Extracts the number of pages and text from a PDF.

pdf.utils.extract_pdf_info(content: SupportsRead[bytes], remove: str = '\x00') tuple[int, str][source]

Extracts the number of pages and text from a PDF.

Requires poppler.