Files
the_information_nexus/tech_docs/linux/Poppler-Utils:-Tools-and-Usage.md
2024-05-01 12:28:44 -06:00

2.3 KiB

pdfdetach

  • Summary: Extracts embedded files (attachments) from a PDF.
  • Projects: Extracting data files, source code, or other attachments embedded in PDFs for academic papers or reports.
  • Command: pdfdetach -saveall input.pdf

pdffonts

  • Summary: Lists the fonts used in a PDF document.
  • Projects: Font analysis for document design consistency, troubleshooting font issues in PDFs.
  • Command: pdffonts input.pdf

pdfimages

  • Summary: Extracts images from a PDF file.
  • Projects: Retrieving all images for documentation, presentations, or image analysis.
  • Command: pdfimages -all input.pdf output_prefix

pdfinfo

  • Summary: Provides detailed information about a PDF, including metadata.
  • Projects: Analyzing PDFs for metadata, such as author, creation date, number of pages.
  • Command: pdfinfo input.pdf

pdfseparate

  • Summary: Splits a PDF document into individual pages.
  • Projects: Extracting specific pages from a document for separate use or analysis.
  • Command: pdfseparate input.pdf output_%d.pdf

pdftocairo

  • Summary: Converts PDF documents to other formats like PNG, JPEG, PS, EPS, SVG.
  • Projects: Creating thumbnails, converting PDFs for web use, generating vector images from PDFs.
  • Command: pdftocairo -png input.pdf output

pdftohtml

  • Summary: Converts a PDF file to HTML.
  • Projects: Converting PDFs to HTML for web publishing, extracting content for web use.
  • Command: pdftohtml -c input.pdf output.html

pdftoppm

  • Summary: Converts PDF pages to image formats like PNG or JPEG.
  • Projects: Creating high-quality images from PDF pages for presentations or documentation.
  • Command: pdftoppm -png input.pdf output

pdftops

  • Summary: Converts a PDF to PostScript format.
  • Projects: Preparing PDFs for printing or for use in graphics applications.
  • Command: pdftops input.pdf output.ps

pdftotext

  • Summary: Converts a PDF to plain text.
  • Projects: Extracting text for analysis, archiving, or conversion to other text formats.
  • Command: pdftotext input.pdf output.txt

pdfunite

  • Summary: Merges several PDF files into one.
  • Projects: Combining multiple PDF documents into a single file for reports or booklets.
  • Command: pdfunite input1.pdf input2.pdf output.pdf