Files
the_information_nexus/tech_docs/linux/Poppler-Utils:-Tools-and-Usage.md
2024-05-01 12:28:44 -06:00

55 lines
2.3 KiB
Markdown

## `pdfdetach`
- **Summary**: Extracts embedded files (attachments) from a PDF.
- **Projects**: Extracting data files, source code, or other attachments embedded in PDFs for academic papers or reports.
- **Command**: `pdfdetach -saveall input.pdf`
## `pdffonts`
- **Summary**: Lists the fonts used in a PDF document.
- **Projects**: Font analysis for document design consistency, troubleshooting font issues in PDFs.
- **Command**: `pdffonts input.pdf`
## `pdfimages`
- **Summary**: Extracts images from a PDF file.
- **Projects**: Retrieving all images for documentation, presentations, or image analysis.
- **Command**: `pdfimages -all input.pdf output_prefix`
## `pdfinfo`
- **Summary**: Provides detailed information about a PDF, including metadata.
- **Projects**: Analyzing PDFs for metadata, such as author, creation date, number of pages.
- **Command**: `pdfinfo input.pdf`
## `pdfseparate`
- **Summary**: Splits a PDF document into individual pages.
- **Projects**: Extracting specific pages from a document for separate use or analysis.
- **Command**: `pdfseparate input.pdf output_%d.pdf`
## `pdftocairo`
- **Summary**: Converts PDF documents to other formats like PNG, JPEG, PS, EPS, SVG.
- **Projects**: Creating thumbnails, converting PDFs for web use, generating vector images from PDFs.
- **Command**: `pdftocairo -png input.pdf output`
## `pdftohtml`
- **Summary**: Converts a PDF file to HTML.
- **Projects**: Converting PDFs to HTML for web publishing, extracting content for web use.
- **Command**: `pdftohtml -c input.pdf output.html`
## `pdftoppm`
- **Summary**: Converts PDF pages to image formats like PNG or JPEG.
- **Projects**: Creating high-quality images from PDF pages for presentations or documentation.
- **Command**: `pdftoppm -png input.pdf output`
## `pdftops`
- **Summary**: Converts a PDF to PostScript format.
- **Projects**: Preparing PDFs for printing or for use in graphics applications.
- **Command**: `pdftops input.pdf output.ps`
## `pdftotext`
- **Summary**: Converts a PDF to plain text.
- **Projects**: Extracting text for analysis, archiving, or conversion to other text formats.
- **Command**: `pdftotext input.pdf output.txt`
## `pdfunite`
- **Summary**: Merges several PDF files into one.
- **Projects**: Combining multiple PDF documents into a single file for reports or booklets.
- **Command**: `pdfunite input1.pdf input2.pdf output.pdf`