diff --git a/Poppler-Utils:-Tools-and-Usage.md b/Poppler-Utils:-Tools-and-Usage.md
new file mode 100644
index 0000000..0bbcd7f
--- /dev/null
+++ b/Poppler-Utils:-Tools-and-Usage.md
@@ -0,0 +1,54 @@
+## `pdfdetach`
+- **Summary**: Extracts embedded files (attachments) from a PDF.
+- **Projects**: Extracting data files, source code, or other attachments embedded in PDFs for academic papers or reports.
+- **Command**: `pdfdetach -saveall input.pdf`
+
+## `pdffonts`
+- **Summary**: Lists the fonts used in a PDF document.
+- **Projects**: Font analysis for document design consistency, troubleshooting font issues in PDFs.
+- **Command**: `pdffonts input.pdf`
+
+## `pdfimages`
+- **Summary**: Extracts images from a PDF file.
+- **Projects**: Retrieving all images for documentation, presentations, or image analysis.
+- **Command**: `pdfimages -all input.pdf output_prefix`
+
+## `pdfinfo`
+- **Summary**: Provides detailed information about a PDF, including metadata.
+- **Projects**: Analyzing PDFs for metadata, such as author, creation date, number of pages.
+- **Command**: `pdfinfo input.pdf`
+
+## `pdfseparate`
+- **Summary**: Splits a PDF document into individual pages.
+- **Projects**: Extracting specific pages from a document for separate use or analysis.
+- **Command**: `pdfseparate input.pdf output_%d.pdf`
+
+## `pdftocairo`
+- **Summary**: Converts PDF documents to other formats like PNG, JPEG, PS, EPS, SVG.
+- **Projects**: Creating thumbnails, converting PDFs for web use, generating vector images from PDFs.
+- **Command**: `pdftocairo -png input.pdf output`
+
+## `pdftohtml`
+- **Summary**: Converts a PDF file to HTML.
+- **Projects**: Converting PDFs to HTML for web publishing, extracting content for web use.
+- **Command**: `pdftohtml -c input.pdf output.html`
+
+## `pdftoppm`
+- **Summary**: Converts PDF pages to image formats like PNG or JPEG.
+- **Projects**: Creating high-quality images from PDF pages for presentations or documentation.
+- **Command**: `pdftoppm -png input.pdf output`
+
+## `pdftops`
+- **Summary**: Converts a PDF to PostScript format.
+- **Projects**: Preparing PDFs for printing or for use in graphics applications.
+- **Command**: `pdftops input.pdf output.ps`
+
+## `pdftotext`
+- **Summary**: Converts a PDF to plain text.
+- **Projects**: Extracting text for analysis, archiving, or conversion to other text formats.
+- **Command**: `pdftotext input.pdf output.txt`
+
+## `pdfunite`
+- **Summary**: Merges several PDF files into one.
+- **Projects**: Combining multiple PDF documents into a single file for reports or booklets.
+- **Command**: `pdfunite input1.pdf input2.pdf output.pdf`
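
The `pdfseparate` and `pdftotext` commands listed in this page can be chained. As a minimal sketch, assuming a POSIX shell and that both tools are installed, the script below splits a PDF into single-page files and then extracts the text of each page. The function names `page_pattern` and `split_and_extract` and the `page_` file prefix are illustrative choices, not part of Poppler.

```shell
#!/bin/sh
# Sketch only: helper names and the page_ prefix are hypothetical.

# Build the output pattern for pdfseparate; %d is replaced by the page number.
page_pattern() {
    printf '%s/page_%%d.pdf' "$1"
}

# Split INPUT into one PDF per page under OUTDIR, then write a .txt next to each.
split_and_extract() {
    input=$1
    outdir=$2
    mkdir -p "$outdir" || return 1
    pdfseparate "$input" "$(page_pattern "$outdir")" || return 1
    for page in "$outdir"/page_*.pdf; do
        [ -e "$page" ] || continue          # no pages were produced
        pdftotext "$page" "${page%.pdf}.txt" || return 1
    done
}

# Run only when an input file and an output directory are both given.
if [ "$#" -eq 2 ]; then
    for tool in pdfseparate pdftotext; do
        command -v "$tool" >/dev/null 2>&1 || {
            echo "$tool not found; install poppler-utils" >&2
            exit 1
        }
    done
    split_and_extract "$1" "$2"
fi
```

Saved under a name of your choosing (say `split_text.sh`), it could be run as `sh split_text.sh input.pdf pages`, which would leave `pages/page_1.pdf`, `pages/page_1.txt`, and so on. Splitting first keeps the text of each page in its own file, which is convenient when only some pages are needed.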