[ Source: ocrmypdf  ]

Package: ocrmypdf (4.3.5-3)

add an OCR text layer to PDF files

OCRmyPDF generates a searchable PDF/A file from a regular PDF that contains only images, adding an OCR text layer so that its contents can be searched.

It uses the Tesseract OCR engine and so supports all the languages that Tesseract does.

Some other main features:

  * Places OCR text accurately below the image to ease copy / paste
  * Keeps the exact resolution of the original embedded images
  * When possible, inserts OCR information as a lossless operation
    without rendering vector information
  * Keeps file size about the same
  * If requested, deskews and/or cleans the image before performing OCR
  * Validates input and output files
  * Provides debug mode to enable easy verification of the OCR results
  * Processes pages in parallel when more than one CPU core is
    available
  * Battle-tested on thousands of PDFs, a test suite and continuous
    integration
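
As an illustration of how the optional features listed above might be switched on, here is a minimal Python sketch that assembles an ocrmypdf command line and runs it. The helper names (build_ocrmypdf_command, run_ocrmypdf) and the file names are hypothetical. The flags used (-l, --deskew, --clean, --jobs) are assumed to be available in the packaged version; check `ocrmypdf --help` on your system before relying on them.

```python
import shutil
import subprocess
from typing import List, Sequence


def build_ocrmypdf_command(
    input_pdf: str,
    output_pdf: str,
    languages: Sequence[str] = ("eng",),
    deskew: bool = False,
    clean: bool = False,
    jobs: int = 0,
) -> List[str]:
    """Assemble an argv list for the ocrmypdf command-line tool.

    Hypothetical helper: the flag names are assumptions about the
    installed ocrmypdf version, not something this page confirms.
    """
    command = ["ocrmypdf"]
    # One -l option per Tesseract language code (for example "eng", "deu").
    for language in languages:
        command += ["-l", language]
    if deskew:
        # Straighten skewed pages before OCR.
        command.append("--deskew")
    if clean:
        # Clean pages before OCR (ocrmypdf uses the unpaper tool for this).
        command.append("--clean")
    if jobs > 0:
        # Limit or raise the number of pages processed in parallel.
        command += ["--jobs", str(jobs)]
    command += [input_pdf, output_pdf]
    return command


def run_ocrmypdf(command: List[str]) -> int:
    """Run the assembled command and return its exit status."""
    if shutil.which(command[0]) is None:
        raise FileNotFoundError("ocrmypdf is not installed or not on PATH")
    return subprocess.run(command, check=False).returncode


if __name__ == "__main__":
    # Only print the command here; call run_ocrmypdf() to execute it.
    print(build_ocrmypdf_command("scan.pdf", "scan-ocr.pdf", deskew=True, jobs=2))
```

Building the argument list separately from running it keeps the sketch testable on machines where ocrmypdf is not installed, and passing a list to subprocess avoids shell quoting problems with file names that contain spaces.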

Download ocrmypdf

Download for all available architectures
Architecture  Package Size  Installed Size  Files
all           57.2 kB       235.0 kB        [list of files]