etch  ] [  etch-m68k  ] [  lenny  ] [  squeeze  ] [  sid  ]
[ Source: tesseract  ]

Package: tesseract-ocr (2.04-1)

Command line OCR tool

The Tesseract OCR engine was originally developed at HP between 1985 and 1995. It was open-sourced by HP and UNLV in 2005 and Google has lead further development.

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. It will read a binary, grey or color image and output text.

Tags: Accessibility Support: Text Recognition (OCR), Implemented in: C++, User Interface: Command Line, Role: Program, Scope: Utility, Purpose: Data Conversion, Works with: Image, Raster Image, Text

Other Packages Related to tesseract-ocr

  • depends
  • recommends
  • suggests

Download tesseract-ocr

Download for all available architectures
Architecture Package Size Installed Size Files
amd64 1,020.2 kB3064 kB [list of files]
armel 1,067.6 kB3872 kB [list of files]
hppa 1,256.5 kB4948 kB [list of files]
i386 978.3 kB2984 kB [list of files]
ia64 1,439.3 kB7496 kB [list of files]
mips 1,166.3 kB5644 kB [list of files]
mipsel 1,141.2 kB5640 kB [list of files]
powerpc 1,008.6 kB3700 kB [list of files]
s390 1,113.7 kB4020 kB [list of files]
sparc 1,150.0 kB4680 kB [list of files]