[ Source: tesseract  ]

Package: tesseract-ocr (2.03-2)

Command line OCR tool

The Tesseract OCR engine was originally developed at HP between 1985 and 1995. It was open-sourced by HP and UNLV in 2005, and Google has led further development.

The Tesseract OCR engine was one of the top three engines in the 1995 UNLV Accuracy test. Little work was done on it between 1995 and 2006, but it is probably still one of the most accurate open source OCR engines available. It reads a binary, grey or color image and outputs text.
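
As a rough illustration of the image-in, text-out workflow described above, the following Python sketch wraps the command line tool. It assumes the commonly documented invocation form `tesseract <image> <output_base> [-l <lang>]`, and that the tool writes its result to `<output_base>.txt`. The helper names `build_command` and `ocr_image` are hypothetical and not part of the package. Which image formats are accepted depends on the installed version, so the TIFF file name in the usage note is only an assumption.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(image_path, output_base, lang=None):
    """Build the argument list: tesseract <image> <output_base> [-l <lang>]."""
    cmd = ["tesseract", str(image_path), str(output_base)]
    if lang:
        # Optional language code, e.g. "eng" (assumes that language data is installed)
        cmd += ["-l", lang]
    return cmd


def ocr_image(image_path, output_base, lang=None):
    """Run tesseract on an image and return the recognized text.

    Returns None when no tesseract binary is found on PATH.
    """
    if shutil.which("tesseract") is None:
        return None
    subprocess.run(build_command(image_path, output_base, lang), check=True)
    # Assumption: tesseract appends ".txt" to the output base name
    text_file = Path(str(output_base) + ".txt")
    return text_file.read_text(encoding="utf-8", errors="replace")
```

A call such as `ocr_image("scan.tif", "scan")` would then run the tool on a hypothetical `scan.tif` and return the contents of `scan.txt`, or `None` if tesseract is not installed.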

Tags: Accessibility Support: Text Recognition (OCR), Implemented in: C++, User Interface: Command Line, Role: Program, Scope: Utility, Purpose: Data Conversion, Works with: Image, Raster Image, Text

Download tesseract-ocr

Download for all available architectures
Architecture  Package Size  Installed Size  Files
alpha         1,030.9 kB    2692 kB         [list of files]
amd64         881.8 kB      2072 kB         [list of files]
arm           902.1 kB      2032 kB         [list of files]
armel         852.2 kB      1964 kB         [list of files]
hppa          1,029.4 kB    2272 kB         [list of files]
i386          818.1 kB      1992 kB         [list of files]
ia64          1,369.9 kB    4208 kB         [list of files]
mips          998.4 kB      2796 kB         [list of files]
mipsel        997.7 kB      2796 kB         [list of files]
powerpc       967.2 kB      2340 kB         [list of files]
s390          886.2 kB      2080 kB         [list of files]
sparc         845.3 kB      2028 kB         [list of files]