all options
wheezy  ] [  jessie  ] [  sid  ]
[ Source: ucto  ]

Package: ucto (0.5.3-3.1 and others)

Links for ucto


Debian Resources:

Download Source Package ucto:


External Resources:

Similar packages:

Unicode Tokenizer

Ucto can tokenize UTF-8 encoded text files (i.e. separate words from punctuation, split sentences, generate n-grams), and offers several other basic preprocessing steps (change case, count words/characters and reverse lines) that make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

Ucto is a product of the ILK Research Group, Tilburg University (The Netherlands).

If you are interested in machine parsing of UTF-8 encoded text files, e.g. to do scientific research in natural language processing, ucto will likely be of use to you.

Tags: Implemented in: C++, Role: Program

Other Packages Related to ucto

  • depends
  • recommends
  • suggests
  • enhances

Download ucto

Download for all available architectures
Architecture Version Package Size Installed Size Files
alpha (unofficial port) 0.5.3-3.1+b1 38.3 kB133.0 kB [list of files]
amd64 0.5.3-3.1+b1 37.9 kB124.0 kB [list of files]
arm64 (unofficial port) 0.5.3-3.1 38.0 kB124.0 kB [list of files]
armel 0.5.3-3.1+b1 38.1 kB123.0 kB [list of files]
armhf 0.5.3-3.1+b1 37.5 kB115.0 kB [list of files]
hppa (unofficial port) 0.5.3-3.1 39.3 kB131.0 kB [list of files]
hurd-i386 0.5.3-3.1+b1 37.8 kB119.0 kB [list of files]
i386 0.5.3-3.1+b1 37.9 kB76.0 kB [list of files]
kfreebsd-amd64 0.5.3-3.1+b1 37.9 kB86.0 kB [list of files]
kfreebsd-i386 0.5.3-3.1+b1 37.8 kB81.0 kB [list of files]
mips 0.5.3-3.1+b1 37.5 kB124.0 kB [list of files]
mipsel 0.5.3-3.1+b1 37.6 kB124.0 kB [list of files]
powerpc 0.5.3-3.1+b1 38.0 kB119.0 kB [list of files]
powerpcspe (unofficial port) 0.5.3-3.1 37.5 kB119.0 kB [list of files]
ppc64 (unofficial port) 0.5.3-3.1+b1 37.9 kB124.0 kB [list of files]
s390x 0.5.3-3.1+b1 38.1 kB124.0 kB [list of files]
sh4 (unofficial port) 0.5.3-3.1 42.0 kB123.0 kB [list of files]
sparc 0.5.3-3.1+b1 37.2 kB124.0 kB [list of files]
sparc64 (unofficial port) 0.5.3-3.1+b1 37.4 kB126.0 kB [list of files]
x32 (unofficial port) 0.5.3-3.1 37.8 kB119.0 kB [list of files]