Package: r-cran-tokenizers (0.3.0-1)
Links for r-cran-tokenizers
Debian Resources:
Download Source Package r-cran-tokenizers:
- [r-cran-tokenizers_0.3.0-1.dsc]
- [r-cran-tokenizers_0.3.0.orig.tar.gz]
- [r-cran-tokenizers_0.3.0-1.debian.tar.xz]
Maintainers:
External Resources:
- Homepage [cran.r-project.org]
Similar packages:
GNU R fast, consistent tokenization of natural language text
Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the 'stringi' and 'Rcpp' packages for fast yet correct tokenization in 'UTF-8'.
Other Packages Related to r-cran-tokenizers
|
|
|
|
-
- dep: libc6 (>= 2.14) [amd64]
- GNU C Library: Shared libraries
also a virtual package provided by libc6-udeb
- dep: libc6 (>= 2.17) [arm64, ppc64el]
- dep: libc6 (>= 2.4) [not amd64, arm64, ppc64el]
-
- dep: libgcc-s1 (>= 3.0) [not armel, armhf]
- GCC support library
- dep: libgcc-s1 (>= 3.5) [armel, armhf]
-
- dep: libstdc++6 (>= 11)
- GNU Standard C++ Library v3
-
- dep: r-api-4.0
- virtual package provided by r-base-core
-
- dep: r-base-core (>= 4.2.2.20221110-1)
- GNU R core of statistical computation and graphics system
-
- dep: r-cran-rcpp (>= 0.12.3)
- GNU R package for Seamless R and C++ Integration
-
- dep: r-cran-snowballc (>= 0.5.1)
- Snowball stemmers based on the C libstemmer UTF-8 library
-
- dep: r-cran-stringi (>= 1.0.1)
- GNU R character string processing facilities
-
- rec: r-cran-testthat
- GNU R testsuite
-
- sug: r-cran-covr
- test coverage for GNU R packages
-
- sug: r-cran-knitr
- GNU R package for dynamic report generation using Literate Programming
-
- sug: r-cran-rmarkdown
- convert R markdown documents into a variety of formats
Download r-cran-tokenizers
Architecture | Package Size | Installed Size | Files |
---|---|---|---|
amd64 | 641.6 kB | 845.0 kB | [list of files] |
arm64 | 636.3 kB | 861.0 kB | [list of files] |
armel | 636.2 kB | 860.0 kB | [list of files] |
armhf | 637.6 kB | 860.0 kB | [list of files] |
i386 | 642.9 kB | 847.0 kB | [list of files] |
mips64el | 635.5 kB | 869.0 kB | [list of files] |
mipsel | 635.9 kB | 863.0 kB | [list of files] |
ppc64el | 642.1 kB | 925.0 kB | [list of files] |
s390x | 637.7 kB | 849.0 kB | [list of files] |