всички настройки
forky  ] [  sid  ]
[ Източник: simdutf  ]

Пакет: libsimdutf-tools (8.0.0-1)

Връзки за libsimdutf-tools

Screenshot

Ресурси за Debian:

Изтегляне на пакет-източник simdutf.

Отговорници:

Външни препратки:

Подобни пакети:

Fast Unicode validation and transcoding - utilities

Most modern software relies on the Unicode standard. In memory, Unicode strings are represented using either UTF-8 or UTF-16. The UTF-8 format is the de facto standard on the web (JSON, HTML, etc.) and it has been adopted as the default in many popular programming languages (Go, Zig, Rust, Swift, etc.). The UTF-16 format is standard in Java, C# and in many Windows technologies.

Not all sequences of bytes are valid Unicode strings. It is unsafe to use Unicode strings in UTF-8 and UTF-16LE without first validating them. Furthermore, we often need to convert strings from one encoding to another, by a process called transcoding. For security purposes, such transcoding should be validating: it should refuse to transcode incorrect strings.

This library provide fast Unicode functions such as

 * ASCII, UTF-8, UTF-16LE/BE and UTF-32 validation, with and without error
   identification,
 * Latin1 to UTF-8 transcoding,
 * Latin1 to UTF-16LE/BE transcoding
 * Latin1 to UTF-32 transcoding
 * UTF-8 to Latin1 transcoding, with or without validation, with and without
   error identification,
 * UTF-8 to UTF-16LE/BE transcoding, with or without validation, with and
   without error identification,
 * UTF-8 to UTF-32 transcoding, with or without validation, with and without
   error identification,
 * UTF-16LE/BE to Latin1 transcoding, with or without validation, with and
   without error identification,
 * UTF-16LE/BE to UTF-8 transcoding, with or without validation, with and
   without error identification,
 * UTF-32 to Latin1 transcoding, with or without validation, with and without
   error identification,
 * UTF-32 to UTF-8 transcoding, with or without validation, with and without
   error identification,
 * UTF-32 to UTF-16LE/BE transcoding, with or without validation, with and
   without error identification,
 * UTF-16LE/BE to UTF-32 transcoding, with or without validation, with and
   without error identification,
 * From an UTF-8 string, compute the size of the Latin1 equivalent string,
 * From an UTF-8 string, compute the size of the UTF-16 equivalent string,
 * From an UTF-8 string, compute the size of the UTF-32 equivalent string
   (equivalent to UTF-8 character counting),
 * From an UTF-16LE/BE string, compute the size of the Latin1 equivalent
   string,
 * From an UTF-16LE/BE string, compute the size of the UTF-8 equivalent
   string,
 * From an UTF-32 string, compute the size of the UTF-8 or UTF-16LE equivalent
   string,
 * From an UTF-16LE/BE string, compute the size of the UTF-32 equivalent
   string (equivalent to UTF-16 character counting),
 * UTF-8 and UTF-16LE/BE character counting,
 * UTF-16 endianness change (UTF16-LE/BE to UTF-16-BE/LE),
 * WHATWG forgiving-base64 (with or without URL encoding) to binary,
 * Binary to base64 (with or without URL encoding).

The functions are accelerated using SIMD instructions (e.g., ARM NEON, SSE, AVX, AVX-512, RISC-V Vector Extension, LoongSon, POWER, etc.). When your strings contain hundreds of characters, we can often transcode them at speeds exceeding a billion characters per second. You should expect high speeds not only with English strings (ASCII) but also Chinese, Japanese, Arabic, and so forth. We handle the full character range (including, for example, emojis).

The library compiles down to a small library of a few hundred kilobytes. Our functions are exception-free and non allocating. We have extensive tests and extensive benchmarks.

This package ships several command line tools.

Други пакети, свързани с libsimdutf-tools

  • зависимости
  • препоръчани
  • предложени
  • enhances

Изтегляне на libsimdutf-tools

Изтегляне за всички налични архитектури
Архитектура Големина на пакета Големина след инсталиране Файлове
alpha (неофициална архитектура) 492,5 кБ3 424,0 кБ [списък на файловете]
amd64 484,0 кБ2 734,0 кБ [списък на файловете]
arm64 426,2 кБ2 838,0 кБ [списък на файловете]
armhf 378,1 кБ1 876,0 кБ [списък на файловете]
hppa (неофициална архитектура) 464,5 кБ2 841,0 кБ [списък на файловете]
i386 510,8 кБ2 776,0 кБ [списък на файловете]
loong64 457,3 кБ2 964,0 кБ [списък на файловете]
m68k (неофициална архитектура) 457,8 кБ2 596,0 кБ [списък на файловете]
ppc64 (неофициална архитектура) 526,4 кБ4 310,0 кБ [списък на файловете]
ppc64el 609,9 кБ4 630,0 кБ [списък на файловете]
riscv64 474,0 кБ2 310,0 кБ [списък на файловете]
s390x 514,8 кБ3 372,0 кБ [списък на файловете]
sh4 (неофициална архитектура) 487,4 кБ2 520,0 кБ [списък на файловете]
sparc64 (неофициална архитектура) 393,2 кБ4 126,0 кБ [списък на файловете]
x32 (неофициална архитектура) 483,5 кБ2 512,0 кБ [списък на файловете]