Package Details: ucto 0.34-1
Git Clone URL: | https://aur.archlinux.org/ucto.git (read-only) |
---|---|
Package Base: | ucto |
Description: | An advanced rule-based (regular-expression) and unicode-aware tokenizer for various languages. Tokenization is an essential first step in any NLP pipeline. |
Upstream URL: | https://languagemachines.github.io/ucto |
Keywords: | nlp tokenization tokenizer |
Licenses: | GPL3 |
Submitter: | proycon |
Maintainer: | proycon |
Last Packager: | proycon |
Votes: | 1 |
Popularity: | 0.000000 |
First Submitted: | 2014-11-28 18:58 (UTC) |
Last Updated: | 2024-09-12 09:57 (UTC) |
Dependencies (8)
- icu (icu-git [AUR])
- libfolia [AUR]
- libxml2 (libxml2-git [AUR], libxml2-2.9 [AUR])
- ticcutils [AUR]
- uctodata [AUR]
- autoconf (autoconf-git [AUR]) (make)
- autoconf-archive (autoconf-archive-git [AUR]) (make)
- libtool (libtool-git [AUR]) (make)