commit | 912621162f45b125d7efcda2424ec44f5e4ccd36 | [log] [tgz] |
---|---|---|
author | Scott Lobdell <slobdell@google.com> | Tue Nov 23 14:01:18 2021 +0000 |
committer | Scott Lobdell <slobdell@google.com> | Tue Nov 23 14:14:20 2021 +0000 |
tree | f36e0f9da5855a1abb42f107a4e0733d79080e65 | |
parent | 5e0b6c65bcdfef2a67482a2f90dd5872443a9c64 [diff] | |
parent | 38080cd9fd6c2bc70dd18365b543617f6abbfd82 [diff] |
Merge TP1A.211013.002 Change-Id: I9c6f66b9d063fcabc23ab0c1ea28431e5ff9b9a5
marisa-trie
MARISA: Matching Algorithm with Recursively Implemented StorAge
0.2.6
Matching Algorithm with Recursively Implemented StorAge (MARISA) is a static and space-efficient trie data structure. And libmarisa is a C++ library to provide an implementation of MARISA. Also, the package of libmarisa contains a set of command line tools for building and operating a MARISA-based dictionary.
A MARISA-based dictionary supports not only lookup but also reverse lookup, common prefix search and predictive search.
The biggest advantage of libmarisa is that its dictionary size is considerably more compact than others. See below for the dictionary size of other implementations.
Implementation | Size (bytes) | Remarks |
---|---|---|
darts-clone | 376,613,888 | Compacted double-array trie |
tx-trie | 127,727,058 | LOUDS-based trie |
marisa-trie | 50,753,560 | MARISA trie |
You can get the latest version via git clone
. Then, you can generate a configure
script via autoreconf -i
. After that, you can build and install libmarisa and its command line tools via configure
and make
. For details, see also documentation in docs
.
$ git clone https://github.com/s-yata/marisa-trie.git $ cd marisa-trie $ autoreconf -i $ ./configure --enable-native-code $ make $ make install