bpupgrade
bpupgrade — Poliqarp binary corpus converter
bpupgrade
{ -h
| --help
| -v
| --version
}
bpupgrade
[option
...] corpus-base-name
Poliqarp 1.3 lifts limitations on corpora sizes: it should be possible to build and process any reasonable corpus up to 2G segments. Unfortunately, the binary corpus format needed to be changed.
You can check version of your corpus by inspecting the
file:
*
.cdf
lack of the
indicates the old format;
*
.cdf
version = 1
string indicates the old format;
version = 2
string indicates the new one.
Sakura, the underlying library, does no longer support the old format. However, bpupgrade, the conversion utility is provided.