Getting an unzipped text database from 36 MB down to 6 MB via byte-pair encoding is pure magic ✨
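
For anyone curious how that kind of shrinkage can happen, here is a minimal sketch of byte-pair encoding used as a compressor. It is not the tool or code behind the 36 MB to 6 MB result: the function names `bpe_compress` and `bpe_decompress` and the `max_merges` parameter are made up for illustration, and the output is a list of integer symbols rather than a packed on-disk format. The idea is to keep replacing the most frequent adjacent pair of symbols with a fresh symbol and to remember each replacement so it can be undone.

```python
from collections import Counter


def bpe_compress(data: bytes, max_merges: int = 1000):
    """Repeatedly merge the most frequent adjacent pair into a new symbol.

    Returns the shortened symbol sequence and the merge table needed to undo it.
    Symbols 0..255 are raw bytes; merged symbols are numbered from 256 upward.
    """
    seq = list(data)
    merges = {}          # new symbol -> (left symbol, right symbol)
    next_symbol = 256

    for _ in range(max_merges):
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so another merge would not save anything

        merges[next_symbol] = pair
        out = []
        i = 0
        while i < len(seq):
            # Replace each non-overlapping occurrence of the pair, left to right.
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(next_symbol)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        next_symbol += 1

    return seq, merges


def bpe_decompress(seq, merges) -> bytes:
    """Undo the merges, newest first, until only raw byte values remain."""
    for symbol in sorted(merges, reverse=True):
        left, right = merges[symbol]
        out = []
        for s in seq:
            if s == symbol:
                out.extend((left, right))
            else:
                out.append(s)
        seq = out
    return bytes(seq)


if __name__ == "__main__":
    sample = b"the cat sat on the mat, the cat sat on the mat. " * 50
    symbols, table = bpe_compress(sample)
    assert bpe_decompress(symbols, table) == sample
    print(len(sample), "bytes ->", len(symbols), "symbols,", len(table), "merges")
```

How much this actually saves depends on how repetitive the text is and on how the symbols and the merge table get stored, so the sketch makes no claim about matching the ratio above.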