Return the set of adjacent symbol pairs in a word.
The word is represented as a tuple of symbols, where each symbol is a variable-length string.
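
The behavior described above can be illustrated with a minimal sketch. It is not the library's source: the function name `get_pairs_sketch` is hypothetical, and the sketch assumes that "symbol pairs" means pairs of adjacent symbols, as is usual in byte-pair encoding.

```python
from typing import Set, Tuple


def get_pairs_sketch(word: Tuple[str, ...]) -> Set[Tuple[str, str]]:
    """Hypothetical illustration: collect every pair of adjacent symbols.

    `word` is a tuple of symbols; each symbol may be one or more characters.
    """
    # zip pairs each symbol with its right-hand neighbor; an empty or
    # single-symbol word yields no pairs.
    return set(zip(word, word[1:]))


# Symbols may have different lengths, e.g. after "e" and "r" were merged.
word = ("l", "o", "w", "er")
print(sorted(get_pairs_sketch(word)))
# [('l', 'o'), ('o', 'w'), ('w', 'er')]
```

Because the result is a set, a pair that occurs more than once in the word appears only once in the output.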