Get urls for downloading files for tokenization.
A dictionary containing urls for original GPT2 tokenizaiton and GPT-NeoX tokenization schemes
previous
cerebras.modelzoo.data_preparation.nlp.pile.download.download_tokenizer_files
next
cerebras.modelzoo.data_preparation.nlp.pile.download.get_urls_from_split