Get the URLs for a given split of the dataset.
Parameters: split (str) – Split of the dataset to get URLs for.
Returns: List of URLs, each naming a jsonl.zst file to download.
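
The behavior described above can be illustrated with a minimal sketch. This is not the library's implementation: the function name `get_urls_sketch`, the base URL, the split names (`train`, `val`, `test`), the shard count, and the zero-padded file naming are all assumptions made for illustration only.

```python
from typing import List

# Hypothetical values, chosen only for this sketch; the real module may
# use a different host, directory layout, and number of shards.
BASE_URL = "https://example.com/pile"
NUM_TRAIN_SHARDS = 30


def get_urls_sketch(split: str) -> List[str]:
    """Return the list of jsonl.zst URLs for one dataset split (illustrative)."""
    if split == "train":
        # Assumed layout: the train split is sharded into numbered files.
        return [
            f"{BASE_URL}/train/{i:02d}.jsonl.zst"
            for i in range(NUM_TRAIN_SHARDS)
        ]
    if split in ("val", "test"):
        # Assumed layout: the smaller splits are single files.
        return [f"{BASE_URL}/{split}.jsonl.zst"]
    raise ValueError(f"Unknown split: {split!r}")
```

Under these assumptions, `get_urls_sketch("val")` returns a one-element list, while `get_urls_sketch("train")` returns one URL per shard. Raising `ValueError` on an unrecognized split is a design choice of the sketch, so that a typo fails early rather than producing an empty download list.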