Introduction
Get Started
Prepare your data
Cerebras PyTorch API
cerebras.pytorch
cerebras.pytorch.amp
cerebras.pytorch.optim
cerebras.pytorch.sparse
cerebras.pytorch.metrics
Cerebras Model Zoo
Cerebras Guides
Fundamentals
Support
Get urls given split of dataset.
split (str) – Split of dataset to get urls for.
List of urls, containing jsonl.zst file names for downloading.
previous
cerebras.modelzoo.data_preparation.nlp.pile.download.get_urls_for_tokenizer_files
next
cerebras.modelzoo.data_preparation.nlp.pile.download.main