Download The Pile dataset from eye.ai website.
args (argparse namespace) – Arguments for downloading the dataset.
split (str) – The subset of the PILE dataset to download.
previous
cerebras.modelzoo.data_preparation.nlp.pile.download.debug_or_download_individual_file
next
cerebras.modelzoo.data_preparation.nlp.pile.download.download_tokenizer_files