cerebras.modelzoo.data.nlp.gpt.HuggingFaceIterableDataProcessorEli5.HuggingFaceIterableDataProcessorEli5Config#
- class cerebras.modelzoo.data.nlp.gpt.HuggingFaceIterableDataProcessorEli5.HuggingFaceIterableDataProcessorEli5Config(*args, **kwargs)[source]#
-
Methods
check_for_deprecated_fieldscheck_literal_discriminator_fieldcopyget_orig_classget_orig_class_argsmodel_copymodel_post_initpost_initAttributes
batch_sizeBatch size.
discriminatordiscriminator_valuedrop_lastIf True and the dataset size is not divisible by the batch size, the last incomplete batch will be dropped.
model_confignum_workersHow many subprocesses to use for data loading.
persistent_workersIf True, the data loader will not shutdown the worker processes after a dataset has been consumed once.
prefetch_factorNumber of batches loaded in advance by each worker.
shuffleFlag to enable data shuffling.
shuffle_bufferSize of shuffle buffer in samples.
shuffle_seedShuffle seed.
splitdata_processor