cerebras.modelzoo.data.nlp.bert.BertClassifierDataProcessor.SST2DataProcessorConfig#
- class cerebras.modelzoo.data.nlp.bert.BertClassifierDataProcessor.SST2DataProcessorConfig(*args, **kwargs)[source]#
Bases:
cerebras.modelzoo.data.nlp.bert.BertClassifierDataProcessor.ClassifierDataProcessorConfig
Methods
check_for_deprecated_fields
check_literal_discriminator_field
copy
get_orig_class
get_orig_class_args
get_vocab_file
model_copy
model_post_init
post_init
Attributes
attn_mask_pad_id
batch_size
The batch size.
data_dir
Path to the data files to use.
discriminator
discriminator_value
do_lower
Flag to lower case the texts.
drop_last
Whether to drop last batch of epoch if it's an incomplete batch.
input_pad_id
is_training
Whether the data processor is used for training or validation.
labels_pad_id
max_sequence_length
model_config
num_workers
The number of PyTorch processes used in the dataloader.
persistent_workers
Whether or not to keep workers persistent between epochs.
prefetch_factor
The number of batches to prefetch in the dataloader.
shuffle
Whether or not to shuffle the dataset.
shuffle_seed
The seed used for deterministic shuffling.
vocab_file
Path to the vocabulary file.
data_processor