cerebras.modelzoo.data.nlp.bert.BertTokenClassifierDataProcessor.BertTokenClassifierDataProcessor#
- class cerebras.modelzoo.data.nlp.bert.BertTokenClassifierDataProcessor.BertTokenClassifierDataProcessor(*args, **kwargs)[source]#
Bases:
torch.utils.data.IterableDataset
Reads csv file containing the input token ids, and label_ids. Creates attention_masks and sedment_ids on the fly
Methods
Classmethod to create the dataloader object.
Generator to read the data in chunks of size of data_buffer.