cerebras.modelzoo.data_preparation.nlp.bert
| Common pre-processing functions for BERTSUM data processing. | |
| Preprocessed CSV data generator for BERT pretraining from raw text documents. | |
| Script to write HDF5 files for MLM_only and MLM + NSP datasets. | |
| Common pre-processing functions, adapted with minor modifications from https://github.com/NVIDIA/DeepLearningExamples/blob/master/TensorFlow/LanguageModeling/BERT/run_ner.py | |