cerebras.modelzoo.data.common.h5_map_dataset.dataset#

Classes

HDF5Dataset

Dynamically read samples from disk for using mapping paradigms.

HDF5DatasetConfig

MLMHDF5Dataset

Dataset class to handle text preprocessing in bert mlm datasets.

MultiModalHDF5Dataset

Dataset class to handle image preprocessing in multimodal datasets.

MultiModalHDF5DatasetConfig

MultimodalSimpleHDF5Dataset

Dataset class to handle image preprocessing in multimodal datasets.

MultimodalSimpleHDF5DatasetConfig