cerebras.modelzoo.data.common.h5_map_dataset.dataset.MultimodalSimpleHDF5Dataset#
- class cerebras.modelzoo.data.common.h5_map_dataset.dataset.MultimodalSimpleHDF5Dataset(*args, **kwargs)[source]#
Bases:
cerebras.modelzoo.data.common.h5_map_dataset.dataset.MultiModalHDF5DatasetDataset class to handle image preprocessing in multimodal datasets.
This class is largely the same as the parent class MultimodalHDF5Dataset except with added support for multiple images and intermingling of text and images.
- Parameters
params (dict) – A dictionary containing parameters that HDF5Dataset accepts along with the following add-ons: - “img_data_dir” (str): the path to the directory containing the images. - “image_data_size” (list[int]): the final C x H x W shape of the image. - “transforms” (list[dict]): a specification of the torchvision transforms.
Methods
generate_sampleload_state_dictmappreprocess_imgstate_dictAttributes
by_sample