cerebras.modelzoo.data.common.h5_map_dataset.dataset.MultiModalHDF5Dataset
- class cerebras.modelzoo.data.common.h5_map_dataset.dataset.MultiModalHDF5Dataset
- Bases: cerebras.modelzoo.data.common.h5_map_dataset.dataset.HDF5Dataset
- Specialized HDF5 dataset class that handles image preprocessing in multimodal datasets. Functionality is largely the same as HDF5Dataset, with added image loading and preprocessing.
- Parameters: params, a dictionary containing the following added fields:
  - "img_data_dir" (str): the path to the directory containing the images.
  - "fp16_type" (str): the half-precision dtype the image is cast to.
  - "image_data_size" (list[int]): the final C x H x W shape of the image.
  - "transforms" (list[dict]): a specification of the torchvision transforms.
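As an illustration of the fields listed above, the following sketch builds a `params` dictionary and checks it for the four documented keys. Only the key names and their types come from this page. Every value, the per-transform dictionary layout (`"name"`, `"size"`), and the helper `check_multimodal_params` are assumptions made for the example and are not part of the Model Zoo API.

```python
# Hypothetical values: only the four key names and their documented
# types are taken from the class description.
params = {
    "img_data_dir": "/path/to/images",   # str: directory containing the images
    "fp16_type": "bfloat16",             # str: half dtype for the image (assumed value)
    "image_data_size": [3, 224, 224],    # list[int]: final C x H x W shape
    "transforms": [                      # list[dict]: torchvision transform spec
        {"name": "resize", "size": [224, 224]},  # assumed per-transform schema
    ],
}


def check_multimodal_params(params):
    """Return a list of problems found in the documented added fields.

    This helper is illustrative only; it is not provided by the library.
    """
    expected_types = {
        "img_data_dir": str,
        "fp16_type": str,
        "image_data_size": list,
        "transforms": list,
    }
    problems = []
    for key, expected in expected_types.items():
        if key not in params:
            problems.append(f"missing field: {key}")
        elif not isinstance(params[key], expected):
            problems.append(f"{key} should be of type {expected.__name__}")

    # The shape is documented as C x H x W, so expect exactly three ints.
    size = params.get("image_data_size")
    if isinstance(size, list):
        if len(size) != 3 or not all(isinstance(d, int) for d in size):
            problems.append("image_data_size should be three ints (C, H, W)")

    # Each transform is documented as a dict.
    transforms = params.get("transforms")
    if isinstance(transforms, list):
        if not all(isinstance(t, dict) for t in transforms):
            problems.append("each entry in transforms should be a dict")

    return problems


problems = check_multimodal_params(params)
```

Running a check like this before constructing the dataset can surface a missing or mistyped field early. The fields inherited from HDF5Dataset are not covered here.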
 - Methods
   - generate_sample: Generates an empty tensor with the same shape and dtype as a sample from its dataset.
   - map
   - preprocess_img
 - Attributes
   - by_sample
   - seed
 - generate_sample()
- Generates an empty tensor with the same shape and dtype as a sample from its dataset. 
 - __call__(*args: Any, **kwargs: Any) -> Any
- Call self as a function. 
 - static __new__(cls, *args: Any, **kwargs: Any) -> Any