cerebras.modelzoo.data_preparation.data_preprocessing.custom_hook_examples.custom_hooks#

Functions

create_unique_hash

fetch_single_image

Fetches an image from the provided URL.

llama3_1_chat_formatted_data_hook

Extract multi-turn conversation data from text formatted with Llama 3.1 chat template and process it into a semantic_data_array format.

obelics_hook

Process obelics dataset examples into a semantic_data_array format.

ultra_chat_common_words_mask_hook

Process common words mask data from an Ultra Chat dataset into a semantic_data_array format.