cerebras.modelzoo.data_preparation.data_preprocessing.finetuning_token_generator.FinetuningTokenGenerator
- class cerebras.modelzoo.data_preparation.data_preprocessing.finetuning_token_generator.FinetuningTokenGenerator(params, tokenizer, eos_id, pad_id)
Bases: object
Methods
- create_features_finetuning
- create_features_multimodal: Tokenize and encode the doc for text summarization.
- get_tokenized_semantic_regions
- pad_to_msl
- parse_semantic_data_array
- tokenize_data
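
The listing gives no signature or description for pad_to_msl. Its name and the constructor's pad_id argument suggest that it pads a token sequence to the maximum sequence length (MSL). The sketch below shows that general technique only. The function name pad_to_max_len, its parameters, and the truncate-then-right-pad behavior are assumptions for illustration and are not taken from the Cerebras ModelZoo implementation.

```python
from typing import List


def pad_to_max_len(token_ids: List[int], max_len: int, pad_id: int) -> List[int]:
    """Hypothetical helper (not the library's API).

    Truncates token_ids to at most max_len entries, then right-pads
    with pad_id so the result has exactly max_len entries.
    """
    trimmed = token_ids[:max_len]
    return trimmed + [pad_id] * (max_len - len(trimmed))


# Assumed values: three token ids, a maximum length of 5, and pad id 0.
padded = pad_to_max_len([5, 6, 7], max_len=5, pad_id=0)
```

Fixed-length output lets sequences be stacked into a single batch; whether the real method truncates, pads on the right, or also builds a loss mask is not stated in this page.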