cerebras.modelzoo.tools.checkpoint_converters.bloom_hf_cs.Converter_BloomLMHeadModel_CS19_CS20#
- class cerebras.modelzoo.tools.checkpoint_converters.bloom_hf_cs.Converter_BloomLMHeadModel_CS19_CS20[source]#
Bases:
cerebras.modelzoo.tools.checkpoint_converters.gpt2_hf_cs.Converter_GPT2LMHeadModel_CS18_CS20
Bloom uses the GPT2 backbone.
Methods
convert
convert_all_keys
Converts all keys in a checkpoint from converter_indices.direction format to the other format.
Attempts to convert the old key by matching against the list of conversion rules.
converter_note
extract_model_dict
file_formats
formats
get_config_converter_class
get_converter_indices
init_output_checkpoint
load
match_indices
post_checkpoint_convert
Hook executes right after model conversion.
pre_checkpoint_convert
Hook executes right before model conversion.
Copies value that exists at old_state_dict's old_key to new_state_dict's new_key.
save
supports_conversion
- convert_helper(input_checkpoint, configs, converter_indices, output_checkpoint={}, drop_unmatched_keys=False, no_progress_bar=True, debug=False)#
Converts all keys in a checkpoint from converter_indices.direction format to the other format. Conversion will fail if at least one of the keys did not match on any conversion rules and drop_unmatched_keys is not enabled. Returns the newly converted checkpoint.
- convert_key(old_key, old_state_dict, new_state_dict, from_index, match_start=0, prefix='', action_fn_args=None, debug=False)#
Attempts to convert the old key by matching against the list of conversion rules. The first rule to match is used for conversion (i.e. even if multiple rules would match, the latter ones are never used). Returns True if a conversion occurred.
- post_model_convert(old_state_dict, new_state_dict, configs, converter_indices, drop_unmatched_keys, key_prefix='')#
Hook executes right after model conversion.
- pre_model_convert(old_state_dict, new_state_dict, configs, converter_indices, drop_unmatched_keys)#
Hook executes right before model conversion.
- static replaceKey(old_key, new_key, old_state_dict, new_state_dict, from_index, action_fn_args=None)#
Copies value that exists at old_state_dict’s old_key to new_state_dict’s new_key.