cerebras.modelzoo.tools.checkpoint_converters.starcoder.ConfigConverter_StarcoderModel_HF_CS21#
- class cerebras.modelzoo.tools.checkpoint_converters.starcoder.ConfigConverter_StarcoderModel_HF_CS21[source]#
Bases:
cerebras.modelzoo.tools.checkpoint_converters.starcoder.ConfigConverter_StarcoderModel_HF_CS20
CS 2.1 config is the same as CS 2.0
Methods
assert_factory_fn
Determines whether muP -> sP conversion is should be attempted.
convert
convert_all_keys
convert_attention_type
Converts all keys in a config from converter_indices.direction format to the other format.
Attempts to convert the old key by matching against the list of conversion rules.
file_formats
formats
get_converter_indices
load
post_config_convert
pre_config_convert
Copies value that exists at old_state_dict's old_key to new_state_dict's new_key.
save
supports_conversion
Determines whether muP -> sP conversion is supported for this model
- supports_mup_conversion()[source]#
Determines whether muP -> sP conversion is supported for this model
- __init__()#
- attempt_mup_to_sp() bool #
Determines whether muP -> sP conversion is should be attempted. Some HF models (such as BTLM) should not attempt muP -> sP conversion since they can natively handle muP.
- convert_helper(config, converter_indices: cerebras.modelzoo.tools.checkpoint_converters.base_converter.FormatIndices, drop_unmatched_keys: bool = False, no_progress_bar: bool = True, debug: bool = False)#
Converts all keys in a config from converter_indices.direction format to the other format. Conversion will fail if at least one of the keys did not match on any conversion rules and drop_unmatched_keys is not enabled. Returns the newly converted config.
- convert_key(old_key: str, old_state_dict: collections.OrderedDict, new_state_dict: collections.OrderedDict, from_index: int, match_start: int = 0, prefix: str = '', action_fn_args: Optional[dict] = None, debug: bool = False) None #
Attempts to convert the old key by matching against the list of conversion rules. The first rule to match is used for conversion (i.e. even if multiple rules would match, the latter ones are never used). Returns True if a conversion occurred.
- static replaceKey(old_key: str, new_key: str, old_state_dict: collections.OrderedDict, new_state_dict: collections.OrderedDict, from_index: int, action_fn_args: Optional[dict] = None) None #
Copies value that exists at old_state_dict’s old_key to new_state_dict’s new_key.