cerebras.modelzoo.common.run_cstorch_flow.GradScalerParams#
- class cerebras.modelzoo.common.run_cstorch_flow.GradScalerParams(loss_scaling_factor=1.0, initial_loss_scale=None, steps_per_increase=2000, min_loss_scale=None, max_loss_scale=None, max_gradient_norm=None, max_gradient_value=None)[source]#
Bases:
objectDataclass for parsing grad scaler params from optimizer params.
Methods
Returns an instance of GradScalerParams from a dictionary.
Attributes
initial_loss_scaleloss_scaling_factormax_gradient_normmax_gradient_valuemax_loss_scalemin_loss_scalesteps_per_increase