Adds label smoothing to the loss function. This is a workaround for applying label smoothing in our system.
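
The source does not show how the workaround is implemented, so the following is only a minimal sketch of the general label smoothing technique, not the Cerebras implementation. The function names (`smooth_labels`, `smoothed_cross_entropy`) and the `smoothing` parameter are hypothetical, and the sketch assumes the common formulation where each one-hot target is mixed with a uniform distribution over the classes.

```python
import math


def smooth_labels(labels, num_classes, smoothing=0.1):
    """Turn integer class labels into smoothed target distributions.

    Assumed formulation: target = (1 - smoothing) * one_hot + smoothing / num_classes.
    """
    off_value = smoothing / num_classes
    on_value = 1.0 - smoothing + off_value
    return [
        [on_value if c == label else off_value for c in range(num_classes)]
        for label in labels
    ]


def log_softmax(logits):
    """Numerically stable log-softmax over one list of logits."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum for x in logits]


def smoothed_cross_entropy(logits_batch, labels, smoothing=0.1):
    """Mean cross-entropy between softmax(logits) and the smoothed targets."""
    num_classes = len(logits_batch[0])
    targets = smooth_labels(labels, num_classes, smoothing)
    losses = []
    for logits, target in zip(logits_batch, targets):
        log_probs = log_softmax(logits)
        losses.append(-sum(t * lp for t, lp in zip(target, log_probs)))
    return sum(losses) / len(losses)


if __name__ == "__main__":
    logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 3.0]]
    labels = [0, 2]
    print(smoothed_cross_entropy(logits, labels, smoothing=0.0))  # plain cross-entropy
    print(smoothed_cross_entropy(logits, labels, smoothing=0.1))  # with label smoothing
```

With `smoothing=0.0` the targets reduce to one-hot vectors and the result is the ordinary cross-entropy, which makes it easy to check the smoothed loss against an unsmoothed baseline.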