Loss¶

Classes¶

Class	Description
CombinableLossConfig (abstract)	Base for losses that share the vocabulary softmax via `fused_core`, so several can be fused together
LanguageModelDPOLossConfig	Direct Preference Optimization (DPO) loss for alignment
LanguageModelDistillationLossConfig
LanguageModelGRPOLossConfig	Group-Relative Policy Optimization: per-token IS-ratio clipping
LanguageModelGSPOLossConfig	Group Sequence Policy Optimization: sequence-level geometric-mean IS-ratio clipping
LanguageModelLabelEntropyLossConfig
LanguageModelLossConfig (abstract)
LanguageModelPolicyGradientLossConfig (abstract)
LanguageModelZLossConfig	Z-loss regularization to prevent overconfidence
MonolithicLossConfig	A composite loss that runs one vocabulary softmax and shares it across its combinable child losses