Skip to content

GradientScalerConfig

Module: fast_llm.engine.optimizer.config

Fields

constantfeature

Type: float or None    Default: None

Constant multiplier applied to the loss. Setting this disables dynamic scaling.

hysteresisfeature

Type: int    Default: 2

Number of failed updates to tolerate before lowering the learning rate in dynamic scaling (fp16).

initialfeature

Type: float    Default: 65536

Initial loss scale for dynamic scaling (fp16).

minimumfeature

Type: float    Default: 1.0

Minimum loss scale for dynamic scaling (fp16).

windowfeature

Type: int    Default: 1000

Interval between dynamic scaling growth (fp16).

Used in