YarnRotaryConfig¶
Module: fast_llm.layers.attention.rotary.config
Variant of: RotaryConfig — select with type: yarn
Inherits from: DefaultRotaryConfig, RotaryConfig, ModuleConfig
Fields¶
theta—architecture-
Type:
floatDefault:10000Scale for the rotary positional embeddings
attention_factor—feature- Type:
floatorNoneDefault:None beta_fast—feature- Type:
floatDefault:32.0 beta_slow—feature- Type:
floatDefault:1.0 original_context_length—feature- Type:
intDefault:8192 scale_factor—feature- Type:
floatDefault:8.0