YarnRotaryConfig¶
Module: fast_llm.layers.attention.rotary.config
Variant of: RotaryConfig — select with type: yarn
Inherits from: DefaultRotaryConfig, RotaryConfig, ModuleConfig
Fields¶
attention_factor—architecture- Type:
floatorNoneDefault:None beta_fast—architecture- Type:
floatDefault:32.0 beta_slow—architecture- Type:
floatDefault:1.0 original_context_length—architecture- Type:
intDefault:8192 scale_factor—architecture- Type:
floatDefault:8.0 theta—architecture-
Type:
floatDefault:10000Scale for the rotary positional embeddings