YarnRotaryConfig

Module: fast_llm.layers.attention.rotary.config

Variant of: RotaryConfig — select with type: yarn

Inherits from: DefaultRotaryConfig, RotaryConfig, ModuleConfig
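
As a minimal sketch, the settings that select this variant can be written as a plain Python mapping. The keys and default values are the ones documented in the Fields list on this page. The variable name `rotary` and the place where this mapping nests inside a full Fast-LLM config are assumptions that this page does not specify. The derived context length assumes the usual YaRN reading of `scale_factor` as the ratio of extended to original context length.

```python
# Hypothetical variable name; keys and defaults mirror the fields documented on this page.
rotary = {
    "type": "yarn",                   # selects the YarnRotaryConfig variant
    "theta": 10000.0,
    "attention_factor": None,         # None leaves the value to be derived
    "beta_fast": 32.0,
    "beta_slow": 1.0,
    "original_context_length": 8192,
    "scale_factor": 8.0,
}

# Assumption: scale_factor is the ratio of extended to original context length.
extended_context_length = int(rotary["original_context_length"] * rotary["scale_factor"])
```

With the defaults above, the assumed extended context length is 8192 × 8 = 65536 positions.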

Fields

theta (architecture)

Type: float    Default: 10000

Scale for the rotary positional embeddings

attention_factor (feature)

Type: float or None    Default: None

beta_fast (feature)

Type: float    Default: 32.0

beta_slow (feature)

Type: float    Default: 1.0

original_context_length (feature)

Type: int    Default: 8192

scale_factor (feature)

Type: float    Default: 8.0
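
The fields above carry no descriptions on this page, so the following is a hedged sketch of how parameters with these names are commonly combined in the YaRN scheme. It is not taken from Fast-LLM's source, and the library's behavior may differ. The function names `yarn_inverse_frequencies` and `default_attention_factor`, the `head_dim` argument, and the `0.1 * ln(scale_factor) + 1` fallback for an unset `attention_factor` are assumptions based on the widely used YaRN formulation.

```python
import math


def yarn_inverse_frequencies(
    head_dim,
    theta=10000.0,
    scale_factor=8.0,
    original_context_length=8192,
    beta_fast=32.0,
    beta_slow=1.0,
):
    """Return per-pair rotary inverse frequencies blended the YaRN way (illustrative only)."""

    def correction_index(num_rotations):
        # Pair index whose wavelength completes `num_rotations` turns
        # over the original context length.
        return (
            head_dim
            * math.log(original_context_length / (num_rotations * 2 * math.pi))
            / (2 * math.log(theta))
        )

    low = max(math.floor(correction_index(beta_fast)), 0)
    high = min(math.ceil(correction_index(beta_slow)), head_dim - 1)
    if low == high:
        high += 0.001  # avoid dividing by zero in the ramp below

    frequencies = []
    for i in range(head_dim // 2):
        unscaled = theta ** (-2 * i / head_dim)      # original rotary frequency
        interpolated = unscaled / scale_factor        # stretched to the longer context
        ramp = min(max((i - low) / (high - low), 0.0), 1.0)
        # ramp = 0 keeps high-frequency pairs unscaled; ramp = 1 fully interpolates.
        frequencies.append(unscaled * (1.0 - ramp) + interpolated * ramp)
    return frequencies


def default_attention_factor(scale_factor):
    # Assumed fallback when attention_factor is None (common YaRN heuristic).
    return 0.1 * math.log(scale_factor) + 1.0


inv_freq = yarn_inverse_frequencies(head_dim=128)
```

In this sketch, pairs below the `beta_fast` cutoff keep their original frequency, pairs above the `beta_slow` cutoff are divided by `scale_factor`, and pairs in between are linearly blended.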