Skip to content

YarnRotaryConfig

Module: fast_llm.layers.attention.rotary.config

Variant of: RotaryConfig — select with type: yarn

Inherits from: DefaultRotaryConfig, RotaryConfig, ModuleConfig

Fields

attention_factorarchitecture
Type: float or None    Default: None
beta_fastarchitecture
Type: float    Default: 32.0
beta_slowarchitecture
Type: float    Default: 1.0
original_context_lengtharchitecture
Type: int    Default: 8192
scale_factorarchitecture
Type: float    Default: 8.0
thetaarchitecture

Type: float    Default: 10000

Scale for the rotary positional embeddings