Skip to content

VisionEncoderConfig

Module: fast_llm.layers.vision.config

Inherits from: BlockConfig, ModuleConfig

Fields

adapterarchitecture

Type: MLPBaseConfig    Default: (sub-fields optional)

Configuration for the adapter layer.

embeddingsarchitecture

Type: PatchEmbeddingsConfig    Default: (sub-fields optional)

Configuration for the patch convolution layer.

encoderarchitecture

Type: BlockSequenceConfig    Default: (sub-fields optional)

Configuration for the vision decoder.

hidden_sizearchitecture

Type: int    Default: 1024

Size of the vision encoder main hidden dimension.

lr_scalefeature

Type: float or None    Default: None

Scaling factor for the layer learning rate. Combines multiplicatively with the scale set by the parent and child layers, if applicable.

normalizationfeature

Type: ImageNormalizationConfig    Default: (sub-fields optional)

Configuration for image normalization during preprocessing.

Used in