CausalConv1dConfig
Module: fast_llm.layers.common.linear.config
Inherits from: AffineLinearBaseConfig, LinearBaseConfig, ModuleConfig
Fields
activation (architecture)
Type: ActivationType or None Default: None
Activation function applied after the convolution. None means no activation.
bias (architecture)
Type: OptionalParameterConfig Default: (sub-fields optional)
Use bias.
kernel_size (architecture)
Type: int Default: 4
Convolution kernel size.
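To illustrate how kernel_size and activation interact in a causal convolution, here is a minimal single-channel sketch in plain Python. It is not Fast-LLM's implementation: the function `causal_conv1d` and its arguments are hypothetical, and the zero left-padding of kernel_size - 1 positions is the usual convention for causal convolutions, assumed here rather than taken from this page.

```python
from typing import Callable, List, Optional, Sequence


def causal_conv1d(
    x: Sequence[float],
    kernel: Sequence[float],
    bias: float = 0.0,
    activation: Optional[Callable[[float], float]] = None,
) -> List[float]:
    """Hypothetical single-channel causal convolution.

    output[t] depends only on x[t - k + 1], ..., x[t], where k = len(kernel).
    """
    k = len(kernel)
    # Left-pad with k - 1 zeros so the output has the same length as the
    # input and no output position can see future inputs.
    padded = [0.0] * (k - 1) + list(x)
    out = []
    for t in range(len(x)):
        window = padded[t : t + k]
        y = sum(w * v for w, v in zip(kernel, window)) + bias
        # activation=None means the convolution output is returned unchanged.
        out.append(activation(y) if activation is not None else y)
    return out


x = [1.0, 2.0, 3.0, 4.0, 5.0]
kernel = [0.25, 0.25, 0.25, 0.25]  # kernel_size = 4, the documented default
y = causal_conv1d(x, kernel)
```

With the default kernel size of 4, each output position mixes the current input with the three preceding ones, so changing a later input never affects an earlier output.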
weight (architecture)
Type: ParameterConfig Default: (sub-fields optional)
Configuration for the weight.
lr_scale (feature)
Type: float or None Default: None
Scaling factor for the layer learning rate. Combines multiplicatively with the scale set by the parent layer and individual parameters, if applicable.
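The multiplicative combination described for lr_scale can be sketched as follows. This is an illustration only: the helper `combine_lr_scales` and all numeric values are hypothetical, and treating None as "no scaling" (a factor of 1.0) is an assumption, not something this page states.

```python
from typing import Optional


def combine_lr_scales(*scales: Optional[float]) -> float:
    """Multiply the given scales together, treating None as a factor of 1.0."""
    result = 1.0
    for scale in scales:
        if scale is not None:
            result *= scale
    return result


base_lr = 1e-3       # hypothetical optimizer learning rate
parent_scale = 0.5   # hypothetical scale set by the parent layer
layer_scale = 0.1    # hypothetical lr_scale of this layer
param_scale = None   # no per-parameter scale set (assumed to mean 1.0)

# Effective learning rate after combining all applicable scales.
effective_lr = base_lr * combine_lr_scales(parent_scale, layer_scale, param_scale)
```

Under these assumptions, the layer's parameters would train at base_lr * 0.5 * 0.1, since the unset per-parameter scale contributes nothing.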