Skip to content

TrainerConfig

Abstract

This class cannot be instantiated directly. Use one of the variants listed below.

Module: fast_llm.engine.training.config

Variant of: RunnableConfig — select with type: train

Inherits from: PretrainedFastLLMModelConfig, ExperimentConfig, RunnableConfig

Fields

datacore

Type: DataConfig    Default: (sub-fields optional)

Configuration for the dataset and model-independent preprocessing.

modelcore

Type: FastLLMModelConfig    Default: (sub-fields optional)

Configuration for the Fast-LLM model.

optimizercore

Type: OptimizerConfig    Default: (sub-fields optional)

Configuration for the training optimizer and learning rate schedule.

runcore

Type: RunConfig    Default: (sub-fields optional)

Global properties for the experiment.

schedulecore

Type: ScheduleConfig    Default: (sub-fields optional)

Configuration for the scheduling of each iteration.

trainingcore

Type: TrainingConfig    Default: (sub-fields optional)

Configuration for the training phases and global properties.

callbacksfeature

Type: dict[str, TrainerCallbackConfig]    Default: dict()

Configuration for training callbacks.

pretrainedfeature

Type: CheckpointLoadConfig    Default: (sub-fields optional)

Configuration for loading the configuration and state of a pretrained model.

reference_modelsfeature

Type: dict[str, PretrainedFastLLMModelConfig]    Default: dict()

Auxiliary models used during training, ex. for knowledge distillation.

profilinglogging

Type: ProfilingConfig    Default: (sub-fields optional)

Configuration for the optional profiling of GPU and CPU CUDA operations.

Variants

Select a variant by setting type: to one of the following values.

type value Class Description
gpt GPTTrainerConfig Top-level configuration for training a GPT model. Entry point for fast-llm train gpt
multimodal MultiModalTrainerConfig