Skip to content

GPTTrainerConfig

Module: fast_llm.models.gpt.config

Variant of: TrainerConfig — select with type: gpt

Variant of: RunnableConfig — select with type: train_gpt

Inherits from: PretrainedGPTModelConfig, TrainerConfig, PretrainedFastLLMModelConfig

Fields

datacore

Type: GPTDataConfig    Default: (sub-fields optional)

Configuration for the dataset and model-independent preprocessing.

modelcore

Type: GPTModelConfig    Default: (sub-fields optional)

Configuration for the Fast-LLM model.

optimizercore

Type: OptimizerConfig    Default: (sub-fields optional)

Configuration for the training optimizer and learning rate schedule.

runcore

Type: RunConfig    Default: (sub-fields optional)

Global properties for the experiment.

schedulecore

Type: ScheduleConfig    Default: (sub-fields optional)

Configuration for the scheduling of each iteration.

trainingcore

Type: TrainingConfig    Default: (sub-fields optional)

Configuration for the training phases and global properties.

callbacksfeature

Type: dict[str, TrainerCallbackConfig]    Default: dict()

Configuration for training callbacks.

pretrainedfeature

Type: CheckpointLoadConfig    Default: (sub-fields optional)

Configuration for loading the configuration and state of a pretrained model.

reference_modelsfeature

Type: dict[str, PretrainedGPTModelConfig]    Default: dict()

Auxiliary models used during training, ex. for knowledge distillation.

profilinglogging

Type: ProfilingConfig    Default: (sub-fields optional)

Configuration for the optional profiling of GPU and CPU CUDA operations.