Models / Gpt¶
Classes¶
| Class | Description |
|---|---|
| GPTBaseModelConfig | |
| GPTModelConfig | Configuration for the GPT model, including distributed, multi-stage, and HuggingFace checkpoint formats |
| GPTTrainerConfig | Top-level configuration for training a GPT model. Entry point for fast-llm train gpt |
| PretrainedGPTModelConfig | Configuration for a GPT model together with an optional pretrained checkpoint to load |