DatasetPreparatorDistributedConfig¶
Module: fast_llm.data.preparation.gpt_memmap.config
Fields¶
backend—optional-
Type:
strDefault:"gloo"Distributed backend to use.
timeout—optional-
Type:
intDefault:3600Timeout in seconds for torch distributed operations. Default is 3600.
rank—expert-
Type:
intDefault:NoneRank of the local process. Typically provided by torchrun or equivalent through the
RANKenvironment variable. world_size—expert-
Type:
intDefault:NoneSize of the world group. Typically provided by torchrun or equivalent through the
WORLD_SIZEenvironment variable.