LanguageModelBatchPreprocessingConfig¶
Module: fast_llm.data.document.config
Inherits from: TokenPreprocessingConfig, LengthPreprocessingConfig, BatchPreprocessingConfig
Fields¶
causal- Type:
boolDefault:True distributed- Type: DistributedConfig Default: (sub-fields optional)
micro_batch_splits- Type:
intDefault:1 phase- Type:
PhaseTypeDefault:"training" predicted_tokens- Type:
intDefault:1 return_cumulative_sequence_lengths- Type:
boolDefault:False return_document_count- Type:
boolDefault:False return_document_index- Type:
boolDefault:False return_label_counts- Type:
boolDefault:False return_max_sequence_lengths- Type:
boolDefault:False return_position_index- Type:
boolDefault:False return_prediction_mask- Type:
boolDefault:False use_grpo_data- Type:
boolDefault:False use_loss_masking_spans- Type:
boolDefault:True use_preference_spans- Type:
boolDefault:False vision_encoder- Type: PatchPreprocessingConfig or
NoneDefault:None vocab_size- Type:
intorNoneDefault:None