Skip to content

DatasetDiscoveryConfig

Module: fast_llm.data.preparation.dataset_discovery.config

Variant of: RunnableConfig — select with type: prepare_dataset_discovery

Variant of: DatasetPreparatorConfig — select with type: dataset_discovery

Inherits from: DatasetPreparatorConfig, RunnableConfig

Fields

directorycore

Type: Path    Default: (required)

Directory to search for datasets recursively

outputcore

Type: Path    Default: (required)

Output path for the generated config YAML file

ignore_pathsoptional

Type: list[Path]    Default: list()

List of paths to ignore during dataset discovery (can be absolute or relative to directory)