langtest.utils.hf_utils.build_dataset
- build_dataset(dataset_name: str, dataset_subset: str, label_name: str, text_fields: List[str], natural_language_labels: List[str]) → Tuple[Dict[str, str], Dict[str, str], bool]
Builds train and test dictionaries from the supplied dataset details.
- Parameters:
dataset_name (str) – The name of the dataset.
dataset_subset (str) – The name of the dataset subset.
label_name (str) – The name of the label.
text_fields (List[str]) – The list of text fields.
natural_language_labels (List[str]) – The list of natural language labels.
- Returns:
A tuple containing the train dictionary, the test dictionary, and a boolean indicating whether validation data is present.
- Return type:
Tuple[Dict[str, str], Dict[str, str], bool]
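The shape of the return value can be illustrated with a small, self-contained sketch. This is not langtest's implementation: `build_dataset_sketch` and `TOY_SPLITS` are hypothetical names, the function takes in-memory splits instead of a dataset name and subset so that it runs without a download, and the assumption that each dictionary maps joined text to a natural language label is a guess based only on the documented `Dict[str, str]` type.

```python
from typing import Dict, List, Tuple

# Hypothetical in-memory splits standing in for a downloaded dataset.
TOY_SPLITS = {
    "train": [
        {"sentence": "great film", "label": 1},
        {"sentence": "dull plot", "label": 0},
    ],
    "validation": [{"sentence": "fine acting", "label": 1}],
    "test": [{"sentence": "weak ending", "label": 0}],
}


def build_dataset_sketch(
    splits: Dict[str, List[dict]],
    label_name: str,
    text_fields: List[str],
    natural_language_labels: List[str],
) -> Tuple[Dict[str, str], Dict[str, str], bool]:
    """Illustrative stand-in only; the text -> label layout is an assumption."""

    def to_dict(rows: List[dict]) -> Dict[str, str]:
        # Join the requested text fields and map each integer label to its name.
        return {
            " ".join(str(row[field]) for field in text_fields):
                natural_language_labels[row[label_name]]
            for row in rows
        }

    # The boolean reports whether a validation split exists.
    has_validation = "validation" in splits
    return to_dict(splits["train"]), to_dict(splits["test"]), has_validation


# Unpack the three-element tuple in the documented order.
train, test, has_validation = build_dataset_sketch(
    TOY_SPLITS,
    label_name="label",
    text_fields=["sentence"],
    natural_language_labels=["negative", "positive"],
)
```

The caller unpacks the result positionally, so the order (train, test, validation flag) matters more than the names chosen for the variables.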