langtest.datahandler.datasource.JSONLDataset
- class JSONLDataset(file_path: str, task: TaskManager)
Bases: BaseDataset
Class to handle JSONL datasets. Subclass of BaseDataset.
- __init__(file_path: str, task: TaskManager) → None
Initializes JSONLDataset object.
- Parameters:
file_path (str) – Path to the data file.
task (TaskManager) – Task to perform.
Methods
__init__(file_path, task) – Initializes JSONLDataset object.
export_data(data, output_path) – Exports the data to the corresponding format and saves it to 'output_path'.
load_data(*args, **kwargs) – Loads data from a JSONL file and formats it into a list of Sample objects.
load_raw_data() – Loads data from a JSONL file into a list of dictionaries.
Attributes
COLUMN_NAMES
data_sources
supported_tasks
- export_data(data: List[Sample], output_path: str)
Exports the data to the corresponding format and saves it to ‘output_path’.
- Parameters:
data (List[Sample]) – data to export
output_path (str) – path to save the data to
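The JSONL layout that an export targets can be illustrated with the standard library alone. This is a minimal sketch, not langtest's implementation: the helper name `write_jsonl` is hypothetical, and plain dictionaries stand in for Sample objects, whose serialization this page does not describe.

```python
import json
import os
import tempfile
from typing import Dict, List


def write_jsonl(rows: List[Dict], output_path: str) -> None:
    """Hypothetical helper: write one JSON object per line (JSONL)."""
    with open(output_path, "w", encoding="utf-8") as f:
        for row in rows:
            # json.dumps emits no raw newlines, so each record stays on one line.
            f.write(json.dumps(row, ensure_ascii=False) + "\n")


# Assumption: plain dicts stand in for langtest Sample objects.
rows = [
    {"text": "hello", "label": "greeting"},
    {"text": "bye", "label": "farewell"},
]

with tempfile.TemporaryDirectory() as tmp_dir:
    out_path = os.path.join(tmp_dir, "out.jsonl")
    write_jsonl(rows, out_path)
    # Read the file back to inspect the line-per-record layout.
    with open(out_path, encoding="utf-8") as f:
        lines = f.read().splitlines()
```

Each element of `lines` is an independent JSON document, so a JSONL file can be appended to or processed as a stream without parsing the whole file.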
- load_data(*args, **kwargs) → List[Sample]
Loads data from a JSONL file and formats it into a list of Sample objects.
- Returns:
Loaded text data.
- Return type:
list[Sample]
- load_raw_data() → List[Dict]
Loads data from a JSONL file into a list of dictionaries.
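Reading a JSONL file into a list of dictionaries, which is the shape load_raw_data returns, might look like the sketch below. It uses only the standard library. The function name `read_jsonl`, the sample records, and the choice to skip blank lines are assumptions for illustration, not langtest's actual code.

```python
import json
import os
import tempfile
from typing import Dict, List


def read_jsonl(file_path: str) -> List[Dict]:
    """Hypothetical helper: parse one JSON object per non-empty line."""
    records: List[Dict] = []
    with open(file_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # assumption: blank lines are skipped rather than rejected
                records.append(json.loads(line))
    return records


# Build a small JSONL file with made-up records, then load it.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "sample.jsonl")
    with open(path, "w", encoding="utf-8") as f:
        f.write('{"question": "What is 2 + 2?", "answer": "4"}\n')
        f.write("\n")
        f.write('{"question": "Capital of France?", "answer": "Paris"}\n')
    records = read_jsonl(path)

print(len(records))
```

A loader such as load_data would then map each dictionary to a Sample according to the configured task. That mapping depends on the task and is not shown here.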