langtest.transform.utils.create_terminology#

create_terminology(ner_data: DataFrame) Dict[str, List[str]]#

Iterate over the DataFrame to create terminology from the predictions. IOB format converted to the IO.

Parameters:

ner_data – Pandas DataFrame that has 2 column, ‘text’ as string and ‘label’ as list of labels

Returns:

Dictionary of entities and corresponding list of words.