Add OCR Typo
This test checks if the NLP model can handle input text with common ocr typos. A ocr typo dictionary is used to apply most common ocr typos to the input data.
add_ocr_typo: min_pass_rate: 0.7 prob: 0.5 # Defaults to 1.0, which means all words will be transformed. parameters: count: 1 # Defaults to 1
You can adjust the level of transformation in the sentence by using the “
prob” parameter, which controls the proportion of words to be changed during
- min_pass_rate (float): Minimum pass rate to pass the test.
- prob (float): Controls the proportion of words to be changed.
- count (int): Number of variations of sentence to be constructed.
|This organization’s art can win tough acts.||Tbis organization’s a^rt c^an w^in tougb acts.|
|Anyone can join our community garden.||Anyone c^an j0in o^ur communitv gardcn.|