This test shuffles the labeled entities in the input to test the models robustness.
To test QA models, we are using QAEval from Langchain where we need to use the model itself or other ML model for evaluation, which can make mistakes.
swap_entities: min_pass_rate: 0.8 prob: 0.5 # Defaults to 1.0, which means all words will be transformed. parameters: count: 1 # Defaults to 1
You can adjust the level of transformation in the sentence by using the “
prob” parameter, which controls the proportion of words to be changed during
- min_pass_rate (float): Minimum pass rate to pass the test.
- prob (float): Controls the proportion of words to be changed.
- count (int): Number of variations of sentence to be constructed.
|I love Paris.||I love Istanbul.|
|Jack is sick today.||Adam is sick today.|