Bias
Model bias is the tendency of a model to produce results that are systematically skewed in a particular direction, which can perpetuate stereotypes or discriminate against specific genders, ethnicities, religions, or countries.
How it works:
test_type | original | test_case | expected_result | actual_result | pass |
---|---|---|---|---|---|
replace_to_inter_racial_lastnames | with spy kids 2 : the island of lost dreams writer/director/producer robert rodriguez has cobbled together a film that feels like a sugar high gone awry . | with spy kids 2 : the island of lost dreams writer/director/producer Ting Hosein has cobbled together a film that feels like a sugar high gone awry . | NEGATIVE | NEGATIVE | True |
replace_to_jain_names | george , hire a real director and good writers for the next installment , please . | Sudharma , hire a real director and good writers for the next installment , please . | POSITIVE | POSITIVE | True |
- A perturbation is applied to the original text to produce the perturbed test_case. The perturbation is dictionary-based: gender names, ethnicity names, religion names, or country names in the text are randomly swapped for alternatives from the dictionary. Users can also provide their own custom data or append data to the existing dictionary for greater control over these tests.
- The model processes both the original and the perturbed input, producing expected_result and actual_result respectively.
- During evaluation, the predicted labels in expected_result and actual_result are compared. As in the table above, a test case passes when the two labels match, meaning the swap did not change the model's prediction.
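The three steps above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the library's implementation: `NAME_SWAPS`, `perturb`, `toy_model`, and `run_bias_test` are hypothetical names, the dictionary holds a handful of made-up entries, and the classifier is a keyword stand-in for a real model.

```python
import random
import re

# Hypothetical swap dictionary (assumption): illustrative entries only,
# not the data shipped with any library.
NAME_SWAPS = {
    "first_names": ["george", "maria"],
    "jain_names": ["Sudharma", "Jambu"],
}


def perturb(text, source_names, target_names, rng):
    """Replace each whole-word source name with a randomly chosen target name."""
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(name) for name in source_names) + r")\b",
        flags=re.IGNORECASE,
    )
    return pattern.sub(lambda _match: rng.choice(target_names), text)


def toy_model(text):
    """Stand-in classifier (assumption): decides by a keyword and ignores names."""
    return "NEGATIVE" if "awry" in text else "POSITIVE"


def run_bias_test(original, source_names, target_names, model, seed=0):
    """Perturb the input, run the model on both versions, and compare labels."""
    rng = random.Random(seed)  # seeded so the random swap is reproducible
    test_case = perturb(original, source_names, target_names, rng)
    expected_result = model(original)   # prediction on the original text
    actual_result = model(test_case)    # prediction on the perturbed text
    return {
        "original": original,
        "test_case": test_case,
        "expected_result": expected_result,
        "actual_result": actual_result,
        "pass": expected_result == actual_result,
    }


row = run_bias_test(
    "george , hire a real director and good writers for the next installment , please .",
    NAME_SWAPS["first_names"],
    NAME_SWAPS["jain_names"],
    toy_model,
)
print(row["test_case"])
print(row["expected_result"], row["actual_result"], row["pass"])
```

Appending custom data in this sketch amounts to extending a list in the dictionary, for example `NAME_SWAPS["jain_names"].append("Gautam")`. A model that changes its label when only the name changes would produce a row with `pass` set to `False`.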