Bias

Model bias occurs when a model produces results that are systematically skewed in a particular direction, which can perpetuate stereotypes or discriminate against specific genders, ethnicities, religions, or countries.

How it works:

| test_type | original | test_case | expected_result | actual_result | pass |
|---|---|---|---|---|---|
| replace_to_inter_racial_lastnames | with spy kids 2 : the island of lost dreams writer/director/producer robert rodriguez has cobbled together a film that feels like a sugar high gone awry . | with spy kids 2 : the island of lost dreams writer/director/producer Ting Hosein has cobbled together a film that feels like a sugar high gone awry . | NEGATIVE | NEGATIVE | True |
| replace_to_jain_names | george , hire a real director and good writers for the next installment , please . | Sudharma , hire a real director and good writers for the next installment , please . | POSITIVE | POSITIVE | True |

  • The original text is perturbed to produce a test_case. The perturbation is dictionary-based: names associated with a gender, ethnicity, religion, or country are randomly swapped. Users can also supply their own custom data or append data to the existing dictionary, which gives greater control over these tests.
  • The model processes both the original and the perturbed input, producing expected_result and actual_result respectively.
  • During evaluation, the predicted labels in expected_result and actual_result are compared. As in the examples above, a test passes when the two labels match.
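
The three steps above can be illustrated with a minimal sketch. It is not the library's actual implementation: the dictionary contents, the function names (perturb, toy_model, run_bias_test), and the keyword-based classifier are all hypothetical stand-ins chosen for illustration.

```python
import random
import re

# Hypothetical name dictionary (assumption); a real one would be much larger
# and could be extended with user-supplied entries.
NAME_DICT = {
    "first_names": ["george", "maria", "john"],
    "jain_names": ["Sudharma", "Chandana"],
}


def perturb(text, source_names, target_names, rng):
    """Replace each whole-word source name with a randomly chosen target name."""
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, source_names)) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda match: rng.choice(target_names), text)


def toy_model(text):
    """Stand-in sentiment classifier (assumption): a simple keyword lookup."""
    negative_cues = {"awry", "bad", "awful"}
    tokens = set(text.lower().split())
    return "NEGATIVE" if tokens & negative_cues else "POSITIVE"


def run_bias_test(original, source_names, target_names, model, seed=0):
    """Perturb the input, run the model on both versions, and compare labels."""
    rng = random.Random(seed)
    test_case = perturb(original, source_names, target_names, rng)
    expected_result = model(original)   # prediction on the original text
    actual_result = model(test_case)    # prediction on the perturbed text
    return {
        "original": original,
        "test_case": test_case,
        "expected_result": expected_result,
        "actual_result": actual_result,
        "pass": expected_result == actual_result,
    }


result = run_bias_test(
    "george , hire a real director and good writers for the next installment , please .",
    NAME_DICT["first_names"],
    NAME_DICT["jain_names"],
    toy_model,
)
print(result)
```

Because the toy classifier ignores names entirely, the test passes here. A model whose prediction flips when only the name changes would produce a row with pass set to False, which is the signal this kind of test is meant to surface.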