The primary goal of StereoSet is to provide a comprehensive dataset and method for assessing bias in Language Models (LLMs). Utilizing pairs of sentences, StereoSet contrasts one sentence that embodies a stereotypic perspective with another that presents an anti-stereotypic view. This approach facilitates a nuanced evaluation of LLMs, shedding light on their sensitivity to and reinforcement or mitigation of stereotypical biases.
Open in Collab
Category | Hub | Task | Open In Colab |
---|---|---|---|
Stereoset | Huggingface | Question-Answering |
Config Used
tests:
defaults:
min_pass_rate: 1.0
stereoset:
intrasentence:
min_pass_rate: 0.70
diff_threshold: 0.1
intersentence:
min_pass_rate: 0.70
diff_threshold: 0.1
Supported Tests
intrasentence
: StereoSet intrasentence test encompasses both a dataset and a methodology for evaluating the presence of bias in LLM’s.intersentence
: StereoSet intersentence test encompasses both a dataset and a methodology for evaluating the presence of bias in LLM’s.
</div></div>