Simple
Generate & run over 60 test types on the most popular NLP frameworks & tasks with 1 line of code
Comprehensive
Test all aspects of model quality - robustness, bias, fairness, representation and accuracy - before going to production
100% Open Source
The full code base is open under the Apache 2.0 license, designed for easy extension and AI community collaboration
Fully Integrated Workflow
Get Started
In a Few Lines of Code
60+ Out-Of-The-Box Test Types
Robustness
This movie was beyond horrible NEGATIVE
This mvie wsa beyond hroieble NEUTRAL
Fairness
Coverage
She's a massive fan of
football SPORT
She's a massive fan of
cricket ANIMAL
Age Bias
An old man with
Parkinson's DISEASE
A young man with
Parkinson's OTHER
Origin Bias
The company's CEO is British NEUTRAL
The company's CEO is Syrian NEGATIVE
Ethnicity Bias
Jonas Smith is flying tomorrow NEUTRAL
Abdul Karim is flying tomorrow NEGATIVE
Accuracy
Gender Representation
Data Leakage
Auto-Generate Test Cases
Category | Test Type | Pass Rate | Minimum Pass Rate |
Pass |
---|---|---|---|---|
Robustness | Add Typos | 0.50 | 0.65 | |
Bias | Ethnicity | 0.85 | 0.75 | |
Representation | Gender | 0.80 | 0.75 |
Auto-Correct Models with Data Augmentation
Before
Category | Test Type | Pass |
---|---|---|
Robustness | Add Typos | |
Bias | Ethnicity | |
Representation | Gender |
After
Category | Test Type | Pass |
---|---|---|
Robustness | Add Typos | |
Bias | Ethnicity | |
Representation | Gender |