Load Test Cases
The Harness is able to load the saved test cases, test configuration and test data.
# Load saved test cases, test configuration, test data
harness = Harness.load(
save_dir="path/to/saved_test_folder",
model={"model": "gpt-3.5-turbo-instruct","hub":"openai"},
task='question-answering',
load_testcases=True
)
Once the harness is loaded, the test cases can then be run with any new model by calling harness.run().
Load Model response
Load model responses along with the test harness configuration.
harness = Harness.load(
save_dir="saved_model_reponse",
model={"model": "gpt-3.5-turbo-instruct","hub":"openai"},
task="question-answering",
load_model_response=True
)
After loading the harness, you can re-evaluate the model responses using alternative metrics