QA & Summarization Notebook

Overview

In the QA/Summarization notebook section, our primary focus is on the evaluation of OpenAI, Ai21, Cohere, Asure-OpenAi models tailored for Question Answering (QA) and Summarization tasks. This involves a meticulous testing process to gauge the models’ efficiency in responding to questions and producing concise and informative summaries. In a noteworthy addition, we’re introducing perturbations to the text during these tasks. By incorporating variations in the input, such as adding perturbations, we aim to observe how the model responses adapt to changes in the text.

Config Used

tests:
  defaults:
    min_pass_rate: 1.0

  robustness:
    add_typo:
      min_pass_rate: 0.70
    lowercase:
      min_pass_rate: 0.70

OpenAi Question-Answering & Summarization

In this notebook we are testing OpenAI gpt-3.5-turbo-instruct Model For Question Answering and Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
OpenAI QA/Summarization	OpenAI	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`

AI21 Question-Answering & Summarization

In this notebook we are testing AI21 j2-jumbo-instruct Model For Question Answering and Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
Question-Answering & Summarization	AI21	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`

Cohere Question-Answering & Summarization

In this notebook we are testing Cohere command-xlarge-nightly Model For Question Answering and Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
Question-Answering & Summarization	Cohere	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`

Hugging Face Inference API Question-Answering & Summarization

In this notebook we are testing Hugging Face Inference API google/flan-t5-small Model For Question Answering and google/pegasus-newsroom for Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
Question-Answering & Summarization	Hugging Face Inference API	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`

Hugging Face Hub Question-Answering & Summarization

In this notebook we are testing Hugging Face Hub facebook/opt-1.3b Model For Question Answering and Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
Question-Answering & Summarization	Hugging Face Hub	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`

Azure-OpenAI Question-Answering & Summarization

In this notebook we are testing Azure-OpenAI gpt-3.5-turbo-instruct Model For Question Answering and Summarization task.

Open in Collab

Category	Hub	Task	Datset Used	Open In Colab
Question-Answering & Summarization	Azure-OpenAI	Question-Answering/Summarization	`BoolQ`, `NQ-Open`, `Xsum`