Source: What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams

MedQA is a benchmark dataset for multiple-choice question answering based on the United States Medical Licensing Examination (USMLE). The questions are collected from professional medical board exams.

You can see which subsets and splits are available below.

| Split | Details |
|---|---|
| test | Test set from the MedQA dataset, containing 1273 question-answer examples. |
| test-tiny | Truncated version of the MedQA test set, containing 50 question-answer examples. |
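
As a rough illustration of how the two splits relate, the sketch below builds a 50-example truncated split from a 1273-example list. The `MultipleChoiceExample` record, its field names, and the `truncate_split` helper are hypothetical, and taking the first 50 examples is an assumption; the source only states that `test-tiny` is a truncated version of `test`. The records are synthetic placeholders, not real MedQA questions.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class MultipleChoiceExample:
    """Hypothetical record layout for one multiple-choice question."""
    question: str
    options: Dict[str, str]  # option label -> option text
    answer: str              # label of the correct option


def truncate_split(examples: List[MultipleChoiceExample],
                   size: int = 50) -> List[MultipleChoiceExample]:
    """Return the first `size` examples (assumed truncation rule)."""
    return examples[:size]


# Synthetic placeholders standing in for the 1273 test examples.
test = [
    MultipleChoiceExample(
        question=f"placeholder question {i}",
        options={"A": "placeholder option", "B": "placeholder option"},
        answer="A",
    )
    for i in range(1273)
]

test_tiny = truncate_split(test)
print(len(test), len(test_tiny))
```

A small truncated split like this is typically useful for quick smoke tests of an evaluation pipeline before running the full test set.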
