The primary goal of the Legal Test is to assess a model’s capacity to determine the case (legal conclusion) that supports a given legal claim presented in a text passage. This evaluation aims to gauge the model’s proficiency in legal reasoning and comprehension.

How it works:

test_type case legal_claim legal_conclusion_A legal_conclusion_B correct_conlusion model_conclusion pass
legal-support See United States v. Franik, 687 F.3d 988, 990 (8th Cir.2012) (where defendant does not raise procedural error, court bypasses review and only reviews substantive reasonableness of sentence for abuse of discretion);……. Upon careful review, we conclude that the district court did not abuse its discretion in sentencing Trice. where defendant does not raise procedural error, court bypasses review and only reviews substantive reasonableness of sentence for abuse of discretion where district court varied downward from Guidelines range, it was ““nearly inconceivable”” that court abused its discretion in not varying downward further a a True

Evaluation Criteria

  • In the evaluation we doing the string comparison between correct_conlusion and model_conclusion, Based on that we are calculating the pass.