Thursday, January 5, 2023
Blackman: GPT Will Soon Be Able To Pass The Multistate Bar Exam
Following up on my previous posts (links below): Josh Blackman (South Texas; Google Scholar), Can GPT Pass the Multistate Bar Exam?:
My frequent co-authors, Mike Bommarito and Dan Katz utilized a different software tool from OpenAI, known as GPT-3.5, to answer the multiple choice questions on the Multistate Bar Examination (MBE). If there are four choices, the "baseline guessing rate" would be 25%. With no specific training, GPT scored an overall accuracy rate of 50.3%. That's better than what many law school graduates can achieve. And in particular, GPT reached the average passing rate for two topics: Evidence and Torts. (I'll let Evidence or Torts scholars speculate about why those topics may be easier for AI.) Here is a summary of the results from their paper:
The table and figure clearly show that GPT-3.5 is not yet passing the overall multiple choice exam. However, GPT-3.5 is significantly exceeding the baseline random chance rate of 25%. Furthermore, GPT-3.5 has reached the average passing rate for at least two categories, Evidence and Torts.
On average across all categories, GPT-3.5 is trailing human test-takers by approximately 17%. In the case of Evidence, Torts, and Civil Procedure, this gap is negligible or in the single digits; at 1.5 times the standard error of the mean across our test runs, GPT-3.5 is already at parity with humans for Evidence questions. However, for the remaining categories of Constitutional Law, Real Property, Contracts, and Criminal Law, the gap is much more material, rising as high as 36% in the case of Criminal Law. ...
Overall, we find that GPT-3.5 significantly exceeds our expectations for performance on this task. Despite thousands of hours on related tasks over the last two decades between the authors, we did not expect GPT-3.5 to demonstrate such proficiency in a zero-shot settings with minimal modeling and optimization effort. While our ability to interpret how or why GPT-3.5 chooses between candidate answers is limited by understanding of LLMs and the proprietary nature of GPT, the history of similar problems strongly suggests that an LLM may soon pass the Bar. Based on anecdotal evidence related to GPT-4 and Bloom family of models, it is quite possible that this will occur within the next 0-18 months. ...
Worried yet?
Prior TaxProf Blog coverage:
- A Human Being Wrote This Law Review Article: GPT-3 And The Practice Of Law (May 11, 2022)
- The Implications Of OpenAI’s Assistant For Legal Services And Society (Dec. 7, 2022)
- ChatGPT And Law School Exams (Dec. 29, 2022)
- GPT Will Soon Be Able To Pass The Multistate Bar Exam (Jan. 5, 2023)
- Using ChatGPT To Write Law School Exams, Bar Exams, And Strategic Plans (Jan. 11, 2023)
- ChatGPT Gets B|B- Grade On Wharton MBA Exam (Jan. 24, 2023)
- ChatGPT Gets C+ Grade On Four Minnesota Law School Exams (C- In Tax) (Jan. 24, 2023)
- The Rise Of The Robotic Tax Analyst (Jan. 27, 2023)
- Ryznar: Exams In The Time Of ChatGPT (Feb. 1, 2023)
- Bishop Posts Two Papers On ChatGPT (Feb. 8, 2023)
- ChatGPT Almost Passed The Bar, But Competent Lawyers Do Much More (Feb. 23, 2023)
- It’s Not Just Our Students: ChatGPT Is Coming For Faculty Scholarship (Feb. 25, 2023)
- New AI Detector Is 97% Effective In Catching Students Cheating With ChatGPT (Feb. 28, 2023)
- It’s Not Just Our Students: ChatGPT Is Coming For Faculty Scholarship (Feb. 25, 2023)
- ChatGPT's Tax Advice Was Wrong 100% Of The Time (Mar. 7, 2023)
- Does ChatGPT Produce Fishy Briefs? (Mar. 8, 2023)
- Colleges (And Law Schools) Are Rushing To Respond To ChatGPT (Mar. 9, 2023)
- Was The Sermon You Heard At Church Today Written By ChatGPT? (Mar. 12, 2023)
- GPT-4 Beats 90% Of Aspiring Lawyers On The Bar Exam (Mar. 17, 2023)
- ChatGPT Thinks I Am Way More Interesting Than I Am (Mar. 22, 2023)
- Should ChatGPT Be In Law School? (Mar. 30, 2023)
- Merritt: GPT-4 On Legal Education And Lawyer Licensing (Apr. 4, 2023)
- Turnitin Plagiarism Detector Will Catch Students Who Cheat With ChatGPT With 98% Accuracy (Apr. 5, 2023)
- ChatGPT Gets 148 (37th Percentile) And 157 (70th Percentile) On The LSAT (Apr. 6, 2023)
- AI Tools for Lawyers: A Practical Guide (Apr. 6, 2023)
https://taxprof.typepad.com/taxprof_blog/2023/01/gpt-will-soon-be-able-to-pass-the-multistate-bar-exam.html