Better Call GPT, Comparing Large Language Models Against Lawyers
This signals a potential disruption in the legal industry by enhancing accessibility and efficiency, though it is incremental in applying existing LLMs to a new domain.
The paper compared Large Language Models (LLMs) to human legal contract reviewers, finding that LLMs matched or exceeded human accuracy in identifying legal issues, completed reviews in seconds versus hours, and reduced costs by 99.97%.
This paper presents a groundbreaking comparison between Large Language Models and traditional legal contract reviewers, Junior Lawyers and Legal Process Outsourcers. We dissect whether LLMs can outperform humans in accuracy, speed, and cost efficiency during contract review. Our empirical analysis benchmarks LLMs against a ground truth set by Senior Lawyers, uncovering that advanced models match or exceed human accuracy in determining legal issues. In speed, LLMs complete reviews in mere seconds, eclipsing the hours required by their human counterparts. Cost wise, LLMs operate at a fraction of the price, offering a staggering 99.97 percent reduction in cost over traditional methods. These results are not just statistics, they signal a seismic shift in legal practice. LLMs stand poised to disrupt the legal industry, enhancing accessibility and efficiency of legal services. Our research asserts that the era of LLM dominance in legal contract review is upon us, challenging the status quo and calling for a reimagined future of legal workflows.