CLAIDec 2, 2022

Legal Prompting: Teaching a Language Model to Think Like a Lawyer

arXiv:2212.01326v290 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses the problem of improving legal reasoning in AI for legal professionals, but it is incremental as it builds on existing prompt engineering methods.

The paper tackled legal reasoning tasks by applying prompt engineering techniques to large language models, specifically testing zero-shot/few-shot and fine-tuning approaches on the COLIEE entailment task based on the Japanese Bar exam. It improved accuracy from 0.7037 to 0.8148, beating the 2022 best system of 0.6789 with 0.7431 accuracy.

Large language models that are capable of zero or few-shot prompting approaches have given rise to the new research area of prompt engineering. Recent advances showed that for example Chain-of-Thought (CoT) prompts can improve arithmetic or common sense tasks significantly. We explore how such approaches fare with legal reasoning tasks and take the COLIEE entailment task based on the Japanese Bar exam for testing zero-shot/few-shot and fine-tuning approaches. Our findings show that while CoT prompting and fine-tuning with explanations approaches show improvements, the best results are produced by prompts that are derived from specific legal reasoning techniques such as IRAC (Issue, Rule, Application, Conclusion). Based on our experiments we improve the 2021 best result from 0.7037 accuracy to 0.8148 accuracy and beat the 2022 best system of 0.6789 accuracy with an accuracy of 0.7431.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes