CLAug 8, 2023

Large Language Model Prompt Chaining for Long Legal Document Classification

arXiv:2308.04138v13.320 citationsh-index: 10

Originality Incremental advance

AI Analysis

This addresses the challenge of legal document classification for practitioners, but it is incremental as it builds on existing prompting and chaining techniques.

The study tackled the problem of classifying long legal documents with domain-specific language by using prompt chaining to decompose the task into summarization, semantic search, and labeling steps, resulting in enhanced performance over zero-shot methods and surpassing ChatGPT's micro-F1 score with smaller models.

Prompting is used to guide or steer a language model in generating an appropriate response that is consistent with the desired outcome. Chaining is a strategy used to decompose complex tasks into smaller, manageable components. In this study, we utilize prompt chaining for extensive legal document classification tasks, which present difficulties due to their intricate domain-specific language and considerable length. Our approach begins with the creation of a concise summary of the original document, followed by a semantic search for related exemplar texts and their corresponding annotations from a training corpus. Finally, we prompt for a label - based on the task - to assign, by leveraging the in-context learning from the few-shot prompt. We demonstrate that through prompt chaining, we can not only enhance the performance over zero-shot, but also surpass the micro-F1 score achieved by larger models, such as ChatGPT zero-shot, using smaller models.

View on arXiv PDF

Similar