CLOct 12, 2024

LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English

T. Y. S. S. Santosh, Cornelius Weiss, Matthias Grabmair

arXiv:2410.09527v114.626 citationsh-index: 13Has CodeNLLP

Originality Incremental advance

AI Analysis

This work addresses the lack of generative task benchmarks in Legal NLP, providing a resource for researchers and practitioners in the legal domain.

The authors introduced LexSumm, a benchmark for legal summarization in English comprising eight datasets from multiple jurisdictions, and LexT5, a legal-oriented sequence-to-sequence model, which they evaluated through zero-shot probing and fine-tuning, revealing abstraction and faithfulness errors in summaries.

In the evolving NLP landscape, benchmarks serve as yardsticks for gauging progress. However, existing Legal NLP benchmarks only focus on predictive tasks, overlooking generative tasks. This work curates LexSumm, a benchmark designed for evaluating legal summarization tasks in English. It comprises eight English legal summarization datasets, from diverse jurisdictions, such as the US, UK, EU and India. Additionally, we release LexT5, legal oriented sequence-to-sequence model, addressing the limitation of the existing BERT-style encoder-only models in the legal domain. We assess its capabilities through zero-shot probing on LegalLAMA and fine-tuning on LexSumm. Our analysis reveals abstraction and faithfulness errors even in summaries generated by zero-shot LLMs, indicating opportunities for further improvements. LexSumm benchmark and LexT5 model are available at https://github.com/TUMLegalTech/LexSumm-LexT5.

View on arXiv PDF Code

Similar