Improving Legal Entity Recognition Using a Hybrid Transformer Model and Semantic Filtering Approach
This improves automation for legal workflows like contract analysis, though it is incremental as it builds on existing transformer models.
The paper tackled the problem of Legal Entity Recognition (LER) in legal documents by proposing a hybrid model combining Legal-BERT with semantic filtering, achieving an F1 score of 93.4% on a dataset of 15,000 documents.
Legal Entity Recognition (LER) is critical in automating legal workflows such as contract analysis, compliance monitoring, and litigation support. Existing approaches, including rule-based systems and classical machine learning models, struggle with the complexity of legal documents and domain specificity, particularly in handling ambiguities and nested entity structures. This paper proposes a novel hybrid model that enhances the accuracy and precision of Legal-BERT, a transformer model fine-tuned for legal text processing, by introducing a semantic similarity-based filtering mechanism. We evaluate the model on a dataset of 15,000 annotated legal documents, achieving an F1 score of 93.4%, demonstrating significant improvements in precision and recall over previous methods.