CLOct 22, 2018

Linguistic Legal Concept Extraction in Portuguese

arXiv:1810.09379v1
Originality Synthesis-oriented
AI Analysis

This work incrementally improves natural language processing tools for legal professionals in Portuguese-speaking contexts by expanding a domain-specific knowledge base.

The study addressed the extraction of legal concepts in Portuguese by identifying missing terms from the OpenWordNet-PT knowledge base using a corpus of Bar exam questions and related norms, resulting in an enhanced lexical representation of legal texts.

This work investigates legal concepts and their expression in Portuguese, concentrating on the "Order of Attorneys of Brazil" Bar exam. Using a corpus formed by a collection of multiple-choice questions, three norms related to the Ethics part of the OAB exam, language resources (Princeton WordNet and OpenWordNet-PT) and tools (AntConc and Freeling), we began to investigate the concepts and words missing from our repertory of concepts and words in Portuguese, the knowledge base OpenWordNet-PT. We add these concepts and words to OpenWordNet-PT and hence obtain a representation of these texts that is "contained" in the lexical knowledge base.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes