CLAug 5, 2019

Processamento de linguagem natural em Português e aprendizagem profunda para o domínio de Óleo e Gás

arXiv:1908.01674v22 citations
AI Analysis

It tackles the problem of limited NLP resources for Portuguese in the Oil and Gas industry, but is incremental as it only provides a literature review.

This paper reviews deep learning NLP techniques and their applications for the Oil and Gas domain in Portuguese, addressing the challenge of processing technical vocabulary with scarce public corpora.

Over the last few decades, institutions around the world have been challenged to deal with the sheer volume of information captured in unstructured formats, especially in textual documents. The so called Digital Transformation age, characterized by important technological advances and the advent of disruptive methods in Artificial Intelligence, offers opportunities to make better use of this information. Recent techniques in Natural Language Processing (NLP) with Deep Learning approaches allow to efficiently process a large volume of data in order to obtain relevant information, to identify patterns, classify text, among other applications. In this context, the highly technical vocabulary of Oil and Gas (O&G) domain represents a challenge for these NLP algorithms, in which terms can assume a very different meaning in relation to common sense understanding. The search for suitable mathematical representations and specific models requires a large amount of representative corpora in the O&G domain. However, public access to this material is scarce in the scientific literature, especially considering the Portuguese language. This paper presents a literature review about the main techniques for deep learning NLP and their major applications for O&G domain in Portuguese.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes