CVOct 7, 2022

Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections

arXiv:2210.03453v1581 citationsh-index: 20
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of accurate information extraction in purchase documents for businesses, but it is incremental as it builds upon existing deep learning techniques with rule-based enhancements.

The paper tackled the problem of extracting key information from purchase documents by combining deep learning with rule-based corrections, resulting in improved performance over baseline deep learning methods as demonstrated on public and NielsenIQ datasets.

Deep Learning (DL) is dominating the fields of Natural Language Processing (NLP) and Computer Vision (CV) in the recent times. However, DL commonly relies on the availability of large data annotations, so other alternative or complementary pattern-based techniques can help to improve results. In this paper, we build upon Key Information Extraction (KIE) in purchase documents using both DL and rule-based corrections. Our system initially trusts on Optical Character Recognition (OCR) and text understanding based on entity tagging to identify purchase facts of interest (e.g., product codes, descriptions, quantities, or prices). These facts are then linked to a same product group, which is recognized by means of line detection and some grouping heuristics. Once these DL approaches are processed, we contribute several mechanisms consisting of rule-based corrections for improving the baseline DL predictions. We prove the enhancements provided by these rule-based corrections over the baseline DL results in the presented experiments for purchase documents from public and NielsenIQ datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes