CVAIIRFeb 19, 2025

RAPTOR: Refined Approach for Product Table Object Recognition

arXiv:2502.14918v21 citationsh-index: 302025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)
Originality Incremental advance
AI Analysis

This work addresses table extraction challenges for industries using business documents, but it is incremental as it builds on existing DETR-based models with post-processing refinements.

The paper tackles the problem of extracting tables from documents, particularly product tables, by introducing RAPTOR, a modular post-processing system that enhances existing models, resulting in improved precision and structural predictions across various datasets.

Extracting tables from documents is a critical task across various industries, especially on business documents like invoices and reports. Existing systems based on DEtection TRansformer (DETR) such as TAble TRansformer (TATR), offer solutions for Table Detection (TD) and Table Structure Recognition (TSR) but face challenges with diverse table formats and common errors like incorrect area detection and overlapping columns. This research introduces RAPTOR, a modular post-processing system designed to enhance state-of-the-art models for improved table extraction, particularly for product tables. RAPTOR addresses recurrent TD and TSR issues, improving both precision and structural predictions. For TD, we use DETR (trained on ICDAR 2019) and TATR (trained on PubTables-1M and FinTabNet), while TSR only relies on TATR. A Genetic Algorithm is incorporated to optimize RAPTOR's module parameters, using a private dataset of product tables to align with industrial needs. We evaluate our method on two private datasets of product tables, the public DOCILE dataset (which contains tables similar to our target product tables), and the ICDAR 2013 and ICDAR 2019 datasets. The results demonstrate that while our approach excels at product tables, it also maintains reasonable performance across diverse table formats. An ablation study further validates the contribution of each module in our system.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes