CVSep 20, 2019

Document Rectification and Illumination Correction using a Patch-based CNN

arXiv:1909.09470v189 citations
Originality Incremental advance
AI Analysis

This work addresses document image preprocessing for improved readability and OCR performance, presenting an incremental advancement over prior learning-based methods.

The paper tackles the problem of rectifying distorted document images and correcting uneven illumination from a single input, achieving significant improvements in OCR accuracy on synthetic and real datasets.

We propose a novel learning method to rectify document images with various distortion types from a single input image. As opposed to previous learning-based methods, our approach seeks to first learn the distortion flow on input image patches rather than the entire image. We then present a robust technique to stitch the patch results into the rectified document by processing in the gradient domain. Furthermore, we propose a second network to correct the uneven illumination, further improving the readability and OCR accuracy. Due to the less complex distortion present on the smaller image patches, our patch-based approach followed by stitching and illumination correction can significantly improve the overall accuracy in both the synthetic and real datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes