CVSep 20, 2019

Document Rectification and Illumination Correction using a Patch-based CNN

Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander

arXiv:1909.09470v113.689 citations

Originality Incremental advance

AI Analysis

This work addresses document image preprocessing for improved readability and OCR performance, presenting an incremental advancement over prior learning-based methods.

The paper tackles the problem of rectifying distorted document images and correcting uneven illumination from a single input, achieving significant improvements in OCR accuracy on synthetic and real datasets.

We propose a novel learning method to rectify document images with various distortion types from a single input image. As opposed to previous learning-based methods, our approach seeks to first learn the distortion flow on input image patches rather than the entire image. We then present a robust technique to stitch the patch results into the rectified document by processing in the gradient domain. Furthermore, we propose a second network to correct the uneven illumination, further improving the readability and OCR accuracy. Due to the less complex distortion present on the smaller image patches, our patch-based approach followed by stitching and illumination correction can significantly improve the overall accuracy in both the synthetic and real datasets.

View on arXiv PDF

Similar