CVMar 20, 2022

Document Dewarping with Control Points

arXiv:2203.10543v136 citationsh-index: 68Has Code
Originality Incremental advance
AI Analysis

This addresses the issue of poor OCR performance due to document distortion for users of mobile devices, representing an incremental improvement with controllable points for flexibility.

The paper tackles the problem of geometric distortion in document images captured by handheld devices, proposing a method that estimates control and reference points to rectify images and achieves state-of-the-art performance on a real-world dataset.

Document images are now widely captured by handheld devices such as mobile phones. The OCR performance on these images are largely affected due to geometric distortion of the document paper, diverse camera positions and complex backgrounds. In this paper, we propose a simple yet effective approach to rectify distorted document image by estimating control points and reference points. After that, we use interpolation method between control points and reference points to convert sparse mappings to backward mapping, and remap the original distorted document image to the rectified image. Furthermore, control points are controllable to facilitate interaction or subsequent adjustment. We can flexibly select post-processing methods and the number of vertices according to different application scenarios. Experiments show that our approach can rectify document images with various distortion types, and yield state-of-the-art performance on real-world dataset. This paper also provides a training dataset based on control points for document dewarping. Both the code and the dataset are released at https://github.com/gwxie/Document-Dewarping-with-Control-Points.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes