CV · AI · Sep 9, 2019

HoughNet: neural network architecture for vanishing points detection

arXiv:1909.03812v2 · 33 citations
Originality: Highly original
AI Analysis

This work addresses document image distortion in camera shots taken under uncontrolled conditions. Its contribution is incremental in the sense that it builds on existing datasets and benchmarks.

The authors tackled the problem of vanishing point detection in document images under uncontrolled conditions by introducing HoughNet, a novel neural network architecture based on a Fast Hough Transform layer. Their method outperformed state-of-the-art results on the ICDAR 2011 dewarping contest, as measured by correctly recognized words using Tesseract OCR.

In this paper we introduce a novel neural network architecture based on a Fast Hough Transform layer. A layer of this type allows our neural network to accumulate features from linear areas across the entire image instead of local areas. We demonstrate its potential by solving the problem of vanishing point detection in images of documents. This problem arises when dealing with camera shots of documents taken in uncontrolled conditions. In this case, the document image can suffer from several specific distortions, including a projective transform. To train our model, we use the MIDV-500 dataset and provide testing results. The strong generalization ability of the suggested method is demonstrated by applying it to a completely different dataset, that of the ICDAR 2011 dewarping contest. In previously published papers that considered this dataset, the authors measured the quality of vanishing point detection by counting the words correctly recognized by the open OCR engine Tesseract. To compare with them, we reproduce this experiment and show that our method outperforms the state-of-the-art result.
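To make the idea of accumulating features along lines concrete, here is a minimal sketch in Python with NumPy. It is not the Fast Hough Transform and not the authors' layer: it is a brute-force accumulator that sums pixel values along every mostly-horizontal discrete line. The function name `line_sums`, the (shift, intercept) parametrization, and the cyclic wrap-around of rows are assumptions made for illustration only.

```python
import numpy as np


def line_sums(image):
    """Brute-force Hough-style accumulator (illustrative, not the FHT).

    out[s, y0] is the sum of pixels along the discrete line that starts
    at row y0 in the first column and has moved down by s rows at the
    last column. Rows wrap around cyclically (an assumption of this
    sketch).
    """
    h, w = image.shape
    cols = np.arange(w)
    out = np.zeros((w, h), dtype=float)
    for s in range(w):
        # Rounded row offset of the line at each column for total shift s.
        offsets = np.rint(cols * s / max(w - 1, 1)).astype(int)
        for y0 in range(h):
            rows = (y0 + offsets) % h
            # Each output cell aggregates one pixel from every column,
            # so it depends on the whole image width, not a local patch.
            out[s, y0] = image[rows, cols].sum()
    return out


# A bright main diagonal produces a single peak at shift 3, intercept 0.
img = np.eye(4)
acc = line_sums(img)
peak = np.unravel_index(acc.argmax(), acc.shape)
```

Each cell of the accumulator collects evidence from a full line of the input, which is the property the abstract attributes to the Hough layer. A fast variant would compute the same kind of sums by reusing partial sums over shorter segments instead of looping over every line.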

