CVNov 12, 2022

Variational Augmentation for Enhancing Historical Document Image Binarization

arXiv:2211.06581v13 citationsh-index: 45
Originality Incremental advance
AI Analysis

This addresses the lack of large training datasets for degraded historical documents, but it is incremental as it builds on existing deep learning methods.

The paper tackled the problem of historical document image binarization by proposing a two-stage framework that uses variational inference to generate degraded samples and a CNN-based network for binarization, achieving competitive results on DIBCO datasets.

Historical Document Image Binarization is a well-known segmentation problem in image processing. Despite ubiquity, traditional thresholding algorithms achieved limited success on severely degraded document images. With the advent of deep learning, several segmentation models were proposed that made significant progress in the field but were limited by the unavailability of large training datasets. To mitigate this problem, we have proposed a novel two-stage framework -- the first of which comprises a generator that generates degraded samples using variational inference and the second being a CNN-based binarization network that trains on the generated data. We evaluated our framework on a range of DIBCO datasets, where it achieved competitive results against previous state-of-the-art methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes