CVDec 16, 2024

Predicting the Original Appearance of Damaged Historical Documents

arXiv:2412.11634v18 citationsh-index: 16Has CodeAAAI
Originality Incremental advance
AI Analysis

This work addresses the preservation of cultural heritage by enabling the restoration of damaged historical documents, though it appears incremental as it builds on existing diffusion models.

The authors tackled the problem of repairing damaged historical documents by introducing a new task called Historical Document Repair (HDR), proposing a dataset HDR28K and a diffusion-based network DiffHDR, which significantly outperforms existing methods in handling real damaged documents.

Historical documents encompass a wealth of cultural treasures but suffer from severe damages including character missing, paper damage, and ink erosion over time. However, existing document processing methods primarily focus on binarization, enhancement, etc., neglecting the repair of these damages. To this end, we present a new task, termed Historical Document Repair (HDR), which aims to predict the original appearance of damaged historical documents. To fill the gap in this field, we propose a large-scale dataset HDR28K and a diffusion-based network DiffHDR for historical document repair. Specifically, HDR28K contains 28,552 damaged-repaired image pairs with character-level annotations and multi-style degradations. Moreover, DiffHDR augments the vanilla diffusion framework with semantic and spatial information and a meticulously designed character perceptual loss for contextual and visual coherence. Experimental results demonstrate that the proposed DiffHDR trained using HDR28K significantly surpasses existing approaches and exhibits remarkable performance in handling real damaged documents. Notably, DiffHDR can also be extended to document editing and text block generation, showcasing its high flexibility and generalization capacity. We believe this study could pioneer a new direction of document processing and contribute to the inheritance of invaluable cultures and civilizations. The dataset and code is available at https://github.com/yeungchenwa/HDR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes