HCJun 2

ReforMe: Re-Shaping Documents with Contextual Prompting and Layout-Aware Propagation

arXiv:2606.0326642.4
Predicted impact top 45% in HC · last 90 daysOriginality Incremental advance
AI Analysis

For users digitizing complex documents with handwritten content and irregular layouts, the system offers a more efficient and controllable digitization workflow.

The paper presents an interactive document digitization system combining layout-aware parsing, OCR, and LLM-based reconstruction with user-driven refinement, including a layout-aware propagation mechanism. A user study (n=12) shows improved correction efficiency and reduced repetitive effort.

Digitizing complex documents with handwritten content, irregular tables, and heterogeneous layouts remains challenging, as traditional Optical Character Recognition (OCR) systems fail to capture writing nuances, author-specific conventions, and document structure, and recent LLM-based approaches lack mechanisms for precise, scalable correction. We present an interactive document digitization system that integrates layout-aware parsing, OCR, and LLM-based reconstruction with user-driven refinement. The system is informed by a formative study that identifies key challenges and interaction needs in real-world digitization workflows. It supports both direct edits and natural-language instructions, and introduces a layout-aware propagation mechanism that generalizes user corrections across structurally similar regions. This enables not only efficient error correction but also document re-shaping into structured, analyzable representations. We evaluate the system through a within-subjects user study (n=12) on real-world documents. Results show improved correction efficiency and reduced repetitive effort, demonstrating more effective and controllable document digitization procedure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes