CLOct 8, 2019

An Interactive Machine Translation Framework for Modernizing Historical Documents

arXiv:1910.03355v1
AI Analysis

This addresses the accessibility issue of historical documents for scholars by reducing the effort required for modernization, though it appears incremental as it builds on existing modernization approaches.

The paper tackles the problem of modernizing historical documents by proposing an interactive machine translation framework that allows scholars to collaborate with the machine, achieving significant reductions in human effort needed to produce modernized versions in simulated tests.

Due to the nature of human language, historical documents are hard to comprehend by contemporary people. This limits their accessibility to scholars specialized in the time period in which the documents were written. Modernization aims at breaking this language barrier by generating a new version of a historical document, written in the modern version of the document's original language. However, while it is able to increase the document's comprehension, modernization is still far from producing an error-free version. In this work, we propose a collaborative framework in which a scholar can work together with the machine to generate the new version. We tested our approach on a simulated environment, achieving significant reductions of the human effort needed to produce the modernized version of the document.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes