CLAILGJan 18, 2022

Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration

arXiv:2201.06757v1584 citations
AI Analysis

This work addresses the problem of diacritics restoration for users in English-dominated online environments, offering an incremental improvement with a small-footprint, browser-compatible solution.

The paper tackles diacritics restoration for Latin-alphabet languages, particularly Hungarian, by proposing a lightweight 1D dilated convolutional neural network that operates at the character level, achieving competitive performance with similarly sized and larger models while enabling local web browser execution.

Diacritics restoration has become a ubiquitous task in the Latin-alphabet-based English-dominated Internet language environment. In this paper, we describe a small footprint 1D dilated convolution-based approach which operates on a character-level. We find that solutions based on 1D dilated convolutional neural networks are competitive alternatives to models based on recursive neural networks or linguistic modeling for the task of diacritics restoration. Our solution surpasses the performance of similarly sized models and is also competitive with larger models. A special feature of our solution is that it even runs locally in a web browser. We also provide a working example of this browser-based implementation. Our model is evaluated on different corpora, with emphasis on the Hungarian language. We performed comparative measurements about the generalization power of the model in relation to three Hungarian corpora. We also analyzed the errors to understand the limitation of corpus-based self-supervised training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes