CVAug 22, 2018

A syllable based model for handwriting recognition

arXiv:1808.07277v12 citations
Originality Incremental advance
AI Analysis

This work addresses handwriting recognition for French and English users, presenting an incremental improvement by using syllable models instead of lexicon or character n-grams.

The paper tackles handwriting recognition by introducing a syllable-based modeling approach for French and English, achieving interesting performances on the RIMES and IAM datasets by covering out-of-vocabulary words with a limited syllable set and statistical n-grams.

In this paper, we introduce a new modeling approach of texts for handwriting recognition based on syllables. We propose a supervised syllabification approach for the French and English languages for building a vocabulary of syllables. Statistical n-gram language models of syllables are trained on French and English Wikipedia corpora. The handwriting recognition system, based on optical HMM context independent character models, performs a two pass decoding, integrating the proposed syllabic models. Evaluation is carried out on the French RIMES dataset and English IAM dataset by analyzing the performance for various coverage of the syllable models. We also compare the syllable models with lexicon and character n-gram models. The proposed approach reaches interesting performances thanks to its capacity to cover a large amount of out of vocabulary words working with a limited amount of syllables combined with statistical n-gram of reasonable order.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes