CVFeb 25, 2022

Improving Amharic Handwritten Word Recognition Using Auxiliary Task

arXiv:2202.12687v12 citations
Originality Incremental advance
AI Analysis

This addresses the limited research in Amharic OCR, specifically for handwritten text, which is important for preserving and processing literature in Ethiopia.

The paper tackles Amharic handwritten word recognition by using deep learning with an auxiliary task based on row-wise similarities of the Amharic alphabet, resulting in significant recognition improvement over a baseline method.

Amharic is one of the official languages of the Federal Democratic Republic of Ethiopia. It is one of the languages that use an Ethiopic script which is derived from Gee'z, ancient and currently a liturgical language. Amharic is also one of the most widely used literature-rich languages of Ethiopia. There are very limited innovative and customized research works in Amharic optical character recognition (OCR) in general and Amharic handwritten text recognition in particular. In this study, Amharic handwritten word recognition will be investigated. State-of-the-art deep learning techniques including convolutional neural networks together with recurrent neural networks and connectionist temporal classification (CTC) loss were used to make the recognition in an end-to-end fashion. More importantly, an innovative way of complementing the loss function using the auxiliary task from the row-wise similarities of the Amharic alphabet was tested to show a significant recognition improvement over a baseline method. Such findings will promote innovative problem-specific solutions as well as will open insight to a generalized solution that emerges from problem-specific domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes