CVNov 20, 2024

Learning based Ge'ez character handwritten recognition

arXiv:2411.13350v12 citationsh-index: 82024 IEEE International Multi-Conference on Smart Systems & Green Process (IMC-SSGP)
Originality Synthesis-oriented
AI Analysis

This work addresses the digitization of Ge'ez manuscripts for cultural preservation, with implications for historical document access and educational tools, though it is incremental as it applies existing methods to a new domain.

The study tackled the problem of Ge'ez handwritten character recognition, which has been neglected in research, by developing a CNN-LSTM system that achieved new top scores, outperforming eight state-of-the-art methods and human performance on the HHD-Ethiopic dataset.

Ge'ez, an ancient Ethiopic script of cultural and historical significance, has been largely neglected in handwriting recognition research, hindering the digitization of valuable manuscripts. Our study addresses this gap by developing a state-of-the-art Ge'ez handwriting recognition system using Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks. Our approach uses a two-stage recognition process. First, a CNN is trained to recognize individual characters, which then acts as a feature extractor for an LSTM-based system for word recognition. Our dual-stage recognition approach achieves new top scores in Ge'ez handwriting recognition, outperforming eight state-of-the-art methods, which are SVTR, ASTER, and others as well as human performance, as measured in the HHD-Ethiopic dataset work. This research significantly advances the preservation and accessibility of Ge'ez cultural heritage, with implications for historical document digitization, educational tools, and cultural preservation. The code will be released upon acceptance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes