AIAug 10, 2023

Optical Script Identification for multi-lingual Indic-script

arXiv:2308.05780v1h-index: 2
Originality Synthesis-oriented
AI Analysis

This is an incremental survey paper that may benefit researchers working on optical script identification for Indic languages and other scripts.

This survey paper examines preprocessing and recognition techniques for twelve Indic scripts, which present complex challenges due to similarities in text shape, and provides a comparative analysis of existing algorithms.

Script identification and text recognition are some of the major domains in the application of Artificial Intelligence. In this era of digitalization, the use of digital note-taking has become a common practice. Still, conventional methods of using pen and paper is a prominent way of writing. This leads to the classification of scripts based on the method they are obtained. A survey on the current methodologies and state-of-art methods used for processing and identification would prove beneficial for researchers. The aim of this article is to discuss the advancement in the techniques for script pre-processing and text recognition. In India there are twelve prominent Indic scripts, unlike the English language, these scripts have layers of characteristics. Complex characteristics such as similarity in text shape make them difficult to recognize and analyze, thus this requires advance preprocessing methods for their accurate recognition. A sincere attempt is made in this survey to provide a comparison between all algorithms. We hope that this survey would provide insight to a researcher working not only on Indic scripts but also other languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes