A Study of Sindhi Related and Arabic Script Adapted languages Recognition
This is an incremental survey paper addressing the lack of OCR systems for languages using Arabic script adaptations, such as Sindhi.
This paper surveys existing research on optical character recognition (OCR) for Arabic script and its related adapted languages like Sindhi, which currently lack dedicated OCR systems. It organizes the literature by introducing Sindhi language properties, reviewing OCR techniques used by various researchers, and discussing future work.
A large number of publications are available for the Optical Character Recognition (OCR). Significant researches, as well as articles are present for the Latin, Chinese and Japanese scripts. Arabic script is also one of mature script from OCR perspective. The adaptive languages which share Arabic script or its extended characters; still lacking the OCRs for their language. In this paper we present the efforts of researchers on Arabic and its related and adapted languages. This survey is organized in different sections, in which introduction is followed by properties of Sindhi Language. OCR process techniques and methods used by various researchers are presented. The last section is dedicated for future work and conclusion is also discussed.