CL Feb 22, 2017

BanglaLekha-Isolated: A Comprehensive Bangla Handwritten Character Dataset

arXiv:1703.10661v1
23 citations
Originality: Synthesis-oriented
AI Analysis

This dataset addresses the need for Bangla handwriting recognition data, which matters to the Bangla-speaking population of Bangladesh and West Bengal. The contribution is incremental, as it primarily provides new data.

The authors introduce BanglaLekha-Isolated, which they describe as the largest dataset of Bangla handwritten characters to date. It covers numerals, basic characters, and compound characters, collected from diverse locations and age groups in Bangladesh.

Bangla handwriting recognition is becoming an increasingly important problem. It is a potentially valuable task, especially for the Bangla-speaking population of Bangladesh and West Bengal. With that in mind, we introduce a comprehensive Bangla handwritten character dataset named BanglaLekha-Isolated. The dataset contains Bangla handwritten numerals, basic characters, and compound characters. It was collected from multiple geographical locations within Bangladesh and includes samples from a variety of age groups. The dataset can also be used for other classification problems, e.g., gender, age, or district. It is the largest dataset of Bangla handwritten characters to date.

