CLJan 31, 2021

BNLP: Natural language processing toolkit for Bengali language

arXiv:2102.00405v254 citationsHas Code
AI Analysis

This toolkit addresses the problem of limited NLP resources for Bengali language researchers and developers, though it is incremental as it applies existing methods to a new domain.

The authors tackled the lack of natural language processing tools for Bengali by developing BNLP, an open-source toolkit that provides tokenization, word embedding, POS tagging, and NER tagging with pre-trained models achieving significant results, as evidenced by 16K downloads and 119 stars on GitHub.

BNLP is an open source language processing toolkit for Bengali language consisting with tokenization, word embedding, POS tagging, NER tagging facilities. BNLP provides pre-trained model with high accuracy to do model based tokenization, embedding, POS tagging, NER tagging task for Bengali language. BNLP pre-trained model achieves significant results in Bengali text tokenization, word embedding, POS tagging and NER tagging task. BNLP is using widely in the Bengali research communities with 16K downloads, 119 stars and 31 forks. BNLP is available at https://github.com/sagorbrur/bnlp.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes