CLFeb 11, 2022

Hindi/Bengali Sentiment Analysis Using Transfer Learning and Joint Dual Input Learning with Self Attention

arXiv:2202.05457v12 citations
Originality Synthesis-oriented
AI Analysis

This work addresses sentiment and hate speech analysis for low-resource languages like Hindi and Bengali, representing an incremental advancement in adapting existing techniques to new datasets.

The paper tackled sentiment analysis and hate speech detection in Hindi and Bengali by applying transfer learning and joint dual input learning with self-attention, achieving improved classification results through methods like parameter sharing and BiLSTM with self-attention.

Sentiment Analysis typically refers to using natural language processing, text analysis and computational linguistics to extract affect and emotion based information from text data. Our work explores how we can effectively use deep neural networks in transfer learning and joint dual input learning settings to effectively classify sentiments and detect hate speech in Hindi and Bengali data. We start by training Word2Vec word embeddings for Hindi \textbf{HASOC dataset} and Bengali hate speech and then train LSTM and subsequently, employ parameter sharing based transfer learning to Bengali sentiment classifiers by reusing and fine-tuning the trained weights of Hindi classifiers with both classifier being used as baseline in our study. Finally, we use BiLSTM with self attention in joint dual input learning setting where we train a single neural network on Hindi and Bengali dataset simultaneously using their respective embeddings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes