Enhancing Multilingual Sentiment Analysis with Explainability for Sinhala, English, and Code-Mixed Content
It addresses the problem of improving brand reputation management for banks by providing explainable sentiment analysis, though it is incremental as it builds on existing transformer models with domain-specific enhancements.
This research tackled sentiment analysis for multilingual banking feedback, including low-resource languages like Sinhala and code-mixed text, by developing a hybrid framework that achieved 92.3% accuracy in English and 88.4% in Sinhala and code-mixed content.
Sentiment analysis is crucial for brand reputation management in the banking sector, where customer feedback spans English, Sinhala, Singlish, and code-mixed text. Existing models struggle with low-resource languages like Sinhala and lack interpretability for practical use. This research develops a hybrid aspect-based sentiment analysis framework that enhances multilingual capabilities with explainable outputs. Using cleaned banking customer reviews, we fine-tune XLM-RoBERTa for Sinhala and code-mixed text, integrate domain-specific lexicon correction, and employ BERT-base-uncased for English. The system classifies sentiment (positive, neutral, negative) with confidence scores, while SHAP and LIME improve interpretability by providing real-time sentiment explanations. Experimental results show that our approaches outperform traditional transformer-based classifiers, achieving 92.3 percent accuracy and an F1-score of 0.89 in English and 88.4 percent in Sinhala and code-mixed content. An explainability analysis reveals key sentiment drivers, improving trust and transparency. A user-friendly interface delivers aspect-wise sentiment insights, ensuring accessibility for businesses. This research contributes to robust, transparent sentiment analysis for financial applications by bridging gaps in multilingual, low-resource NLP and explainability.