CLFeb 5

Reasoning under Ambiguity: Uncertainty-Aware Multilingual Emotion Classification under Partial Supervision

arXiv:2602.05471v2h-index: 13
Originality Highly original
AI Analysis

This work tackles the problem of biased learning and unreliable predictions in multilingual multi-label emotion classification for knowledge-based systems, especially when dealing with emotional ambiguity and incomplete supervision.

This paper addresses multilingual emotion classification under partial supervision and emotional ambiguity. The authors introduce a framework that explicitly aligns learning with annotation uncertainty, using an entropy-based ambiguity weighting mechanism and a mask-aware objective with positive-unlabeled regularization. This approach consistently improves performance over strong baselines on English, Spanish, and Arabic emotion classification benchmarks.

Contemporary knowledge-based systems increasingly rely on multilingual emotion identification to support intelligent decision-making, yet they face major challenges due to emotional ambiguity and incomplete supervision. Emotion recognition from text is inherently uncertain because multiple emotional states often co-occur and emotion annotations are frequently missing or heterogeneous. Most existing multi-label emotion classification methods assume fully observed labels and rely on deterministic learning objectives, which can lead to biased learning and unreliable predictions under partial supervision. This paper introduces Reasoning under Ambiguity, an uncertainty-aware framework for multilingual multi-label emotion classification that explicitly aligns learning with annotation uncertainty. The proposed approach uses a shared multilingual encoder with language-specific optimization and an entropy-based ambiguity weighting mechanism that down-weights highly ambiguous training instances rather than treating missing labels as negative evidence. A mask-aware objective with positive-unlabeled regularization is further incorporated to enable robust learning under partial supervision. Experiments on English, Spanish, and Arabic emotion classification benchmarks demonstrate consistent improvements over strong baselines across multiple evaluation metrics, along with improved training stability, robustness to annotation sparsity, and enhanced interpretability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes