CLMar 17, 2022

Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss

arXiv:2203.09278v15 citationsh-index: 37
Originality Incremental advance
AI Analysis

This addresses confidence calibration for intent detection, which is important for users who need reliable model confidence, but it is incremental as it builds on existing calibration methods.

The paper tackles the problem of over-confident predictions in intent detection by proposing a model that uses hyperspherical space and a rebalanced accuracy-uncertainty loss, achieving significant improvement in calibration metrics on open datasets.

Data-driven methods have achieved notable performance on intent detection, which is a task to comprehend user queries. Nonetheless, they are controversial for over-confident predictions. In some scenarios, users do not only care about the accuracy but also the confidence of model. Unfortunately, mainstream neural networks are poorly calibrated, with a large gap between accuracy and confidence. To handle this problem defined as confidence calibration, we propose a model using the hyperspherical space and rebalanced accuracy-uncertainty loss. Specifically, we project the label vector onto hyperspherical space uniformly to generate a dense label representation matrix, which mitigates over-confident predictions due to overfitting sparce one-hot label matrix. Besides, we rebalance samples of different accuracy and uncertainty to better guide model training. Experiments on the open datasets verify that our model outperforms the existing calibration methods and achieves a significant improvement on the calibration metric.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes