CLMay 23, 2025

Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition

arXiv:2505.18040v1h-index: 36Has Code

Originality Incremental advance

AI Analysis

This addresses the need for efficient, generalizable emotion recognition models for edge devices, though it is incremental as it builds on existing distillation and contrastive learning techniques.

The paper tackles the problem of building adaptable emotion recognition systems that can handle various emotion labels without dedicated training by proposing a contrastive distillation framework that transfers emotional knowledge from large language models into a compact model, achieving performance approaching GPT-4's zero-shot capabilities while being over 10,000 times smaller.

The ability to handle various emotion labels without dedicated training is crucial for building adaptable Emotion Recognition (ER) systems. Conventional ER models rely on training using fixed label sets and struggle to generalize beyond them. On the other hand, Large Language Models (LLMs) have shown strong zero-shot ER performance across diverse label spaces, but their scale limits their use on edge devices. In this work, we propose a contrastive distillation framework that transfers rich emotional knowledge from LLMs into a compact model without the use of human annotations. We use GPT-4 to generate descriptive emotion annotations, offering rich supervision beyond fixed label sets. By aligning text samples with emotion descriptors in a shared embedding space, our method enables zero-shot prediction on different emotion classes, granularity, and label schema. The distilled model is effective across multiple datasets and label spaces, outperforming strong baselines of similar size and approaching GPT-4's zero-shot performance, while being over 10,000 times smaller.

View on arXiv PDF Code

Similar