ERIT Lightweight Multimodal Dataset for Elderly Emotion Recognition and Multimodal Fusion Evaluation
This dataset enables research on multimodal fusion and emotion recognition for the elderly, an underrepresented group, but it is incremental as it primarily provides new data rather than novel methods.
The authors introduced ERIT, a multimodal dataset of text and images from elderly individuals with emotion labels, to support lightweight multimodal fusion research and address the underrepresentation of elderly emotion recognition in machine learning.
ERIT is a novel multimodal dataset designed to facilitate research in a lightweight multimodal fusion. It contains text and image data collected from videos of elderly individuals reacting to various situations, as well as seven emotion labels for each data sample. Because of the use of labeled images of elderly users reacting emotionally, it is also facilitating research on emotion recognition in an underrepresented age group in machine learning visual emotion recognition. The dataset is validated through comprehensive experiments indicating its importance in neural multimodal fusion research.