Datasets for Depression Modeling in Social Media: An Overview
It provides a resource for researchers working on depression modeling, but it is incremental as it primarily aggregates existing datasets without introducing new methods or data.
This paper compiles a list of datasets published from 2019 to 2024 for analyzing and predicting depression using social media data, aiming to support early-career researchers and facilitate interdisciplinary studies.
Depression is the most common mental health disorder, and its prevalence increased during the COVID-19 pandemic. As one of the most extensively researched psychological conditions, recent research has increasingly focused on leveraging social media data to enhance traditional methods of depression screening. This paper addresses the growing interest in interdisciplinary research on depression, and aims to support early-career researchers by providing a comprehensive and up-to-date list of datasets for analyzing and predicting depression through social media data. We present an overview of datasets published between 2019 and 2024. We also make the comprehensive list of datasets available online as a continuously updated resource, with the hope that it will facilitate further interdisciplinary research into the linguistic expressions of depression on social media.