CLMar 27, 2025

Datasets for Depression Modeling in Social Media: An Overview

arXiv:2503.21513v114 citationsh-index: 20Has CodeCLPsych
Originality Synthesis-oriented
AI Analysis

It provides a resource for researchers working on depression modeling, but it is incremental as it primarily aggregates existing datasets without introducing new methods or data.

This paper compiles a list of datasets published from 2019 to 2024 for analyzing and predicting depression using social media data, aiming to support early-career researchers and facilitate interdisciplinary studies.

Depression is the most common mental health disorder, and its prevalence increased during the COVID-19 pandemic. As one of the most extensively researched psychological conditions, recent research has increasingly focused on leveraging social media data to enhance traditional methods of depression screening. This paper addresses the growing interest in interdisciplinary research on depression, and aims to support early-career researchers by providing a comprehensive and up-to-date list of datasets for analyzing and predicting depression through social media data. We present an overview of datasets published between 2019 and 2024. We also make the comprehensive list of datasets available online as a continuously updated resource, with the hope that it will facilitate further interdisciplinary research into the linguistic expressions of depression on social media.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes