CLAug 15, 2024

mhGPT: A Lightweight Generative Pre-Trained Transformer for Mental Health Text Analysis

arXiv:2408.08261v15 citationsh-index: 27
Originality Synthesis-oriented
AI Analysis

This research could advance AI-driven mental health care in low-resource settings, though it is incremental as it adapts existing transformer methods to a specific domain.

The paper tackled the problem of analyzing mental health text with limited hardware by introducing mhGPT, a lightweight transformer trained on mental health data, which outperformed larger models and matched performance with only 1.98 billion parameters and 5% of the dataset.

This paper introduces mhGPT, a lightweight generative pre-trained transformer trained on mental health-related social media and PubMed articles. Fine-tuned for specific mental health tasks, mhGPT was evaluated under limited hardware constraints and compared with state-of-the-art models like MentaLLaMA and Gemma. Despite having only 1.98 billion parameters and using just 5% of the dataset, mhGPT outperformed larger models and matched the performance of models trained on significantly more data. The key contributions include integrating diverse mental health data, creating a custom tokenizer, and optimizing a smaller architecture for low-resource settings. This research could advance AI-driven mental health care, especially in areas with limited computing power.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes