A Computational Framework to Identify Self-Aspects in Text
It addresses the need for systematic NLP analysis of Self-aspects, which could benefit fields like mental health, but it is incremental as it builds on existing psychological concepts and methods.
This Ph.D. proposal aims to develop a computational framework to identify Self-aspects in text, which are underexplored in NLP, by creating an ontology and dataset and evaluating models based on interpretability, accuracy, and other criteria, with planned applications in mental health and phenomenology.
This Ph.D. proposal introduces a plan to develop a computational framework to identify Self-aspects in text. The Self is a multifaceted construct and it is reflected in language. While it is described across disciplines like cognitive science and phenomenology, it remains underexplored in natural language processing (NLP). Many of the aspects of the Self align with psychological and other well-researched phenomena (e.g., those related to mental health), highlighting the need for systematic NLP-based analysis. In line with this, we plan to introduce an ontology of Self-aspects and a gold-standard annotated dataset. Using this foundation, we will develop and evaluate conventional discriminative models, generative large language models, and embedding-based retrieval approaches against four main criteria: interpretability, ground-truth adherence, accuracy, and computational efficiency. Top-performing models will be applied in case studies in mental health and empirical phenomenology.