Jihye Lee

AI
h-index26
4papers
10citations
Novelty41%
AI Score41

4 Papers

HCJun 2
The Comparative Trap: How Social Comparison Orientation Drives Problematic Generative AI (GenAI) Use

Xuchao Zhang, Jihye Lee

Although Generative AI (GenAI) improves task efficiency in the short term, it creates competitive pressures that perpetuate individuals' fear of being eliminated, thereby increasing the risk of problematic use. Existing research has focused on the perspective of individual psychological vulnerability, but has neglected the social comparison context caused by GenAI. This study examines the direct effects of social comparison orientation on problematic GenAI use and explores their indirect effects via emotional and cognitive mechanisms, grounded in the Person-Affect-Cognition-Execution (I-PACE) model. The research analyzed data from 396 Chinese GenAI users using SEM and bootstrap methods. Findings show that social comparison orientation has a significant direct impact on problematic GenAI use and can additionally influence AI flow and perceived irreplaceability through fear of missing out (FoMO), finally leading to problematic GenAI use.

AIDec 15, 2024
Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems using Large Multimodal Models

Sebastian Gutierrez, Irene Hou, Jihye Lee et al.

Recent advancements in generative AI systems have raised concerns about academic integrity among educators. Beyond excelling at solving programming problems and text-based multiple-choice questions, recent research has also found that large multimodal models (LMMs) can solve Parsons problems based only on an image. However, such problems are still inherently text-based and rely on the capabilities of the models to convert the images of code blocks to their corresponding text. In this paper, we further investigate the capabilities of LMMs to solve graph and tree data structure problems based only on images. To achieve this, we computationally construct and evaluate a novel benchmark dataset comprising 9,072 samples of diverse graph and tree data structure tasks to assess the performance of the GPT-4o, GPT-4v, Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini 1.0 Pro Vision, and Claude 3 model families. GPT-4o and Gemini 1.5 Flash performed best on trees and graphs respectively. GPT-4o achieved 87.6% accuracy on tree samples, while Gemini 1.5 Flash, achieved 56.2% accuracy on graph samples. Our findings highlight the influence of structural and visual variations on model performance. This research not only introduces an LMM benchmark to facilitate replication and further exploration but also underscores the potential of LMMs in solving complex computing problems, with important implications for pedagogy and assessment practices.

MLJul 11, 2025
MIRRAMS: Learning Robust Tabular Models under Unseen Missingness Shifts

Jihye Lee, Minseo Kang, Dongha Kim

The presence of missing values often reflects variations in data collection policies, which may shift across time or locations, even when the underlying feature distribution remains stable. Such shifts in the missingness distribution between training and test inputs pose a significant challenge to achieving robust predictive performance. In this study, we propose a novel deep learning framework designed to address this challenge, particularly in the common yet challenging scenario where the test-time dataset is unseen. We begin by introducing a set of mutual information-based conditions, called MI robustness conditions, which guide the prediction model to extract label-relevant information. This promotes robustness against distributional shifts in missingness at test-time. To enforce these conditions, we design simple yet effective loss terms that collectively define our final objective, called MIRRAMS. Importantly, our method does not rely on any specific missingness assumption such as MCAR, MAR, or MNAR, making it applicable to a broad range of scenarios. Furthermore, it can naturally extend to cases where labels are also missing in training data, by generalizing the framework to a semi-supervised learning setting. Extensive experiments across multiple benchmark tabular datasets demonstrate that MIRRAMS consistently outperforms existing state-of-the-art baselines and maintains stable performance under diverse missingness conditions. Moreover, it achieves superior performance even in fully observed settings, highlighting MIRRAMS as a powerful, off-the-shelf framework for general-purpose tabular learning.

IRApr 5, 2024
Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

Hwiyeol Jo, Taiwoo Park, Hyunwoo Lee et al.

Although there has been a growing interest among industries in integrating generative LLMs into their services, limited experience and scarcity of resources act as a barrier in launching and servicing large-scale LLM-based services. In this paper, we share our experiences in developing and operating generative AI models within a national-scale search engine, with a specific focus on the sensitiveness of user queries. We propose a taxonomy for sensitive search queries, outline our approaches, and present a comprehensive analysis report on sensitive queries from actual users. We believe that our experiences in launching generative AI search systems can contribute to reducing the barrier in building generative LLM-based services.