Xintong Wu

CL
h-index29
4papers
12citations
Novelty39%
AI Score45

4 Papers

HCNov 12, 2025
TaskSense: Cognitive Chain Modeling and Difficulty Estimation for GUI Tasks

Yiwen Yin, Zhian Hu, Xiaoxi Xu et al.

Measuring GUI task difficulty is crucial for user behavior analysis and agent capability evaluation. Yet, existing benchmarks typically quantify difficulty based on motor actions (e.g., step counts), overlooking the cognitive demands underlying task completion. In this work, we propose Cognitive Chain, a novel framework that models task difficulty from a cognitive perspective. A cognitive chain decomposes the cognitive processes preceding a motor action into a sequence of cognitive steps (e.g., finding, deciding, computing), each with a difficulty index grounded in information theories. We develop an LLM-based method to automatically extract cognitive chains from task execution traces. Validation with linear regression shows that our estimated cognitive difficulty correlates well with user completion time (step-level R-square=0.46 after annotation). Assessment of state-of-the-art GUI agents shows reduced success on cognitively demanding tasks, revealing capability gaps and Human-AI consistency patterns. We conclude by discussing potential applications in agent training, capability assessment, and human-agent delegation optimization.

GNMay 1
Trust Dynamics in Cryptocurrency Markets: Centralized vs. Decentralized Exchanges

Xintong Wu, Wanlin Deng, Yutong Quan et al.

Trust mechanisms diverge between centralized and decentralized exchanges, representing distinct sociotechnical governance paradigms. However, quantifying trust dynamics and their redistribution between these architectures remains empirically challenging, limiting understanding of how institutional shocks affect market behavior. The FTX collapse offers a natural experiment to bridge this gap. Through an interdisciplinary approach combining causal inference and computational text analysis, we find significant price declines and capital reallocation from centralized to decentralized exchanges following the event. While sentiment metrics showed no sharp discontinuities, topic modeling and network analysis of Discord communities reveal that seasonal holiday discourse obscured underlying trust concerns in centralized exchange forums. These findings underscore the fragility of institutional trust architectures and demonstrate how mixed methods can illuminate behavioral patterns during systemic crises, offering insights for exchange risk management and regulatory assessment.

CLApr 4
Leveraging Large Language Models for Sentiment Analysis: Multi-Modal Analysis of Decentraland's MANA Token

Xintong Wu, Peiting Tsai, Jing Yuan et al.

Decentraland, a decentralized virtual reality platform operating within the expanding Metaverse ecosystem, utilizes its native MANA token to facilitate virtual asset transactions and governance. This study investigates the integration of Discord community sentiment with multi-modal financial data to enhance cryptocurrency price prediction within virtual world economies. We address: (1) identifying sentiment patterns within Decentraland's Discord community, and (2) evaluating the impact of multi-modal features on token return forecasting. Using a BERT-based large language model for sentiment analysis, we develop two LSTM architectures: a baseline incorporating historical prices and a multi-modal variant integrating sentiment scores, trading volume, and market capitalization. Results indicate predominantly neutral community sentiment with a positive skew. The multi-modal model significantly outperforms the price-only baseline in prediction accuracy. These findings demonstrate the predictive value of community-derived signals for virtual economy forecasting and establish a foundation for future research at the intersection of immersive virtual environments, natural language processing, and cryptocurrency market analysis.

ROMar 9
POIROT: Investigating Direct Tangible vs. Digitally Mediated Interaction and Attitude Moderation in Multi-party Murder Mystery Games

Wen Chen, Rongxi Chen, Shankai Chen et al.

As social robots take on increasingly complex roles like game masters (GMs) in multi-party games, the expectation that physicality universally enhances user experience remains debated. This study challenges the "one-size-fits-all" view of tangible interaction by identifying a critical boundary condition: users' Negative Attitudes towards Robots (NARS). In a between-subjects experiment (N = 67), a custom-built robot GM facilitated a multi-party murder mystery game (MMG) by delivering clues either through direct tangible interaction or a digitally mediated interface. Baseline multivariate analysis (MANOVA) showed no significant main effect of delivery modality, confirming that tangibility alone does not guarantee superior engagement. However, primary analysis using multilevel linear models (MLM) revealed a reliable moderation: participants high in NARS experienced markedly lower narrative immersion under tangible delivery, whereas those with low NARS scores showed no such decrement. Qualitative findings further illuminate this divergence: tangibility provides novelty and engagement for some but imposes excessive proxemic friction for anxious users, for whom the digital interface acts as a protective social buffer. These results advance a conditional model of HRI and emphasize the necessity for adaptive systems that can tailor interaction modalities to user predispositions.