HC Aug 12, 2022
RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling
Jian Liao, Adnan Karim, Shivesh Jadon et al.
We present RealityTalk, a system that augments real-time live presentations with speech-driven interactive virtual elements. Augmented presentations leverage embedded visuals and animation for engaging and expressive storytelling. However, existing tools for live presentations often lack interactivity and improvisation, while creating such effects in video editing tools requires significant time and expertise. RealityTalk enables users to create live augmented presentations with real-time speech-driven interactions. The user can interactively prompt, move, and manipulate graphical elements through real-time speech and supporting modalities. Based on our analysis of 177 existing video-edited augmented presentations, we propose a novel set of interaction techniques and incorporate them into RealityTalk. We evaluate our tool from a presenter's perspective to demonstrate the effectiveness of our system.
IR Mar 4
Multi-view Attention Fusion of Heterogeneous Hypergraph with Dynamic Behavioral Profiling for Personalized Learning Resource Recommendation
Tao Xie, Yan Li, Yongpan Sheng et al.
Hypergraphs can capture complex and higher-order dependencies among learners and learning resources in personalized educational recommender systems. Many existing hypergraph-based recommendation approaches underexplore the dynamic behavioral processes inherent to learning and often oversimplify the complementary information embedded across multiple dimensions (i.e., views) within hypergraphs. These limitations compromise both the distinctiveness of learned representations and the model's generalization capabilities, especially under data-sparse conditions typical in educational settings. In this study, we propose a unified model comprising a dynamic behavioral profiling module and a multi-view attention fusion module based on heterogeneous hypergraph construction. The dynamic behavioral profiling module is designed to capture evolving behavioral processes and infer latent higher-order relations crucial for hypergraph completion. The multi-view fusion module cohesively integrates information from distinct relational views, enriching the overall data representation. The proposed model was systematically evaluated on five public benchmark datasets and one real-world, self-constructed dataset. Experimental results demonstrate that the model outperforms baseline methods across most datasets in key metrics. Furthermore, hypergraph completion based on dynamic behavioral profiling contributes significantly to performance gains, though its efficacy is modulated by dataset characteristics. Beyond offline experiments, we implemented a functional prototype system tailored for postgraduate student literature recommendation. A mixed-methods user study was conducted to assess its practical utility. Quantitative analysis revealed significantly higher perceived recommendation quality, and qualitative feedback highlighted enhanced user engagement and satisfaction with the prototype system.
93.9 SI Apr 10
PERCEIVE: A Benchmark for Personalized Emotion and Communication Behavior Understanding on Social Media
Jian Liao, Yujin Zheng, Suge Wang et al.
Current emotion analysis in social media is predominantly author-centric, failing to capture the subjective nature of emotional responses across diverse readers. This paradigm overlooks the crucial link between individual perception, communication behavior, and the underlying social network. To bridge this gap, we introduce PERCEIVE, a novel bilingual (English and Chinese) large-scale benchmark that, to the best of our knowledge, is the first to integrate five critical dimensions for social perception: author-created content, genuine readers' emotional feedback (derived from their comments), communication behavior, user attributes, and the social graph. This benchmark enables a paradigm shift towards truly personalized, reader-centric analysis, where different readers' emotional responses to the same content are naturally captured through their real-world interactions. By annotating emotions from reader comments and synchronously capturing communication intent, PERCEIVE provides a unique resource to model the intrinsic coupling between emotion and behavior, grounded in social context. We establish a comprehensive evaluation protocol, testing state-of-the-art methods, including large language models (LLMs) with advanced reasoning enhancement. Our findings reveal significant shortcomings in existing approaches when handling this multifaceted, user-aware task. PERCEIVE offers a foundational resource and clear direction for future research in socially-intelligent NLP, pushing models towards a more unified understanding of emotion on social media.
CL Dec 10, 2024
My Words Imply Your Opinion: Reader Agent-based Propagation Enhancement for Personalized Implicit Emotion Analysis
Jian Liao, Yu Feng, Yujin Zheng et al.
The subtlety of emotional expressions makes implicit emotion analysis (IEA) particularly sensitive to user-specific characteristics. Current studies personalize emotion analysis by focusing on the author but neglect the impact of the intended reader on implicit emotional feedback. In this paper, we introduce Personalized IEA (PIEA) and present the RAPPIE model, which addresses subjective variability by incorporating reader feedback. In particular, (1) we create reader agents based on large language models to simulate reader feedback, overcoming the "spiral of silence" effect and the incompleteness of real reader reaction data; (2) we develop a role-aware multi-view graph learning method to model the interactive emotion propagation process in scenarios with sparse reader information; and (3) we construct two new PIEA datasets covering English and Chinese social media with detailed user metadata, addressing the text-centric limitation of existing datasets. Extensive experiments show that RAPPIE significantly outperforms state-of-the-art baselines, demonstrating the value of incorporating reader feedback in PIEA.