AS AI SD Sep 23, 2024

Safe Guard: an LLM-agent for Real-time Voice-based Hate Speech Detection in Social Virtual Reality

Yiwen Xu, Qinyang Hou, Hongyu Wan, Mirjana Prpa

arXiv:2409.15623v1, 6 citations, h-index: 6

Originality: Synthesis-oriented

AI Analysis

This work addresses safety concerns for users in social VR environments, though it is an incremental application of existing methods to a new domain.

The paper tackles hate speech detection in voice-based social VR interactions by developing Safe Guard, an LLM-agent using GPT and audio features, which reduces false positives compared to existing approaches.

In this paper, we present Safe Guard, an LLM-agent for the detection of hate speech in voice-based interactions in social VR (VRChat). Our system leverages OpenAI GPT and audio feature extraction for real-time voice interactions. We contribute a system design and an evaluation of the system that demonstrates the capability of our approach in detecting hate speech and reducing false positives compared to currently available approaches. Our results indicate the potential of LLM-based agents in creating safer virtual environments and set the groundwork for further advancements in LLM-driven moderation approaches.
