CVAICLCRLGJul 8, 2024

T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

arXiv:2407.05965v359 citationsh-index: 42
AI Analysis

This addresses safety risks for users and developers of text-to-video models, but it is incremental as it extends existing safety evaluation concepts from text-to-image to video.

The paper tackles the lack of comprehensive safety evaluation for text-to-video generative models by introducing T2VSafetyBench, a benchmark with 12 safety aspects and a malicious prompt dataset, finding that no single model excels in all aspects and highlighting a trade-off between usability and safety.

The recent development of Sora leads to a new era in text-to-video (T2V) generation. Along with this comes the rising concern about its security risks. The generated videos may contain illegal or unethical content, and there is a lack of comprehensive quantitative understanding of their safety, posing a challenge to their reliability and practical deployment. Previous evaluations primarily focus on the quality of video generation. While some evaluations of text-to-image models have considered safety, they cover fewer aspects and do not address the unique temporal risk inherent in video generation. To bridge this research gap, we introduce T2VSafetyBench, a new benchmark designed for conducting safety-critical assessments of text-to-video models. We define 12 critical aspects of video generation safety and construct a malicious prompt dataset including real-world prompts, LLM-generated prompts and jailbreak attack-based prompts. Based on our evaluation results, we draw several important findings, including: 1) no single model excels in all aspects, with different models showing various strengths; 2) the correlation between GPT-4 assessments and manual reviews is generally high; 3) there is a trade-off between the usability and safety of text-to-video generative models. This indicates that as the field of video generation rapidly advances, safety risks are set to surge, highlighting the urgency of prioritizing video safety. We hope that T2VSafetyBench can provide insights for better understanding the safety of video generation in the era of generative AI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes