CLMay 28, 2023

SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration

arXiv:2305.17696v1227 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for safer AI in handling sensitive topics for users and developers, though it is incremental as it builds on existing datasets and methods.

The authors tackled the problem of large language models generating harmful content in discussions on sensitive issues by creating the SQuARe dataset, a large-scale Korean dataset of 49k sensitive questions with responses, which improved acceptable response generation for models like HyperCLOVA and GPT-3.

The potential social harms that large language models pose, such as generating offensive content and reinforcing biases, are steeply rising. Existing works focus on coping with this concern while interacting with ill-intentioned users, such as those who explicitly make hate speech or elicit harmful responses. However, discussions on sensitive issues can become toxic even if the users are well-intentioned. For safer models in such scenarios, we present the Sensitive Questions and Acceptable Response (SQuARe) dataset, a large-scale Korean dataset of 49k sensitive questions with 42k acceptable and 46k non-acceptable responses. The dataset was constructed leveraging HyperCLOVA in a human-in-the-loop manner based on real news headlines. Experiments show that acceptable response generation significantly improves for HyperCLOVA and GPT-3, demonstrating the efficacy of this dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes