CVAIOct 8, 2021

Inferring Offensiveness In Images From Natural Language Supervision

arXiv:2110.04222v12 citations
Originality Incremental advance
AI Analysis

This addresses the need for safer and more representative datasets in computer vision, though it is incremental as it builds on existing CLIP models.

The paper tackled the problem of offensive content in large-scale image datasets by proposing an automated curation method using pre-trained transformers and human-annotated examples, resulting in the identification of inappropriate content beyond previously known categories like privacy violations and pornography.

Probing or fine-tuning (large-scale) pre-trained models results in state-of-the-art performance for many NLP tasks and, more recently, even for computer vision tasks when combined with image data. Unfortunately, these approaches also entail severe risks. In particular, large image datasets automatically scraped from the web may contain derogatory terms as categories and offensive images, and may also underrepresent specific classes. Consequently, there is an urgent need to carefully document datasets and curate their content. Unfortunately, this process is tedious and error-prone. We show that pre-trained transformers themselves provide a methodology for the automated curation of large-scale vision datasets. Based on human-annotated examples and the implicit knowledge of a CLIP based model, we demonstrate that one can select relevant prompts for rating the offensiveness of an image. In addition to e.g. privacy violation and pornographic content previously identified in ImageNet, we demonstrate that our approach identifies further inappropriate and potentially offensive content.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes