IR CV LGMar 14, 2022

Dataset and Case Studies for Visual Near-Duplicates Detection in the Context of Social Media

arXiv:2203.07167v15.26 citationsh-index: 57Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the need for analyzing social phenomena related to image spread on social media, but it is incremental as it applies existing methods to new data.

The paper tackled the problem of tracking visually-similar content on social media by building a dataset of images and evaluating retrieval methods, achieving promising recall results.

The massive spread of visual content through the web and social media poses both challenges and opportunities. Tracking visually-similar content is an important task for studying and analyzing social phenomena related to the spread of such content. In this paper, we address this need by building a dataset of social media images and evaluating visual near-duplicates retrieval methods based on image retrieval and several advanced visual feature extraction methods. We evaluate the methods using a large-scale dataset of images we crawl from social media and their manipulated versions we generated, presenting promising results in terms of recall. We demonstrate the potential of this method in two case studies: one that shows the value of creating systems supporting manual content review, and another that demonstrates the usefulness of automatic large-scale data analysis.

View on arXiv PDF Code

Similar