Detection and Analysis of Content Creator Collaborations in YouTube Videos using Face- and Speaker-Recognition
This work addresses the challenge of identifying content creator collaborations in YouTube videos, particularly videos in which no faces appear. The contribution is incremental, since it builds upon an existing framework.
The paper tackles the problem of detecting collaborations in YouTube videos by extending the CATANA framework with active speaker detection and speaker recognition. The extension targets the framework's poor performance on videos without visible faces and improves detection accuracy.
This work discusses and implements the application of speaker recognition to the detection of collaborations in YouTube videos. CATANA, an existing framework for the detection and analysis of YouTube collaborations, uses face recognition to detect collaborators, which naturally performs poorly on video content in which no faces appear. This work proposes an extension of CATANA that uses active speaker detection and speaker recognition to improve detection accuracy.
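The idea of complementing face recognition with speaker recognition can be illustrated with a minimal sketch. It is not CATANA's code: it assumes that face and voice embeddings have already been extracted as plain vectors, and that identities are matched by cosine similarity against reference embeddings of known creators. The function names, the dictionary layout, and the 0.8 threshold are all hypothetical choices made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_creator(embedding, known, threshold):
    """Return the best-matching known creator, or None if no score reaches the threshold."""
    best_name, best_score = None, threshold
    for name, reference in known.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


def detect_collaborators(face_embeddings, voice_embeddings,
                         known_faces, known_voices, threshold=0.8):
    """Collect creators recognized by face or by voice in one video.

    All inputs are hypothetical: lists of embedding vectors for the video and
    dictionaries mapping creator names to reference embeddings.
    """
    found = set()
    for embedding in face_embeddings:
        name = match_creator(embedding, known_faces, threshold)
        if name is not None:
            found.add(name)
    # Voice matching still contributes when no faces appear in the video.
    for embedding in voice_embeddings:
        name = match_creator(embedding, known_voices, threshold)
        if name is not None:
            found.add(name)
    return found


# Toy example: a video with no visible faces but one detected speaker.
known_faces = {"alice": [1.0, 0.0], "bob": [0.0, 1.0]}
known_voices = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}
collaborators = detect_collaborators([], [[0.9, 0.1, 0.0]], known_faces, known_voices)
```

In this toy case the face-based pass finds nobody, while the voice-based pass still attributes the video to one known creator. This is the situation the proposed extension is meant to cover.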