DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms
This addresses the urgent need for reliable DeepFake detection tools, offering a novel approach that generalizes across different generation techniques and degradations.
The paper tackles the problem of detecting DeepFake videos by monitoring heartbeat rhythms, which are disrupted in synthetic content, and demonstrates effectiveness with experiments on FaceForensics++ and DFDC-preview datasets.
As the GAN-based face image and video generation techniques, widely known as DeepFakes, have become more and more matured and realistic, there comes a pressing and urgent demand for effective DeepFakes detectors. Motivated by the fact that remote visual photoplethysmography (PPG) is made possible by monitoring the minuscule periodic changes of skin color due to blood pumping through the face, we conjecture that normal heartbeat rhythms found in the real face videos will be disrupted or even entirely broken in a DeepFake video, making it a potentially powerful indicator for DeepFake detection. In this work, we propose DeepRhythm, a DeepFake detection technique that exposes DeepFakes by monitoring the heartbeat rhythms. DeepRhythm utilizes dual-spatial-temporal attention to adapt to dynamically changing face and fake types. Extensive experiments on FaceForensics++ and DFDC-preview datasets have confirmed our conjecture and demonstrated not only the effectiveness, but also the generalization capability of \emph{DeepRhythm} over different datasets by various DeepFakes generation techniques and multifarious challenging degradations.