CV LGSep 13, 2022

Just Noticeable Difference Modeling for Face Recognition System

Yu Tian, Zhangkai Ni, Baoliang Chen, Shurun Wang, Shiqi Wang, Hanli Wang, Sam Kwong

arXiv:2209.05856v21.43 citationsh-index: 91

Originality Incremental advance

AI Analysis

This work addresses the need for efficient compression in face recognition systems for surveillance and security, though it is incremental as it builds on existing JND concepts applied to a new domain.

The paper tackles the problem of face image compression degrading face recognition performance by introducing a just noticeable difference (JND) model to predict the maximum distortion unnoticeable to the system, achieving higher accuracy in JND prediction and saving more bits while maintaining recognition performance compared to the VVC standard.

High-quality face images are required to guarantee the stability and reliability of automatic face recognition (FR) systems in surveillance and security scenarios. However, a massive amount of face data is usually compressed before being analyzed due to limitations on transmission or storage. The compressed images may lose the powerful identity information, resulting in the performance degradation of the FR system. Herein, we make the first attempt to study just noticeable difference (JND) for the FR system, which can be defined as the maximum distortion that the FR system cannot notice. More specifically, we establish a JND dataset including 3530 original images and 137,670 compressed images generated by advanced reference encoding/decoding software based on the Versatile Video Coding (VVC) standard (VTM-15.0). Subsequently, we develop a novel JND prediction model to directly infer JND images for the FR system. In particular, in order to maximum redundancy removal without impairment of robust identity information, we apply the encoder with multiple feature extraction and attention-based feature decomposition modules to progressively decompose face features into two uncorrelated components, i.e., identity and residual features, via self-supervised learning. Then, the residual feature is fed into the decoder to generate the residual map. Finally, the predicted JND map is obtained by subtracting the residual map from the original image. Experimental results have demonstrated that the proposed model achieves higher accuracy of JND map prediction compared with the state-of-the-art JND models, and is capable of saving more bits while maintaining the performance of the FR system compared with VTM-15.0.

View on arXiv PDF

Similar