CV, MM · Sep 13, 2023

Differentiable JPEG: The Devil is in the Details

Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar

arXiv:2309.06978v4 · 12.129 citations · h-index: 44 · Has Code

Originality: Incremental advance

AI Analysis

The paper addresses a limitation that researchers and practitioners face when using JPEG inside deep learning pipelines: standard JPEG coding is non-differentiable. It represents an incremental improvement over prior differentiable approximations.

The paper tackles JPEG's non-differentiability in deep learning by proposing a differentiable JPEG approach that matches the (non-differentiable) reference implementation more closely than existing methods. Compared with the best recent differentiable approach, it gains 3.47 dB PSNR on average and up to 9.51 dB at strong compression rates.
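To put the reported dB gains in context, the sketch below computes PSNR using the standard definition, 10 * log10(MAX^2 / MSE), and converts a PSNR gain into the corresponding reduction in mean squared error. It is only an illustration: the function names `psnr` and `mse_ratio_for_gain` are hypothetical, images are assumed to be flat lists of 8-bit pixel values, and the paper's exact evaluation protocol is not given here.

```python
import math


def psnr(reference, test, max_value=255.0):
    """PSNR in dB between two equally sized flat lists of pixel values."""
    if len(reference) != len(test):
        raise ValueError("inputs must have the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain in dB."""
    return 10.0 ** (gain_db / 10.0)


if __name__ == "__main__":
    print(mse_ratio_for_gain(3.47))  # average gain quoted above
    print(mse_ratio_for_gain(9.51))  # gain quoted for strong compression
```

Under this definition, a 3.47 dB gain corresponds to an MSE roughly 2.2 times smaller, and a 9.51 dB gain to an MSE roughly 8.9 times smaller, assuming both methods are measured against the same reference.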

JPEG remains one of the most widespread lossy image coding methods. However, the non-differentiable nature of JPEG restricts the application in deep learning pipelines. Several differentiable approximations of JPEG have recently been proposed to address this issue. This paper conducts a comprehensive review of existing diff. JPEG approaches and identifies critical details that have been missed by previous methods. To this end, we propose a novel diff. JPEG approach, overcoming previous limitations. Our approach is differentiable w.r.t. the input image, the JPEG quality, the quantization tables, and the color conversion parameters. We evaluate the forward and backward performance of our diff. JPEG approach against existing methods. Additionally, extensive ablations are performed to evaluate crucial design choices. Our proposed diff. JPEG resembles the (non-diff.) reference implementation best, significantly surpassing the recent-best diff. approach by $3.47$dB (PSNR) on average. For strong compression rates, we can even improve PSNR by $9.51$dB. Strong adversarial attack results are yielded by our diff. JPEG, demonstrating the effective gradient approximation. Our code is available at https://github.com/necla-ml/Diff-JPEG.
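The abstract does not spell out where the non-differentiability comes from or how the proposed method handles it. As general background, one well-known source is the rounding step in coefficient quantization, whose derivative is zero almost everywhere. The sketch below contrasts hard rounding with a polynomial surrogate, round(x) + (x - round(x))^3, of the kind used in earlier differentiable JPEG approximations. It is not the method of this paper; the names `soft_round`, `soft_round_grad`, and `soft_quantize` are hypothetical, and values are plain Python floats rather than tensors.

```python
def soft_round(x):
    """Polynomial rounding surrogate: round(x) + (x - round(x))**3."""
    r = round(x)
    return r + (x - r) ** 3


def soft_round_grad(x):
    """Derivative of soft_round away from the rounding boundaries.

    Hard rounding has derivative 0 here; the surrogate gives
    3 * (x - round(x))**2, which is nonzero unless x is an integer.
    """
    r = round(x)
    return 3.0 * (x - r) ** 2


def soft_quantize(coefficient, step):
    """Quantize and dequantize one coefficient with the surrogate."""
    return soft_round(coefficient / step) * step


if __name__ == "__main__":
    for x in (2.0, 2.3, 2.49):
        print(x, round(x), soft_round(x), soft_round_grad(x))
```

The surrogate agrees with hard rounding at integers and deviates most near the rounding boundaries. The abstract reports that details of this kind materially affect how closely a differentiable JPEG matches the reference implementation.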
