CV, LG | Jul 26, 2019

DCT-CompCNN: A Novel Image Classification Network Using JPEG Compressed DCT Coefficients

arXiv:1907.11503v1 | 25 citations
Originality: Incremental advance
AI Analysis

This work addresses a domain-specific problem in image processing, where data is commonly stored in compressed form, but the advance is incremental: it adapts existing CNN methods to a new input format.

The paper tackles the problem of classifying images directly from JPEG compressed DCT coefficients instead of RGB pixels. It reports that the proposed DCT-CompCNN architecture outperforms conventional CNNs on datasets such as Dog Vs Cat and CIFAR-10.

The popularity of Convolutional Neural Networks (CNNs) in image processing and computer vision has motivated researchers and industry experts across the globe to solve a range of challenges with high accuracy. The simplest way to train a CNN classifier is to feed the original RGB pixel images directly into the network. However, if we intend to classify images directly from their compressed data, as with JPEG compressed images, the same approach may not work as well. This paper investigates how to modify the input representation of JPEG compressed data before feeding it into a CNN. The resulting architecture is termed DCT-CompCNN. The approach shows that CNNs can also be trained on JPEG compressed DCT coefficients and can subsequently perform better than the conventional CNN approach. The modified input representation is evaluated with the existing ResNet-50 architecture and the proposed DCT-CompCNN architecture on public image classification datasets such as Dog Vs Cat and CIFAR-10, with better performance reported.
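To make the idea of a DCT-coefficient input concrete, here is a minimal sketch of one possible representation. It is an assumption for illustration, not the paper's stated method: it recomputes the 8x8 block DCT used by JPEG from grayscale pixel values (with the JPEG level shift of 128) and packs each block's 64 coefficients into a channel axis, giving a tensor of shape (H/8, W/8, 64). The helper names `dct_matrix`, `block_dct`, and `image_to_dct_tensor` are hypothetical. Quantization, chroma handling, and entropy decoding are omitted, and a real pipeline would more likely read the coefficients from the JPEG bitstream than recompute them.

```python
import math

N = 8  # JPEG operates on 8x8 blocks


def dct_matrix(n=N):
    """Orthonormal DCT-II basis matrix C, so that coeffs = C * block * C^T."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def matmul(a, b):
    """Plain-Python matrix product of two 2D lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(m):
    return [list(r) for r in zip(*m)]


def block_dct(block, c=None):
    """2D DCT-II of one square block (list of lists)."""
    c = c or dct_matrix(len(block))
    return matmul(matmul(c, block), transpose(c))


def image_to_dct_tensor(image):
    """Convert a grayscale image (2D list, sides multiples of 8) into a
    nested list tensor[block_row][block_col] of 64 DCT coefficients each.

    Assumed layout: coefficients are flattened in row-major order, not
    JPEG zigzag order.
    """
    h, w = len(image), len(image[0])
    if h % N or w % N:
        raise ValueError("image sides must be multiples of 8")
    c = dct_matrix()
    tensor = []
    for by in range(0, h, N):
        row = []
        for bx in range(0, w, N):
            # Level shift by 128, as in baseline JPEG, then transform.
            block = [[image[by + i][bx + j] - 128.0 for j in range(N)]
                     for i in range(N)]
            coeffs = block_dct(block, c)
            row.append([v for r in coeffs for v in r])
        tensor.append(row)
    return tensor
```

Under these assumptions, a 32x32 CIFAR-10 channel would become a 4x4 grid with 64 coefficients per position, which a CNN could consume as a 64-channel input of reduced spatial size.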

