CVMar 7, 2024

Image Coding for Machines with Edge Information Learning Using Segment Anything

arXiv:2403.04173v323 citationsh-index: 6Has CodeICIP
Originality Incremental advance
AI Analysis

This addresses the growing demand for efficient image recognition AI by improving compression for machines, though it appears incremental as it builds on existing LIC and Segment Anything models.

The paper tackles image compression for machine vision by proposing SA-ICM, a method that encodes only edge information using Segment Anything, achieving state-of-the-art performance in image compression for recognition tasks and extending it to video compression with SA-NeRV.

Image Coding for Machines (ICM) is an image compression technique for image recognition. This technique is essential due to the growing demand for image recognition AI. In this paper, we propose a method for ICM that focuses on encoding and decoding only the edge information of object parts in an image, which we call SA-ICM. This is an Learned Image Compression (LIC) model trained using edge information created by Segment Anything. Our method can be used for image recognition models with various tasks. SA-ICM is also robust to changes in input data, making it effective for a variety of use cases. Additionally, our method provides benefits from a privacy point of view, as it removes human facial information on the encoder's side, thus protecting one's privacy. Furthermore, this LIC model training method can be used to train Neural Representations for Videos (NeRV), which is a video compression model. By training NeRV using edge information created by Segment Anything, it is possible to create a NeRV that is effective for image recognition (SA-NeRV). Experimental results confirm the advantages of SA-ICM, presenting the best performance in image compression for image recognition. We also show that SA-NeRV is superior to ordinary NeRV in video compression for machines. Code is available at https://github.com/final-0/SA-ICM.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes