CVNov 19, 2024

CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation

arXiv:2411.12792v22 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the challenge of quantifying image complexity for computer vision applications, offering an incremental improvement by reducing reliance on costly and biased manual annotations.

The paper tackles the problem of accurately assessing image complexity by proposing CLIC, an unsupervised contrastive learning framework that learns complexity-aware features from unlabeled data, achieving performance competitive with supervised methods when fine-tuned with a small labeled dataset.

As a fundamental visual attribute, image complexity significantly influences both human perception and the performance of computer vision models. However, accurately assessing and quantifying image complexity remains a challenging task. (1) Traditional metrics such as information entropy and compression ratio often yield coarse and unreliable estimates. (2) Data-driven methods require expensive manual annotations and are inevitably affected by human subjective biases. To address these issues, we propose CLIC, an unsupervised framework based on Contrastive Learning for learning Image Complexity representations. CLIC learns complexity-aware features from unlabeled data, thereby eliminating the need for costly labeling. Specifically, we design a novel positive and negative sample selection strategy to enhance the discrimination of complexity features. Additionally, we introduce a complexity-aware loss function guided by image priors to further constrain the learning process. Extensive experiments validate the effectiveness of CLIC in capturing image complexity. When fine-tuned with a small number of labeled samples from IC9600, CLIC achieves performance competitive with supervised methods. Moreover, applying CLIC to downstream tasks consistently improves performance. Notably, both the pretraining and application processes of CLIC are free from subjective bias.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes