CV · MM · Jan 17, 2025

CLIP-PCQA: Exploring Subjective-Aligned Vision-Language Modeling for Point Cloud Quality Assessment

arXiv:2501.10071v1 · 12 citations · h-index: 17 · AAAI
Originality: Incremental advance
AI Analysis

This paper addresses the challenge of aligning automated point cloud quality assessment with human subjective evaluation; it is an incremental improvement over existing methods.

The paper tackles No-Reference Point Cloud Quality Assessment (NR-PCQA) by proposing CLIP-PCQA, a language-driven method that matches visual features against discrete quality descriptions and targets an Opinion Score Distribution (OSD) instead of mapping directly to a Mean Opinion Score (MOS). The authors report that it outperforms state-of-the-art approaches in their experiments.

In recent years, No-Reference Point Cloud Quality Assessment (NR-PCQA) research has achieved significant progress. However, existing methods mostly seek a direct mapping function from visual data to the Mean Opinion Score (MOS), which is contradictory to the mechanism of practical subjective evaluation. To address this, we propose a novel language-driven PCQA method named CLIP-PCQA. Considering that human beings prefer to describe visual quality using discrete quality descriptions (e.g., "excellent" and "poor") rather than specific scores, we adopt a retrieval-based mapping strategy to simulate the process of subjective assessment. More specifically, based on the philosophy of CLIP, we calculate the cosine similarity between the visual features and multiple textual features corresponding to different quality descriptions, in which process an effective contrastive loss and learnable prompts are introduced to enhance the feature extraction. Meanwhile, given the personal limitations and bias in subjective experiments, we further convert the feature similarities into probabilities and consider the Opinion Score Distribution (OSD) rather than a single MOS as the final target. Experimental results show that our CLIP-PCQA outperforms other State-Of-The-Art (SOTA) approaches.
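
The retrieval-style mapping described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The five-level vocabulary, the temperature of 0.07, the softmax as the similarity-to-probability step, the random stand-in features, and all function names are assumptions made for illustration; the abstract only names "excellent" and "poor" as example descriptions and does not specify how similarities become probabilities.

```python
import numpy as np

# Hypothetical five-level vocabulary (assumption; the abstract only
# gives "excellent" and "poor" as examples).
QUALITY_LEVELS = ["bad", "poor", "fair", "good", "excellent"]
LEVEL_SCORES = np.array([1.0, 2.0, 3.0, 4.0, 5.0])


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so a dot product equals cosine similarity."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def similarity_to_distribution(visual_feat, text_feats, temperature=0.07):
    """Cosine similarity to each quality description, then a softmax.

    The softmax and the temperature value are assumptions of this sketch.
    """
    v = l2_normalize(visual_feat)
    t = l2_normalize(text_feats)
    cos_sim = t @ v                      # shape: (num_levels,)
    logits = cos_sim / temperature
    logits = logits - logits.max()       # shift for numerical stability
    probs = np.exp(logits)
    return probs / probs.sum()


def ratings_to_osd(ratings, num_levels=5):
    """Build an Opinion Score Distribution from integer ratings in 1..num_levels."""
    counts = np.bincount(np.asarray(ratings) - 1, minlength=num_levels)
    return counts / counts.sum()


def expected_score(probs):
    """Collapse a distribution over levels to a single MOS-like score."""
    return float(probs @ LEVEL_SCORES)


# Random stand-ins for encoder outputs (a real system would use
# CLIP-style image and text encoders here).
rng = np.random.default_rng(0)
text_feats = rng.normal(size=(len(QUALITY_LEVELS), 16))
visual_feat = text_feats[3] + 0.1 * rng.normal(size=16)  # close to "good"

pred = similarity_to_distribution(visual_feat, text_feats)
target = ratings_to_osd([4, 4, 5, 3, 4, 4])
```

Here `pred` is the predicted distribution over quality levels and `target` is the distribution of six hypothetical subject ratings. A training objective would compare the two distributions; the abstract does not say which distance is used, so none is shown.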

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
