CVFeb 11

DMP-3DAD: Cross-Category 3D Anomaly Detection via Realistic Depth Map Projection with Few Normal Samples

Zi Wang, Katsuya Hotta, Koichiro Kamide, Yawen Zou, Jianjian Qin, Chao Zhang, Jun Yu

arXiv:2602.10806v11.5h-index: 18

Originality Highly original

AI Analysis

This addresses the problem of flexible anomaly detection in 3D point clouds for applications like quality control, offering a simple solution without category-specific training.

The paper tackled cross-category 3D anomaly detection with few normal samples by proposing DMP-3DAD, a training-free framework using realistic depth map projection and a frozen CLIP encoder, achieving state-of-the-art performance on the ShapeNetPart dataset.

Cross-category anomaly detection for 3D point clouds aims to determine whether an unseen object belongs to a target category using only a few normal examples. Most existing methods rely on category-specific training, which limits their flexibility in few-shot scenarios. In this paper, we propose DMP-3DAD, a training-free framework for cross-category 3D anomaly detection based on multi-view realistic depth map projection. Specifically, by converting point clouds into a fixed set of realistic depth images, our method leverages a frozen CLIP visual encoder to extract multi-view representations and performs anomaly detection via weighted feature similarity, which does not require any fine-tuning or category-dependent adaptation. Extensive experiments on the ShapeNetPart dataset demonstrate that DMP-3DAD achieves state-of-the-art performance under few-shot setting. The results show that the proposed approach provides a simple yet effective solution for practical cross-category 3D anomaly detection.

View on arXiv PDF

Similar