CV AIAug 28, 2023

RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning

arXiv:2308.14899v17.67 citationsh-index: 36

Originality Incremental advance

AI Analysis

This provides a more rigorous evaluation framework for object-centric learning researchers, though it's incremental as it builds on existing robustness evaluation concepts.

The authors tackled the problem of evaluating robustness in object-centric learning methods by creating the RobustCLEVR benchmark dataset and framework, which uses causal models to generate diverse image corruptions, and found that object-centric methods are not inherently robust to these corruptions, with training on in-distribution corruptions failing to guarantee increased robustness.

Object-centric representation learning offers the potential to overcome limitations of image-level representations by explicitly parsing image scenes into their constituent components. While image-level representations typically lack robustness to natural image corruptions, the robustness of object-centric methods remains largely untested. To address this gap, we present the RobustCLEVR benchmark dataset and evaluation framework. Our framework takes a novel approach to evaluating robustness by enabling the specification of causal dependencies in the image generation process grounded in expert knowledge and capable of producing a wide range of image corruptions unattainable in existing robustness evaluations. Using our framework, we define several causal models of the image corruption process which explicitly encode assumptions about the causal relationships and distributions of each corruption type. We generate dataset variants for each causal model on which we evaluate state-of-the-art object-centric methods. Overall, we find that object-centric methods are not inherently robust to image corruptions. Our causal evaluation approach exposes model sensitivities not observed using conventional evaluation processes, yielding greater insight into robustness differences across algorithms. Lastly, while conventional robustness evaluations view corruptions as out-of-distribution, we use our causal framework to show that even training on in-distribution image corruptions does not guarantee increased model robustness. This work provides a step towards more concrete and substantiated understanding of model performance and deterioration under complex corruption processes of the real-world.

View on arXiv PDF

Similar