CVDec 15, 2025

CoRA: A Collaborative Robust Architecture with Hybrid Fusion for Efficient Perception

Gong Chen, Chaokun Zhang, Pengcheng Lv, Xiaohui Xie

arXiv:2512.13191v111.84 citationsh-index: 2

Originality Incremental advance

AI Analysis

This addresses robustness issues in collaborative perception for autonomous systems, offering an incremental improvement over existing methods.

The paper tackles the problem of performance degradation in collaborative perception under adverse communication conditions by proposing CoRA, a hybrid architecture that combines feature-level and object-level fusion, achieving a 19% improvement in AP@0.7 with over 5x less communication volume in extreme scenarios.

Collaborative perception has garnered significant attention as a crucial technology to overcome the perceptual limitations of single-agent systems. Many state-of-the-art (SOTA) methods have achieved communication efficiency and high performance via intermediate fusion. However, they share a critical vulnerability: their performance degrades under adverse communication conditions due to the misalignment induced by data transmission, which severely hampers their practical deployment. To bridge this gap, we re-examine different fusion paradigms, and recover that the strengths of intermediate and late fusion are not a trade-off, but a complementary pairing. Based on this key insight, we propose CoRA, a novel collaborative robust architecture with a hybrid approach to decouple performance from robustness with low communication. It is composed of two components: a feature-level fusion branch and an object-level correction branch. Its first branch selects critical features and fuses them efficiently to ensure both performance and scalability. The second branch leverages semantic relevance to correct spatial displacements, guaranteeing resilience against pose errors. Experiments demonstrate the superiority of CoRA. Under extreme scenarios, CoRA improves upon its baseline performance by approximately 19% in AP@0.7 with more than 5x less communication volume, which makes it a promising solution for robust collaborative perception.

View on arXiv PDF

Similar