CVSep 26, 2025

Rate-Distortion Optimized Communication for Collaborative Perception

arXiv:2509.21994v1h-index: 7
Originality Highly original
AI Analysis

This addresses the challenge of limited bandwidth in multi-agent systems for applications like autonomous driving, offering a theoretical foundation and practical improvements over prior empirical approaches.

The paper tackles the problem of optimizing communication for collaborative perception among multiple agents by introducing a rate-distortion theory to analyze performance-communication trade-offs, resulting in a framework that achieves state-of-the-art accuracy on benchmarks while reducing communication volume by up to 108 times.

Collaborative perception emphasizes enhancing environmental understanding by enabling multiple agents to share visual information with limited bandwidth resources. While prior work has explored the empirical trade-off between task performance and communication volume, a significant gap remains in the theoretical foundation. To fill this gap, we draw on information theory and introduce a pragmatic rate-distortion theory for multi-agent collaboration, specifically formulated to analyze performance-communication trade-off in goal-oriented multi-agent systems. This theory concretizes two key conditions for designing optimal communication strategies: supplying pragmatically relevant information and transmitting redundancy-less messages. Guided by these two conditions, we propose RDcomm, a communication-efficient collaborative perception framework that introduces two key innovations: i) task entropy discrete coding, which assigns features with task-relevant codeword-lengths to maximize the efficiency in supplying pragmatic information; ii) mutual-information-driven message selection, which utilizes mutual information neural estimation to approach the optimal redundancy-less condition. Experiments on 3D object detection and BEV segmentation demonstrate that RDcomm achieves state-of-the-art accuracy on DAIR-V2X and OPV2V, while reducing communication volume by up to 108 times. The code will be released.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes