CVLGROMay 3

Hybrid Visual Telemetry for Bandwidth-Constrained Robotic Vision: A Pilot Study with HEVC Base Video and JPEG ROI Stills

arXiv:2605.0182617.01 citations
Predicted impact top 92% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For robotic and surveillance systems operating under severe bandwidth constraints, this work provides a foundational methodology for augmenting low-bitrate video with event-driven high-detail stills to enhance object recognition.

This paper formalizes a hybrid visual telemetry scheme for bandwidth-constrained robotic vision, combining a continuous low-bitrate HEVC video stream with selectively transmitted high-detail JPEG stills of regions of interest. The pilot study establishes the paradigm and experimental protocol, showing that hybrid transmission can improve object-level classification refinement under matched total communication budgets.

Bandwidth-constrained robotic and surveillance systems often rely on a single compressed video stream to support both continuous scene awareness and downstream machine perception. In practice, this creates a mismatch: low-bitrate video can preserve motion and coarse context, but often loses the fine local detail needed for reliable object recognition and decision-making. Motivated by a hybrid architecture in which low-resolution video supports dynamic scene understanding while eventdriven high-detail regions of interest (ROIs) support close-up identification and analytics, this paper formalizes a two-channel visual telemetry scheme in which a continuous low-bitrate video stream is augmented by selectively transmitted high-detail still ROIs. This first paper does not attempt to prove the superiority of a new still-image codec. Instead, it establishes the hybrid transmission paradigm itself using a practical and reproducible codec stack: x265/HEVC for the base video stream and JPEG stills for ROI refinement. We formulate the problem as bitrate-constrained information selection for robotic vision and define an experimental protocol in which video-only and hybrid schemes are compared under matched total communication budgets. The study is designed around UAV-oriented datasets, two practical bitrate regimes, several ROI triggering policies, and object-level classification refinement on selectively transmitted ROI stills. The resulting paper lays the methodological foundation for a second-stage investigation of JPEG AI as the semantic still-image channel within the same hybrid architecture.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes