Multi-Stream Switching for Interactive Virtual Reality Video Streaming
This addresses the problem of efficient VR video delivery for users with head-mounted displays, offering a domain-specific incremental improvement.
The paper tackles the challenge of streaming high-quality 360-degree VR video over bandwidth-limited networks by proposing a multi-stream switching framework that pre-encodes streams for different view ranges and switches based on head rotation, resulting in up to 2.9dB PSNR improvement over a single-stream approach.
Virtual reality (VR) video provides an immersive 360 viewing experience to a user wearing a head-mounted display: as the user rotates his head, correspondingly different fields-of-view (FoV) of the 360 video are rendered for observation. Transmitting the entire 360 video in high quality over bandwidth-constrained networks from server to client for real-time playback is challenging. In this paper we propose a multi-stream switching framework for VR video streaming: the server pre-encodes a set of VR video streams covering different view ranges that account for server-client round trip time (RTT) delay, and during streaming the server transmits and switches streams according to a user's detected head rotation angle. For a given RTT, we formulate an optimization to seek multiple VR streams of different view ranges and the head-angle-to-stream mapping function simultaneously, in order to minimize the expected distortion subject to bandwidth and storage constraints. We propose an alternating algorithm that, at each iteration, computes the optimal streams while keeping the mapping function fixed and vice versa. Experiments show that for the same bandwidth, our multi-stream switching scheme outperforms a non-switching single-stream approach by up to 2.9dB in PSNR.