CVDec 7, 2016

Pano2Vid: Automatic Cinematography for Watching 360$^{\circ}$ Videos

arXiv:1612.02335v1136 citations
Originality Incremental advance
AI Analysis

This addresses the challenge for viewers and videographers in navigating 360° videos by automating cinematography, though it is incremental as it builds on existing video analysis techniques.

The paper tackles the problem of automatically generating normal field-of-view videos from 360° videos by directing a virtual camera to select interesting views, and shows that their method produces informative videos comparable to human-edited ones.

We introduce the novel task of Pano2Vid $-$ automatic cinematography in panoramic 360$^{\circ}$ videos. Given a 360$^{\circ}$ video, the goal is to direct an imaginary camera to virtually capture natural-looking normal field-of-view (NFOV) video. By selecting "where to look" within the panorama at each time step, Pano2Vid aims to free both the videographer and the end viewer from the task of determining what to watch. Towards this goal, we first compile a dataset of 360$^{\circ}$ videos downloaded from the web, together with human-edited NFOV camera trajectories to facilitate evaluation. Next, we propose AutoCam, a data-driven approach to solve the Pano2Vid task. AutoCam leverages NFOV web video to discriminatively identify space-time "glimpses" of interest at each time instant, and then uses dynamic programming to select optimal human-like camera trajectories. Through experimental evaluation on multiple newly defined Pano2Vid performance measures against several baselines, we show that our method successfully produces informative videos that could conceivably have been captured by human videographers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes