CVAug 14, 2023

The Devil in the Details: Simple and Effective Optical Flow Synthetic Data Generation

arXiv:2308.07378v11 citationsh-index: 33
Originality Incremental advance
AI Analysis

This addresses the domain gap issue in optical flow for computer vision researchers, but it is incremental as it builds on existing synthetic data methods.

The paper tackles the problem of generating synthetic data for optical flow estimation by showing that simple synthetic data with elementary operations can achieve realism, and it proposes using occlusion masks to suppress gradients in occluded regions. The result is that RAFT trained on their dataset outperforms the original on MPI Sintel and KITTI 2015 benchmarks.

Recent work on dense optical flow has shown significant progress, primarily in a supervised learning manner requiring a large amount of labeled data. Due to the expensiveness of obtaining large scale real-world data, computer graphics are typically leveraged for constructing datasets. However, there is a common belief that synthetic-to-real domain gaps limit generalization to real scenes. In this paper, we show that the required characteristics in an optical flow dataset are rather simple and present a simpler synthetic data generation method that achieves a certain level of realism with compositions of elementary operations. With 2D motion-based datasets, we systematically analyze the simplest yet critical factors for generating synthetic datasets. Furthermore, we propose a novel method of utilizing occlusion masks in a supervised method and observe that suppressing gradients on occluded regions serves as a powerful initial state in the curriculum learning sense. The RAFT network initially trained on our dataset outperforms the original RAFT on the two most challenging online benchmarks, MPI Sintel and KITTI 2015.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes