CVROMar 20, 2023

DIME-Net: Neural Network-Based Dynamic Intrinsic Parameter Rectification for Cameras with Optical Image Stabilization System

arXiv:2303.11307v11 citationsh-index: 26
Originality Incremental advance
AI Analysis

This addresses a specific issue for mobile device camera applications, such as augmented reality and 3D scanning, by enabling accurate pose estimation at native resolution, though it is an incremental improvement over existing calibration methods.

The paper tackles the problem of dynamic intrinsic parameter changes in cameras with Optical Image Stabilization (OIS), which hinders accurate camera pose estimation and 3D reconstruction, by proposing DIME-Net, a neural network that estimates the intrinsic matrix in real-time, reducing reprojection error by at least 64% on mobile devices.

Optical Image Stabilization (OIS) system in mobile devices reduces image blurring by steering lens to compensate for hand jitters. However, OIS changes intrinsic camera parameters (i.e. $\mathrm{K}$ matrix) dynamically which hinders accurate camera pose estimation or 3D reconstruction. Here we propose a novel neural network-based approach that estimates $\mathrm{K}$ matrix in real-time so that pose estimation or scene reconstruction can be run at camera native resolution for the highest accuracy on mobile devices. Our network design takes gratified projection model discrepancy feature and 3D point positions as inputs and employs a Multi-Layer Perceptron (MLP) to approximate $f_{\mathrm{K}}$ manifold. We also design a unique training scheme for this network by introducing a Back propagated PnP (BPnP) layer so that reprojection error can be adopted as the loss function. The training process utilizes precise calibration patterns for capturing accurate $f_{\mathrm{K}}$ manifold but the trained network can be used anywhere. We name the proposed Dynamic Intrinsic Manifold Estimation network as DIME-Net and have it implemented and tested on three different mobile devices. In all cases, DIME-Net can reduce reprojection error by at least $64\%$ indicating that our design is successful.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes