CVAug 6, 2017

Long Short-Term Memory Kalman Filters:Recurrent Neural Estimators for Pose Regularization

arXiv:1708.01885v1184 citations
Originality Highly original
AI Analysis

This addresses the need for more accurate pose regularization in applications like body joint localization and object tracking, representing a novel method for a known bottleneck.

The authors tackled the problem of noisy one-shot pose estimation by learning dynamic motion and noise models with LSTM instead of using predefined models like constant velocity, achieving state-of-the-art performance across three popular computer vision tasks.

One-shot pose estimation for tasks such as body joint localization, camera pose estimation, and object tracking are generally noisy, and temporal filters have been extensively used for regularization. One of the most widely-used methods is the Kalman filter, which is both extremely simple and general. However, Kalman filters require a motion model and measurement model to be specified a priori, which burdens the modeler and simultaneously demands that we use explicit models that are often only crude approximations of reality. For example, in the pose-estimation tasks mentioned above, it is common to use motion models that assume constant velocity or constant acceleration, and we believe that these simplified representations are severely inhibitive. In this work, we propose to instead learn rich, dynamic representations of the motion and noise models. In particular, we propose learning these models from data using long short term memory, which allows representations that depend on all previous observations and all previous states. We evaluate our method using three of the most popular pose estimation tasks in computer vision, and in all cases we obtain state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes