CVMay 18, 2025

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

arXiv:2505.12340v15 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses the problem of accurate state estimation for 3D object tracking in dynamic environments, representing an incremental advance over prior interacting multiple model approaches.

The paper tackles the challenge of 3D object tracking under high maneuverability by proposing DIMM, a framework that decouples motion models across directions and uses a differentiable network for fusion, resulting in a 31.61% to 99.23% improvement in tracking accuracy over existing methods.

State estimation is challenging for 3D object tracking with high maneuverability, as the target's state transition function changes rapidly, irregularly, and is unknown to the estimator. Existing work based on interacting multiple model (IMM) achieves more accurate estimation than single-filter approaches through model combination, aligning appropriate models for different motion modes of the target object over time. However, two limitations of conventional IMM remain unsolved. First, the solution space of the model combination is constrained as the target's diverse kinematic properties in different directions are ignored. Second, the model combination weights calculated by the observation likelihood are not accurate enough due to the measurement uncertainty. In this paper, we propose a novel framework, DIMM, to effectively combine estimates from different motion models in each direction, thus increasing the 3D object tracking accuracy. First, DIMM extends the model combination solution space of conventional IMM from a hyperplane to a hypercube by designing a 3D-decoupled multi-hierarchy filter bank, which describes the target's motion with various-order linear models. Second, DIMM generates more reliable combination weight matrices through a differentiable adaptive fusion network for importance allocation rather than solely relying on the observation likelihood; it contains an attention-based twin delayed deep deterministic policy gradient (TD3) method with a hierarchical reward. Experiments demonstrate that DIMM significantly improves the tracking accuracy of existing state estimation methods by 31.61%~99.23%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes