LG CO MLNov 23, 2024

Learning state and proposal dynamics in state-space models using differentiable particle filters and neural networks

Benjamin Cox, Santiago Segarra, Victor Elvira

arXiv:2411.15638v22.61 citationsh-index: 4Signal Processing

Originality Incremental advance

AI Analysis

This addresses the challenge of accurate inference in sequential data analysis for applications like signal processing or time-series forecasting, representing a novel hybrid approach rather than a foundational breakthrough.

The authors tackled the problem of performing inference in non-linear state-space models by introducing StateMixNN, a method that uses neural networks to learn proposal and transition distributions as Gaussian mixtures, which significantly improves hidden state recovery compared to state-of-the-art methods, especially in highly non-linear scenarios.

State-space models are a popular statistical framework for analysing sequential data. Within this framework, particle filters are often used to perform inference on non-linear state-space models. We introduce a new method, StateMixNN, that uses a pair of neural networks to learn the proposal distribution and transition distribution of a particle filter. Both distributions are approximated using multivariate Gaussian mixtures. The component means and covariances of these mixtures are learnt as outputs of learned functions. Our method is trained targeting the log-likelihood, thereby requiring only the observation series, and combines the interpretability of state-space models with the flexibility and approximation power of artificial neural networks. The proposed method significantly improves recovery of the hidden state in comparison with the state-of-the-art, showing greater improvement in highly non-linear scenarios.

View on arXiv PDF

Similar