LGAug 10, 2025

Parity Requires Unified Input Dependence and Negative Eigenvalues in SSMs

arXiv:2508.07395v11 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses a foundational limitation in efficient SSMs for sequence modeling, though it is incremental as it builds on prior theoretical work.

The paper tackled the problem of state-tracking capability in SSMs, showing that combining input-independent and non-negative SSMs fails to solve parity tasks, requiring both input dependence and negative eigenvalues.

Recent work has shown that LRNN models such as S4D, Mamba, and DeltaNet lack state-tracking capability due to either time-invariant transition matrices or restricted eigenvalue ranges. To address this, input-dependent transition matrices, particularly those that are complex or non-triangular, have been proposed to enhance SSM performance on such tasks. While existing theorems demonstrate that both input-independent and non-negative SSMs are incapable of solving simple state-tracking tasks, such as parity, regardless of depth, they do not explore whether combining these two types in a multilayer SSM could help. We investigate this question for efficient SSMs with diagonal transition matrices and show that such combinations still fail to solve parity. This implies that a recurrence layer must both be input-dependent and include negative eigenvalues. Our experiments support this conclusion by analyzing an SSM model that combines S4D and Mamba layers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes