LGMay 31, 2021

ACE-NODE: Attentive Co-Evolving Neural Ordinary Differential Equations

arXiv:2105.14953v121 citations
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in NODEs for machine learning practitioners, offering an incremental enhancement to improve performance and stability.

The authors tackled the limitations of Neural Ordinary Differential Equations (NODEs), such as learning only homeomorphic mappings and numerical instability, by introducing ACE-NODE, a method that integrates attention into NODEs through dual co-evolving networks. Their approach outperformed existing baselines by non-trivial margins in experiments.

Neural ordinary differential equations (NODEs) presented a new paradigm to construct (continuous-time) neural networks. While showing several good characteristics in terms of the number of parameters and the flexibility in constructing neural networks, they also have a couple of well-known limitations: i) theoretically NODEs learn homeomorphic mapping functions only, and ii) sometimes NODEs show numerical instability in solving integral problems. To handle this, many enhancements have been proposed. To our knowledge, however, integrating attention into NODEs has been overlooked for a while. To this end, we present a novel method of attentive dual co-evolving NODE (ACE-NODE): one main NODE for a downstream machine learning task and the other for providing attention to the main NODE. Our ACE-NODE supports both pairwise and elementwise attention. In our experiments, our method outperforms existing NODE-based and non-NODE-based baselines in almost all cases by non-trivial margins.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes