LG AI NCJul 2, 2023

On efficient computation in active inference

Aswin Paul, Noor Sajid, Lancelot Da Costa, Adeel Razi

arXiv:2307.00504v115.519 citationsh-index: 42Has Code

Originality Incremental advance

AI Analysis

This work addresses computational bottlenecks in active inference for researchers and practitioners in AI and neuroscience, offering incremental improvements to enhance scalability and usability in complex environments.

The paper tackled the computational cost and target distribution specification challenges in active inference by introducing a novel planning algorithm with drastically lower complexity and a simplified method for setting target distributions, achieving orders of magnitude improvement in computational efficiency and enabling precise planning even with only final goal states.

Despite being recognized as neurobiologically plausible, active inference faces difficulties when employed to simulate intelligent behaviour in complex environments due to its computational cost and the difficulty of specifying an appropriate target distribution for the agent. This paper introduces two solutions that work in concert to address these limitations. First, we present a novel planning algorithm for finite temporal horizons with drastically lower computational complexity. Second, inspired by Z-learning from control theory literature, we simplify the process of setting an appropriate target distribution for new and existing active inference planning schemes. Our first approach leverages the dynamic programming algorithm, known for its computational efficiency, to minimize the cost function used in planning through the Bellman-optimality principle. Accordingly, our algorithm recursively assesses the expected free energy of actions in the reverse temporal order. This improves computational efficiency by orders of magnitude and allows precise model learning and planning, even under uncertain conditions. Our method simplifies the planning process and shows meaningful behaviour even when specifying only the agent's final goal state. The proposed solutions make defining a target distribution from a goal state straightforward compared to the more complicated task of defining a temporally informed target distribution. The effectiveness of these methods is tested and demonstrated through simulations in standard grid-world tasks. These advances create new opportunities for various applications.

View on arXiv PDF Code

Similar