AI CL Feb 1, 2018

Adaptive Memory Networks

arXiv:1802.00510v1 5.65 citations

Originality: Incremental advance

AI Analysis

This work addresses efficiency challenges in QA systems, offering a domain-specific improvement with incremental advancements in dynamic network architectures.

The paper tackles the problem of reducing inference times in Question Answering tasks by introducing Adaptive Memory Networks, which dynamically construct hierarchical memory banks based on input complexity, resulting in variable-depth networks that trade off accuracy for performance.

We present Adaptive Memory Networks (AMN), which process input-question pairs to dynamically construct a network architecture optimized for lower inference times on Question Answering (QA) tasks. AMN processes the input story to extract entities and stores them in memory banks. Starting from a single bank, AMN learns to create new banks as the number of input entities grows and the entropy in a single bank becomes too high. Hence, after processing an input-question pair, the resulting network represents a hierarchical structure in which entities are stored in different banks, distanced by question relevance. At inference, one or a few banks are used, creating a tradeoff between accuracy and performance. AMN is enabled by dynamic networks, which allow input-dependent network creation and efficient dynamic mini-batching, and by our novel bank controller, which allows learning discrete decision making with high accuracy. In our results, we demonstrate that AMN learns to create variable-depth networks depending on task complexity and reduces inference times for QA tasks.
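The bank-creation idea in the abstract can be illustrated with a small toy sketch. It is not the paper's method: AMN learns its bank controller, whereas this example assumes fixed per-entity relevance scores and a hand-set entropy threshold. The names `entropy`, `build_banks`, `select_banks`, and `max_entropy` are hypothetical and chosen only for illustration.

```python
import math


def entropy(scores):
    """Shannon entropy (in nats) of scores normalized to a distribution."""
    total = sum(scores)
    if total <= 0:
        return 0.0
    probs = [s / total for s in scores if s > 0]
    return -sum(p * math.log(p) for p in probs)


def build_banks(entities, max_entropy):
    """Group (name, relevance) pairs into banks, most relevant bank first.

    Assumption for this sketch: when a bank's entropy exceeds max_entropy,
    its least relevant entity is demoted to the next bank, and a new bank
    is created if none exists yet.
    """
    banks = [[]]
    for name, relevance in entities:
        banks[0].append((name, relevance))
        i = 0
        while i < len(banks):
            bank = banks[i]
            while len(bank) > 1 and entropy([r for _, r in bank]) > max_entropy:
                bank.sort(key=lambda e: e[1], reverse=True)
                demoted = bank.pop()  # least relevant entity in this bank
                if i + 1 == len(banks):
                    banks.append([])  # create a new bank on demand
                banks[i + 1].append(demoted)
            i += 1
    return banks


def select_banks(banks, k):
    """Return the entities from the first k banks, as used at inference."""
    return [entity for bank in banks[:k] for entity in bank]


story_entities = [("john", 0.9), ("kitchen", 0.8), ("apple", 0.1), ("garden", 0.1)]
banks = build_banks(story_entities, max_entropy=0.7)
top_bank_entities = select_banks(banks, k=1)
```

Reading only the first bank (`k=1`) touches fewer entities and is cheaper, at the risk of missing an entity that was demoted. This mirrors the accuracy-versus-performance tradeoff the abstract describes, under the simplifying assumptions stated above.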
