CL AIJan 23, 2018

Analyzing Language Learned by an Active Question Answering Agent

Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang

arXiv:1801.07537v11.05 citations

Originality Synthesis-oriented

AI Analysis

This provides insights into machine-machine communication in QA systems, but the findings are incremental as they reveal expected behaviors rather than novel solutions.

The paper analyzes the language learned by an ActiveQA agent that uses reinforcement learning to reformulate questions for a black-box QA system, finding that it discovers classical information retrieval techniques like tf-idf re-weighting and stemming rather than semantic transformations.

We analyze the language learned by an agent trained with reinforcement learning as a component of the ActiveQA system [Buck et al., 2017]. In ActiveQA, question answering is framed as a reinforcement learning task in which an agent sits between the user and a black box question-answering system. The agent learns to reformulate the user's questions to elicit the optimal answers. It probes the system with many versions of a question that are generated via a sequence-to-sequence question reformulation model, then aggregates the returned evidence to find the best answer. This process is an instance of \emph{machine-machine} communication. The question reformulation model must adapt its language to increase the quality of the answers returned, matching the language of the question answering system. We find that the agent does not learn transformations that align with semantic intuitions but discovers through learning classical information retrieval techniques such as tf-idf re-weighting and stemming.

View on arXiv PDF

Similar