DUMA: Reading Comprehension with Transposition Thinking
This work addresses multi-choice reading comprehension for AI systems, offering an incremental improvement in matching network design.
The authors tackled multi-choice machine reading comprehension by proposing a dual multi-head co-attention model inspired by human transposition thinking, which boosted pre-trained language models to achieve new state-of-the-art performance on DREAM and RACE benchmarks.
Multi-choice Machine Reading Comprehension (MRC) requires model to decide the correct answer from a set of answer options when given a passage and a question. Thus in addition to a powerful Pre-trained Language Model (PrLM) as encoder, multi-choice MRC especially relies on a matching network design which is supposed to effectively capture the relationships among the triplet of passage, question and answers. While the newer and more powerful PrLMs have shown their mightiness even without the support from a matching network, we propose a new DUal Multi-head Co-Attention (DUMA) model, which is inspired by human's transposition thinking process solving the multi-choice MRC problem: respectively considering each other's focus from the standpoint of passage and question. The proposed DUMA has been shown effective and is capable of generally promoting PrLMs. Our proposed method is evaluated on two benchmark multi-choice MRC tasks, DREAM and RACE, showing that in terms of powerful PrLMs, DUMA can still boost the model to reach new state-of-the-art performance.