CLNov 12, 2017

Fast Reading Comprehension with ConvNets

Felix Wu, Ni Lao, John Blitzer, Guandao Yang, Kilian Weinberger

arXiv:1711.04352v11.64 citations

Originality Incremental advance

AI Analysis

This addresses latency issues in deploying reading comprehension models, particularly for longer texts, though it is incremental as it adapts an existing method to a known bottleneck.

The paper tackles the bottleneck of slow reading comprehension models by replacing recurrent neural nets with a convolutional architecture, achieving comparable accuracy on two question answering tasks while speeding up inference by up to two orders of magnitude.

State-of-the-art deep reading comprehension models are dominated by recurrent neural nets. Their sequential nature is a natural fit for language, but it also precludes parallelization within an instances and often becomes the bottleneck for deploying such models to latency critical scenarios. This is particularly problematic for longer texts. Here we present a convolutional architecture as an alternative to these recurrent architectures. Using simple dilated convolutional units in place of recurrent ones, we achieve results comparable to the state of the art on two question answering tasks, while at the same time achieving up to two orders of magnitude speedups for question answering.

View on arXiv PDF

Similar