IR · Jul 26, 2017

An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures

arXiv:1707.08275v1 · 7 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses the challenges of integrating neural reranking models into search systems, but its contribution is incremental: it focuses on implementation details rather than novel algorithmic advances.

The paper explores different approaches to integrating a convolutional neural network (CNN) with the Lucene search engine in multi-stage ranking architectures. It finds that feedforward evaluation in Java is significantly slower, and that a compiled C++ network does not consistently outperform the PyTorch implementation in latency and throughput.

We explore different approaches to integrating a simple convolutional neural network (CNN) with the Lucene search engine in a multi-stage ranking architecture. Our models are trained using the PyTorch deep learning toolkit, which is implemented in C/C++ with a Python frontend. One obvious integration strategy is to expose the neural network directly as a service. For this, we use Apache Thrift, a software framework for building scalable cross-language services. In exploring alternative architectures, we observe that once trained, the feedforward evaluation of neural networks is quite straightforward. Therefore, we can extract the parameters of a trained CNN from PyTorch and import the model into Java, taking advantage of the Java Deeplearning4J library for feedforward evaluation. This has the advantage that the entire end-to-end system can be implemented in Java. As a third approach, we can extract the neural network from PyTorch and "compile" it into a C++ program that exposes a Thrift service. We evaluate these alternatives in terms of performance (latency and throughput) as well as ease of integration. Experiments show that feedforward evaluation of the convolutional neural network is significantly slower in Java, while the performance of the compiled C++ network does not consistently beat the PyTorch implementation.
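The abstract's observation that feedforward evaluation is straightforward once a network is trained can be illustrated with a minimal sketch. The code below is not the paper's architecture or code: it assumes a toy CNN (a few convolution filters, ReLU, max-over-time pooling, a linear layer, and a sigmoid) whose parameters have already been exported from the training toolkit into plain Python lists. The names `params`, `forward`, and `rerank`, and the parameter layout, are hypothetical and chosen only for illustration.

```python
import math


def conv1d_valid(x, kernel, bias):
    """Slide one filter over a sequence of embedding vectors, without padding.

    x is a list of T vectors of dimension D; kernel is a list of W vectors of
    dimension D. Returns T - W + 1 activations (assumes T >= W).
    """
    width = len(kernel)
    activations = []
    for start in range(len(x) - width + 1):
        total = bias
        for offset in range(width):
            total += sum(a * b for a, b in zip(x[start + offset], kernel[offset]))
        activations.append(total)
    return activations


def forward(params, x):
    """Score one input: conv filters -> ReLU -> max-over-time -> linear -> sigmoid."""
    pooled = []
    for kernel, bias in zip(params["conv_kernels"], params["conv_biases"]):
        activations = conv1d_valid(x, kernel, bias)
        pooled.append(max(max(0.0, a) for a in activations))
    logit = params["out_bias"] + sum(
        w * p for w, p in zip(params["out_weights"], pooled)
    )
    return 1.0 / (1.0 + math.exp(-logit))


def rerank(params, candidates):
    """Reorder (doc_id, features) pairs from a first-stage ranker by model score."""
    scored = [(forward(params, x), doc_id) for doc_id, x in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in scored]


# Hypothetical exported parameters: one filter of width 2 over 2-dimensional inputs.
params = {
    "conv_kernels": [[[1.0, 0.0], [0.0, 1.0]]],
    "conv_biases": [0.0],
    "out_weights": [1.0],
    "out_bias": -9.0,
}

candidates = [
    ("b", [[0.0, 0.0], [0.0, 0.0]]),
    ("a", [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]),
]
ranking = rerank(params, candidates)
```

Because inference here needs only multiplications, additions, and a maximum, the same arithmetic can be ported to any host language once the weights are exported, which is the idea behind the Java and C++ variants described in the abstract. A pure-Python loop like this one is meant to show the logic, not to be fast.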

