IRCLJun 13, 2020

Mining Implicit Relevance Feedback from User Behavior for Web Question Answering

arXiv:2006.07581v29 citations
AI Analysis

This work addresses the high labeling cost for training QA systems in commercial search engines, particularly benefiting low-resource languages, and is incremental as it adapts existing user behavior models from web documents to passages.

The paper tackles the problem of reducing the need for large amounts of labeled training data in web-scale Question Answering systems by mining implicit relevance feedback from user behavior logs, showing that their approach significantly improves passage ranking accuracy without extra human labels and reduces labeling costs, especially for low-resource languages.

Training and refreshing a web-scale Question Answering (QA) system for a multi-lingual commercial search engine often requires a huge amount of training examples. One principled idea is to mine implicit relevance feedback from user behavior recorded in search engine logs. All previous works on mining implicit relevance feedback target at relevance of web documents rather than passages. Due to several unique characteristics of QA tasks, the existing user behavior models for web documents cannot be applied to infer passage relevance. In this paper, we make the first study to explore the correlation between user behavior and passage relevance, and propose a novel approach for mining training data for Web QA. We conduct extensive experiments on four test datasets and the results show our approach significantly improves the accuracy of passage ranking without extra human labeled data. In practice, this work has proved effective to substantially reduce the human labeling cost for the QA service in a global commercial search engine, especially for languages with low resources. Our techniques have been deployed in multi-language services.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes