IR · CL · LG · Mar 21, 2024

M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval

arXiv:2403.14074v181 citations · h-index: 11 · Has Code · LREC
Originality: Incremental advance
AI Analysis

This work addresses a specific challenge in open-domain multi-hop dense retrieval for fact verification, representing an incremental improvement over existing methods.

The paper tackles the problem of suboptimal retrieval performance in dense sentence retrieval by introducing M3, a multi-task mixed-objective learning framework, which achieves state-of-the-art results on the FEVER benchmark dataset.

In recent research, contrastive learning has proven to be a highly effective method for representation learning and is widely used for dense retrieval. However, we identify that relying solely on contrastive learning can lead to suboptimal retrieval performance. On the other hand, despite many retrieval datasets supporting various learning objectives beyond contrastive learning, combining them efficiently in multi-task learning scenarios can be challenging. In this paper, we introduce M3, an advanced recursive Multi-hop dense sentence retrieval system built upon a novel Multi-task Mixed-objective approach for dense text representation learning, addressing the aforementioned challenges. Our approach yields state-of-the-art performance on a large-scale open-domain fact verification benchmark dataset, FEVER. Code and data are available at: https://github.com/TonyBY/M3
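The idea of mixing a contrastive objective with a second learning objective can be illustrated with a minimal sketch. This is not the M3 implementation: the function names (`info_nce_loss`, `binary_cross_entropy`, `mixed_objective`), the choice of a binary classification loss as the auxiliary objective, the temperature, and the linear weighting scheme are all assumptions made for illustration. The repository linked above has the authors' actual formulation.

```python
import math


def info_nce_loss(query, positive, negatives, temperature=0.05):
    """Contrastive (InfoNCE-style) loss for one query embedding.

    Hypothetical helper: scores are plain dot products, and the loss is
    the negative log-softmax of the positive against all candidates.
    """
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    logits = [dot(query, positive) / temperature]
    logits += [dot(query, neg) / temperature for neg in negatives]

    # Log-sum-exp with max subtraction for numerical stability.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_z - logits[0]


def binary_cross_entropy(score, label):
    """Auxiliary objective (assumed): is this sentence relevant evidence?"""
    p = 1.0 / (1.0 + math.exp(-score))
    eps = 1e-12  # guards against log(0)
    return -(label * math.log(p + eps) + (1 - label) * math.log(1 - p + eps))


def mixed_objective(contrastive, auxiliary, weight=0.5):
    """Linear mix of the two losses; the weighting scheme is an assumption."""
    return (1 - weight) * contrastive + weight * auxiliary


# Toy usage with 2-dimensional embeddings.
query = [1.0, 0.0]
positive = [1.0, 0.0]
negatives = [[0.0, 1.0]]

l_con = info_nce_loss(query, positive, negatives, temperature=1.0)
l_aux = binary_cross_entropy(score=0.0, label=1)
total = mixed_objective(l_con, l_aux, weight=0.5)
```

In a real system both losses would be computed over batches of encoder outputs and backpropagated through a shared encoder. The sketch only shows how a single training signal can combine more than one objective.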
