CLSep 12, 2017

Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields

arXiv:1709.03637v21086 citations
Originality Highly original
AI Analysis

This addresses a bottleneck in NLP tasks where CRFs are used, offering a novel extension to improve performance without excessive computational cost.

The paper tackled the limitation of conditional random fields (CRFs) in capturing long-range dependencies by integrating external memory, resulting in substantial improvements over strong baselines in two tasks.

Despite successful applications across a broad range of NLP tasks, conditional random fields ("CRFs"), in particular the linear-chain variant, are only able to model local features. While this has important benefits in terms of inference tractability, it limits the ability of the model to capture long-range dependencies between items. Attempts to extend CRFs to capture long-range dependencies have largely come at the cost of computational complexity and approximate inference. In this work, we propose an extension to CRFs by integrating external memory, taking inspiration from memory networks, thereby allowing CRFs to incorporate information far beyond neighbouring steps. Experiments across two tasks show substantial improvements over strong CRF and LSTM baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes