CLAILGNov 29, 2023

Filtered Semi-Markov CRF

arXiv:2311.18028v1132 citationsh-index: 18Has Code
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in sequence labeling for NLP researchers, offering an incremental improvement over existing methods.

The paper tackles the inefficiency and performance issues of Semi-Markov CRFs for text segmentation tasks like Named Entity Recognition by introducing a filtering step to eliminate irrelevant segments, resulting in improved performance and speed on NER benchmarks.

Semi-Markov CRF has been proposed as an alternative to the traditional Linear Chain CRF for text segmentation tasks such as Named Entity Recognition (NER). Unlike CRF, which treats text segmentation as token-level prediction, Semi-CRF considers segments as the basic unit, making it more expressive. However, Semi-CRF suffers from two major drawbacks: (1) quadratic complexity over sequence length, as it operates on every span of the input sequence, and (2) inferior performance compared to CRF for sequence labeling tasks like NER. In this paper, we introduce Filtered Semi-Markov CRF, a variant of Semi-CRF that addresses these issues by incorporating a filtering step to eliminate irrelevant segments, reducing complexity and search space. Our approach is evaluated on several NER benchmarks, where it outperforms both CRF and Semi-CRF while being significantly faster. The implementation of our method is available on \href{https://github.com/urchade/Filtered-Semi-Markov-CRF}{Github}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes