LG AI · Mar 29, 2015

Towards Easier and Faster Sequence Labeling for Natural Language Processing: A Search-based Probabilistic Online Learning Framework (SAPO)

Xu Sun, Shuming Ma, Yi Zhang, Xuancheng Ren

arXiv:1503.08381v43.97 citations · Has Code

Originality: Incremental advance

AI Analysis

This work addresses slow training and the lack of search-based optimization in probabilistic sequence labeling methods, a problem relevant to NLP practitioners, though it appears incremental because it combines elements of existing approaches.

The paper tackles the trade-off between accuracy and training speed in sequence labeling by proposing a search-based probabilistic online learning framework (SAPO) that achieves better accuracy than CRF and BiLSTM while offering fast training and convergence guarantees.

There are two major approaches to sequence labeling. One is probabilistic gradient-based methods such as conditional random fields (CRF) and neural networks (e.g., RNN), which have high accuracy but two drawbacks: slow training and no support for search-based optimization (which is important in many cases). The other is search-based learning methods such as the structured perceptron and the margin infused relaxed algorithm (MIRA), which train fast but also have drawbacks: low accuracy, no probabilistic information, and non-convergence in real-world tasks. We propose a novel and "easy" solution, a search-based probabilistic online learning method, to address most of those issues. The method is "easy" because the optimization algorithm at the training stage is as simple as the decoding algorithm at the test stage. This method searches the output candidates, derives probabilities, and conducts efficient online learning. We show that this method, which is easy to implement, offers fast training and a theoretical guarantee of convergence, supports search-based optimization, and obtains top accuracy. Experiments on well-known tasks show that our method has better accuracy than CRF and BiLSTM. The SAPO code is released at https://github.com/lancopku/SAPO.
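The search, probability, and online-update loop described in the abstract can be illustrated with a toy sketch. This is not the released SAPO code. It assumes a linear model over hypothetical emission and transition features, replaces the n-best search with brute-force enumeration (only feasible for tiny inputs), normalizes scores with a softmax over the retrieved candidates plus the gold sequence, and takes a plain gradient step on the gold log-probability. The names `features`, `top_n`, `update`, `train`, and `decode`, and the values of `n` and `lr`, are all illustrative assumptions.

```python
import itertools
import math
from collections import defaultdict

LABELS = ["A", "B"]  # hypothetical toy tag set


def features(words, tags):
    """Count emission (word, tag) and transition (prev_tag, tag) features."""
    feats = defaultdict(float)
    prev = "<s>"
    for word, tag in zip(words, tags):
        feats[("emit", word, tag)] += 1.0
        feats[("trans", prev, tag)] += 1.0
        prev = tag
    return feats


def score(weights, feats):
    """Linear score: dot product of weights and feature counts."""
    return sum(weights.get(k, 0.0) * v for k, v in feats.items())


def top_n(weights, words, n):
    """Stand-in for n-best search: enumerate every tag sequence, keep the n best."""
    cands = itertools.product(LABELS, repeat=len(words))
    ranked = sorted(cands, key=lambda t: score(weights, features(words, t)), reverse=True)
    return ranked[:n]


def update(weights, words, gold, n=3, lr=0.5):
    """One online step; returns the gold probability computed before the step."""
    gold = tuple(gold)
    cands = top_n(weights, words, n)
    if gold not in cands:
        cands.append(gold)  # assumption: the gold sequence always joins the candidate set
    feats = [features(words, c) for c in cands]
    scores = [score(weights, f) for f in feats]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    probs = [e / z for e in exps]
    # Gradient of log p(gold): gold features minus expected features over candidates.
    for k, v in features(words, gold).items():
        weights[k] += lr * v
    for p, f in zip(probs, feats):
        for k, v in f.items():
            weights[k] -= lr * p * v
    return probs[cands.index(gold)]


def train(data, epochs=20):
    weights = defaultdict(float)
    for _ in range(epochs):
        for words, gold in data:
            update(weights, words, gold)
    return weights


def decode(weights, words):
    """Test-time decoding reuses the same search with n = 1."""
    return list(top_n(weights, words, 1)[0])
```

In this sketch training and decoding share the `top_n` routine, which mirrors the abstract's point that the training-stage optimization is as simple as test-stage decoding. A real implementation would swap the enumeration for an n-best Viterbi or beam search.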
