CL AISep 10, 2023

Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler

Zhijun Chen, Hailong Sun, Wanhao Zhang, Chunyi Xu, Qianren Mao, Pengpeng Chen

arXiv:2309.05086v21.77 citationsh-index: 9Has Code

Originality Incremental advance

AI Analysis

It addresses sequence labeling with weak supervision, a common challenge in NLP, but is incremental as it builds on existing graphical model and neural network techniques.

The paper tackles weakly-supervised sequence labeling by proposing Neural-Hidden-CRF, a neuralized undirected graphical model that integrates BERT for contextual semantics and a hidden CRF layer for label dependencies, achieving new state-of-the-art results with improvements of up to 2.80 F1 points over prior models.

We propose a neuralized undirected graphical model called Neural-Hidden-CRF to solve the weakly-supervised sequence labeling problem. Under the umbrella of probabilistic undirected graph theory, the proposed Neural-Hidden-CRF embedded with a hidden CRF layer models the variables of word sequence, latent ground truth sequence, and weak label sequence with the global perspective that undirected graphical models particularly enjoy. In Neural-Hidden-CRF, we can capitalize on the powerful language model BERT or other deep models to provide rich contextual semantic knowledge to the latent ground truth sequence, and use the hidden CRF layer to capture the internal label dependencies. Neural-Hidden-CRF is conceptually simple and empirically powerful. It obtains new state-of-the-art results on one crowdsourcing benchmark and three weak-supervision benchmarks, including outperforming the recent advanced model CHMM by 2.80 F1 points and 2.23 F1 points in average generalization and inference performance, respectively.

View on arXiv PDF Code

Similar