CLSep 9, 2019

Pretrained Language Models for Sequential Sentence Classification

arXiv:1909.04054v21035 citations
AI Analysis

This addresses document-level understanding for NLP applications, but it is incremental as it adapts existing BERT models to a specific task.

The authors tackled the problem of sequential sentence classification by using pretrained language models, specifically BERT, to capture contextual dependencies without hierarchical encoding or CRFs, achieving state-of-the-art results on four datasets.

As a step toward better document-level understanding, we explore classification of a sequence of sentences into their corresponding categories, a task that requires understanding sentences in context of the document. Recent successful models for this task have used hierarchical models to contextualize sentence representations, and Conditional Random Fields (CRFs) to incorporate dependencies between subsequent labels. In this work, we show that pretrained language models, BERT (Devlin et al., 2018) in particular, can be used for this task to capture contextual dependencies without the need for hierarchical encoding nor a CRF. Specifically, we construct a joint sentence representation that allows BERT Transformer layers to directly utilize contextual information from all words in all sentences. Our approach achieves state-of-the-art results on four datasets, including a new dataset of structured scientific abstracts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes