CLLGNEApr 5, 2015

Discriminative Neural Sentence Modeling by Tree-Based Convolution

arXiv:1504.01106v522.456 citationsh-index: 97
Originality Highly original
AI Analysis

This work addresses the problem of improving sentence understanding for natural language processing tasks, offering a novel method that outperforms existing approaches.

The paper tackled sentence modeling by proposing a tree-based convolutional neural network (TBCNN) that leverages constituency or dependency trees to extract structural features, achieving state-of-the-art results in sentiment analysis and question classification.

This paper proposes a tree-based convolutional neural network (TBCNN) for discriminative sentence modeling. Our models leverage either constituency trees or dependency trees of sentences. The tree-based convolution process extracts sentences' structural features, and these features are aggregated by max pooling. Such architecture allows short propagation paths between the output layer and underlying feature detectors, which enables effective structural feature learning and extraction. We evaluate our models on two tasks: sentiment analysis and question classification. In both experiments, TBCNN outperforms previous state-of-the-art results, including existing neural networks and dedicated feature/rule engineering. We also make efforts to visualize the tree-based convolution process, shedding light on how our models work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes