CL LG NEApr 5, 2015

Discriminative Neural Sentence Modeling by Tree-Based Convolution

Lili Mou, Hao Peng, Ge Li, Yan Xu, Lu Zhang, Zhi Jin

arXiv:1504.01106v522.456 citationsh-index: 97

Originality Highly original

AI Analysis

This work addresses the problem of improving sentence understanding for natural language processing tasks, offering a novel method that outperforms existing approaches.

The paper tackled sentence modeling by proposing a tree-based convolutional neural network (TBCNN) that leverages constituency or dependency trees to extract structural features, achieving state-of-the-art results in sentiment analysis and question classification.

This paper proposes a tree-based convolutional neural network (TBCNN) for discriminative sentence modeling. Our models leverage either constituency trees or dependency trees of sentences. The tree-based convolution process extracts sentences' structural features, and these features are aggregated by max pooling. Such architecture allows short propagation paths between the output layer and underlying feature detectors, which enables effective structural feature learning and extraction. We evaluate our models on two tasks: sentiment analysis and question classification. In both experiments, TBCNN outperforms previous state-of-the-art results, including existing neural networks and dedicated feature/rule engineering. We also make efforts to visualize the tree-based convolution process, shedding light on how our models work.

View on arXiv PDF

Similar