CLLGMLJun 15, 2014

Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

arXiv:1406.3830v1101 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of nuanced semantic representation in NLP and IR, offering a method for document modeling, visualization, and summarization, though it is incremental in building on existing CNN techniques.

The authors tackled the challenge of capturing compositional meaning in documents by introducing a hierarchical convolutional neural network that embeds documents in a low-dimensional vector space while preserving word and sentence order, achieving strong results on document modeling tasks with no feature engineering and a compact model.

Capturing the compositional process which maps the meaning of words to that of documents is a central challenge for researchers in Natural Language Processing and Information Retrieval. We introduce a model that is able to represent the meaning of documents by embedding them in a low dimensional vector space, while preserving distinctions of word and sentence order crucial for capturing nuanced semantics. Our model is based on an extended Dynamic Convolution Neural Network, which learns convolution filters at both the sentence and document level, hierarchically learning to capture and compose low level lexical features into high level semantic concepts. We demonstrate the effectiveness of this model on a range of document modelling tasks, achieving strong results with no feature engineering and with a more compact model. Inspired by recent advances in visualising deep convolution networks for computer vision, we present a novel visualisation technique for our document networks which not only provides insight into their learning process, but also can be interpreted to produce a compelling automatic summarisation system for texts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes