CLIRLGMLJan 11, 2018

Stochastic Learning of Nonstationary Kernels for Natural Language Modeling

arXiv:1801.03911v27 citations
Originality Incremental advance
AI Analysis

This work addresses computational challenges in natural language processing for structured inference tasks, such as extracting biological models from text, but appears incremental as it builds on existing kernel methods.

The authors tackled the problem of customizing and scaling convolution kernels for natural language processing by proposing a nonstationary kernel model and a stochastic learning algorithm using k-nearest neighbor graphs and locality-sensitive hashing. They demonstrated improved performance on extracting biological models from scientific papers, though no concrete numbers were provided.

Natural language processing often involves computations with semantic or syntactic graphs to facilitate sophisticated reasoning based on structural relationships. While convolution kernels provide a powerful tool for comparing graph structure based on node (word) level relationships, they are difficult to customize and can be computationally expensive. We propose a generalization of convolution kernels, with a nonstationary model, for better expressibility of natural languages in supervised settings. For a scalable learning of the parameters introduced with our model, we propose a novel algorithm that leverages stochastic sampling on k-nearest neighbor graphs, along with approximations based on locality-sensitive hashing. We demonstrate the advantages of our approach on a challenging real-world (structured inference) problem of automatically extracting biological models from the text of scientific papers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes