CLSep 26, 2017

Learning to Explain Non-Standard English Words and Phrases

arXiv:1709.09254v11102 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of understanding slang and informal language for applications like natural language processing, though it is incremental as it builds on prior keyword-matching methods.

The paper tackles the problem of automatically explaining new, non-standard English expressions by learning a neural sequence-to-sequence model from a large dataset of crowdsourced examples, achieving reasonable definitions with certain confidence.

We describe a data-driven approach for automatically explaining new, non-standard English expressions in a given sentence, building on a large dataset that includes 15 years of crowdsourced examples from UrbanDictionary.com. Unlike prior studies that focus on matching keywords from a slang dictionary, we investigate the possibility of learning a neural sequence-to-sequence model that generates explanations of unseen non-standard English expressions given context. We propose a dual encoder approach---a word-level encoder learns the representation of context, and a second character-level encoder to learn the hidden representation of the target non-standard expression. Our model can produce reasonable definitions of new non-standard English expressions given their context with certain confidence.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes