CLFeb 3, 2021

A Computational Framework for Slang Generation

arXiv:2102.01826v2650 citations
AI Analysis

This work provides a framework for the automated generation and interpretation of informal language, which is a problem for natural language processing systems dealing with flexible and data-scarce language types.

This paper addresses the challenge of generating slang, a flexible and data-scarce informal language type. The authors developed a computational framework that models speaker word choice in slang contexts, outperforming state-of-the-art language models and accurately predicting historical slang emergence from the 1960s to 2000s.

Slang is a common type of informal language, but its flexible nature and paucity of data resources present challenges for existing natural language systems. We take an initial step toward machine generation of slang by developing a framework that models the speaker's word choice in slang context. Our framework encodes novel slang meaning by relating the conventional and slang senses of a word while incorporating syntactic and contextual knowledge in slang usage. We construct the framework using a combination of probabilistic inference and neural contrastive learning. We perform rigorous evaluations on three slang dictionaries and show that our approach not only outperforms state-of-the-art language models, but also better predicts the historical emergence of slang word usages from 1960s to 2000s. We interpret the proposed models and find that the contrastively learned semantic space is sensitive to the similarities between slang and conventional senses of words. Our work creates opportunities for the automated generation and interpretation of informal language.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes