CL Jan 23, 2019

Context-Sensitive Malicious Spelling Error Correction

Hongyu Gong, Yuchen Li, Suma Bhat, Pramod Viswanath

arXiv:1901.07688v11.528 citations

Originality: Incremental advance

AI Analysis

This work addresses a specific cybersecurity issue for automated content control systems. It is incremental because it extends existing spell-checking methods with a context-aware approach.

The paper tackles malicious misspellings, which degrade the performance of profanity and spam detection. It proposes a context-sensitive correction method based on word embeddings and reports better performance than state-of-the-art spell checkers.

Misspelled words of the malicious kind work by changing specific keywords and are intended to thwart existing automated applications for cyber-environment control such as harassing content detection on the Internet and email spam detection. In this paper, we focus on malicious spelling correction, which requires an approach that relies on the context and the surface forms of targeted keywords. In the context of two applications--profanity detection and email spam detection--we show that malicious misspellings seriously degrade their performance. We then propose a context-sensitive approach for malicious spelling correction using word embeddings and demonstrate its superior performance compared to state-of-the-art spell checkers.
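
To make the idea of combining surface form with context concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the algorithm, so this sketch assumes that candidates are generated by edit distance and ranked by cosine similarity between each candidate's embedding and the average embedding of the surrounding words. The `EMBEDDINGS` table, the `correct` function, and the `max_dist` parameter are all hypothetical and exist only for illustration.

```python
import math

# Toy 2-d "embeddings", invented for illustration only (assumption).
# A real system would load pretrained word vectors instead.
EMBEDDINGS = {
    "free": (0.9, 0.1),
    "money": (0.8, 0.2),
    "offer": (0.85, 0.15),
    "tree": (0.1, 0.9),
    "forest": (0.15, 0.85),
    "green": (0.2, 0.8),
}


def edit_distance(a, b):
    """Levenshtein distance between strings a and b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def context_vector(words):
    """Average embedding of the in-vocabulary context words, or None."""
    vecs = [EMBEDDINGS[w] for w in words if w in EMBEDDINGS]
    if not vecs:
        return None
    return tuple(sum(component) / len(vecs) for component in zip(*vecs))


def correct(token, context, max_dist=1):
    """Return the vocabulary word that best fits both surface form and context.

    Surface form: keep only words within max_dist edits of the token.
    Context: among those, pick the word closest to the context vector.
    The token is returned unchanged if it is already in the vocabulary
    or if no candidate or context embedding is available.
    """
    if token in EMBEDDINGS:
        return token
    candidates = [w for w in EMBEDDINGS if edit_distance(token, w) <= max_dist]
    ctx = context_vector(context)
    if not candidates or ctx is None:
        return token
    return max(candidates, key=lambda w: cosine(EMBEDDINGS[w], ctx))
```

In this sketch the obfuscated token `#ree` is one edit away from both `free` and `tree`, so surface form alone cannot decide between them; the surrounding words break the tie. Calling `correct("#ree", ["money", "offer"])` selects `free`, while `correct("#ree", ["forest", "green"])` selects `tree`.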
