CL, Jan 23, 2019

Context-Sensitive Malicious Spelling Error Correction

arXiv:1901.07688v1, 28 citations
Originality: Incremental advance
AI Analysis

The paper addresses a specific cybersecurity problem for automated content-control systems, but the advance is incremental: it extends existing spell-checking methods with a context-aware approach.

The paper tackles malicious misspellings, which degrade the performance of profanity and spam detection, by proposing a context-sensitive correction method based on word embeddings. The authors report that it outperforms state-of-the-art spell checkers.

Misspelled words of the malicious kind work by changing specific keywords and are intended to thwart existing automated applications for cyber-environment control such as harassing content detection on the Internet and email spam detection. In this paper, we focus on malicious spelling correction, which requires an approach that relies on the context and the surface forms of targeted keywords. In the context of two applications--profanity detection and email spam detection--we show that malicious misspellings seriously degrade their performance. We then propose a context-sensitive approach for malicious spelling correction using word embeddings and demonstrate its superior performance compared to state-of-the-art spell checkers.
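
The abstract names two signals, the surrounding context and the surface form of the targeted keyword, without giving the scoring details here. The following Python sketch shows one way such signals could be combined; it is an illustration under stated assumptions, not the authors' method. The toy vectors, the vocabulary, the function names, the equal weighting, and the 0.5 surface-similarity threshold are all hypothetical, and `difflib.SequenceMatcher` stands in for whatever surface-form measure the paper actually uses.

```python
import math
from difflib import SequenceMatcher

# Toy 3-dimensional "embeddings", invented for illustration only.
# A real system would load pretrained word vectors instead.
TOY_VECTORS = {
    "free":   [0.9, 0.1, 0.0],
    "money":  [0.8, 0.2, 0.1],
    "offer":  [0.9, 0.2, 0.0],
    "win":    [0.8, 0.1, 0.1],
    "tree":   [0.0, 0.1, 0.9],
    "branch": [0.1, 0.0, 0.9],
    "green":  [0.0, 0.2, 0.8],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def surface_similarity(a, b):
    """Character-level similarity in [0, 1] (stand-in surface-form measure)."""
    return SequenceMatcher(None, a, b).ratio()


def context_vector(tokens, skip_index):
    """Average the vectors of all known tokens except the one being corrected."""
    vectors = [
        TOY_VECTORS[t]
        for i, t in enumerate(tokens)
        if i != skip_index and t in TOY_VECTORS
    ]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def correct_token(tokens, index, min_surface=0.5, weight=0.5):
    """Return the vocabulary word that best fits both spelling and context.

    Known words are returned unchanged. Candidates whose surface similarity
    falls below `min_surface` are discarded; the rest are ranked by a weighted
    sum of surface similarity and context similarity (hypothetical scoring).
    """
    word = tokens[index]
    if word in TOY_VECTORS:
        return word
    ctx = context_vector(tokens, index)
    best, best_score = word, 0.0
    for candidate, vec in TOY_VECTORS.items():
        surf = surface_similarity(word, candidate)
        if surf < min_surface:
            continue
        ctx_sim = cosine(ctx, vec) if ctx is not None else 0.0
        score = weight * surf + (1 - weight) * ctx_sim
        if score > best_score:
            best, best_score = candidate, score
    return best


print(correct_token("win fr3e money offer".split(), 1))
print(correct_token("green fr3e branch".split(), 1))
```

On this toy data the same misspelling, "fr3e", resolves to "free" in the spam-like context and to "tree" in the other one. That is the behavior a context-free spell checker cannot provide, since it would map the string to a single correction regardless of the neighboring words.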
