CLOct 25, 2022

A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Nankai Lin, Hongyan Wu, Sihui Fu, Shengyi Jiang, Aimin Yang

arXiv:2210.13823v20.32 citationsh-index: 11

Originality Incremental advance

AI Analysis

This work addresses a specific bottleneck in Chinese spelling check for natural language processing applications, offering an incremental improvement over existing methods.

The paper tackles the problem of distinguishing confusable words in Chinese spelling check by proposing a reverse contrastive learning strategy that minimizes agreement between phonetically and visually similar characters, resulting in a model-agnostic framework that achieves state-of-the-art performance.

Chinese spelling check is a task to detect and correct spelling mistakes in Chinese text. Existing research aims to enhance the text representation and use multi-source information to improve the detection and correction capabilities of models, but does not pay too much attention to improving their ability to distinguish between confusable words. Contrastive learning, whose aim is to minimize the distance in representation space between similar sample pairs, has recently become a dominant technique in natural language processing. Inspired by contrastive learning, we present a novel framework for Chinese spelling checking, which consists of three modules: language representation, spelling check and reverse contrastive learning. Specifically, we propose a reverse contrastive learning strategy, which explicitly forces the model to minimize the agreement between the similar examples, namely, the phonetically and visually confusable characters. Experimental results show that our framework is model-agnostic and could be combined with existing Chinese spelling check models to yield state-of-the-art performance.

View on arXiv PDF

Similar