Character-level Convolutional Network for Text Classification Applied to Chinese Corpus
This work addresses text classification for Chinese-language users, but the contribution is incremental: it adapts an existing method to a new dataset.
The authors tackled Chinese text classification by applying a character-level convolutional neural network to a new large-scale Chinese dataset, finding that the model performed better on the Chinese-character data than on the corresponding pinyin-format data.
This article explores the use of a character-level convolutional neural network for text classification on a Chinese corpus. We constructed a large-scale Chinese-language dataset, and the results show that a character-level convolutional neural network works better on the Chinese corpus than on the corresponding pinyin-format dataset. This is the first time that a character-level convolutional neural network has been applied to the text classification problem.
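To make the comparison concrete, here is a minimal sketch of how the same sentence might be turned into character-level input in the two formats. It is an illustration only, not the authors' pipeline: the `TOY_PINYIN` table, the helper names, and the `max_len` values are all assumptions, and a real system would use a full transliteration tool and a vocabulary built from the whole corpus.

```python
# Hypothetical four-entry pinyin table, for illustration only.
TOY_PINYIN = {"中": "zhong", "文": "wen", "分": "fen", "类": "lei"}


def build_vocab(texts):
    """Map each distinct character to an index; 0 is reserved for padding/unknown."""
    chars = sorted({ch for text in texts for ch in text})
    return {ch: i + 1 for i, ch in enumerate(chars)}


def encode(text, vocab, max_len):
    """Truncate or zero-pad the text to max_len character indices."""
    ids = [vocab.get(ch, 0) for ch in text[:max_len]]
    return ids + [0] * (max_len - len(ids))


def to_pinyin(text):
    """Replace each known character with its toy pinyin, separated by spaces."""
    return " ".join(TOY_PINYIN.get(ch, ch) for ch in text)


chinese = "中文分类"
pinyin = to_pinyin(chinese)

hanzi_vocab = build_vocab([chinese])   # one symbol per Chinese character
pinyin_vocab = build_vocab([pinyin])   # Latin letters plus the space

hanzi_ids = encode(chinese, hanzi_vocab, max_len=8)
pinyin_ids = encode(pinyin, pinyin_vocab, max_len=24)

print(len(chinese), len(pinyin))       # sequence lengths in each format
```

In this toy case the Chinese-character input is a short sequence over a vocabulary that grows with the corpus, while the pinyin input is a longer sequence over a small alphabet. This is the trade-off that the reported comparison between the two formats examines.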