CLAIMay 26, 2017

A WL-SPPIM Semantic Model for Document Classification

arXiv:1706.01758v1
Originality Incremental advance
AI Analysis

This work addresses text classification for researchers, offering an incremental improvement over prior methods.

The paper tackles text classification by proposing a WL-SPPIM semantic model, which shows better classification performance and higher scalability compared to existing methods like LDA, SGNS, and SPPIM on standard datasets such as 20newsgroups, Reuters52, and WebKB.

In this paper, we explore SPPIM-based text classification method, and the experiment reveals that the SPPIM method is equal to or even superior than SGNS method in text classification task on three international and standard text datasets, namely 20newsgroups, Reuters52 and WebKB. Comparing to SGNS, although SPPMI provides a better solution, it is not necessarily better than SGNS in text classification tasks. Based on our analysis, SGNS takes into the consideration of weight calculation during decomposition process, so it has better performance than SPPIM in some standard datasets. Inspired by this, we propose a WL-SPPIM semantic model based on SPPIM model, and experiment shows that WL-SPPIM approach has better classification and higher scalability in the text classification task compared with LDA, SGNS and SPPIM approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes