Topic Model Supervised by Understanding Map
This is an incremental improvement for natural language processing researchers, as it introduces a novel method for topic modeling by incorporating semantic networks.
The authors tackled the problem of topic modeling by proposing a new framework, UM-S-TM, which uses a Semantic Center of Mass (SCOM) inspired by physics to interpret document meaning with both content and a semantic network, and compared it to models like LDA and pLSA.
Inspired by the notion of Center of Mass in physics, an extension called Semantic Center of Mass (SCOM) is proposed, and used to discover the abstract "topic" of a document. The notion is under a framework model called Understanding Map Supervised Topic Model (UM-S-TM). The devising aim of UM-S-TM is to let both the document content and a semantic network -- specifically, Understanding Map -- play a role, in interpreting the meaning of a document. Based on different justifications, three possible methods are devised to discover the SCOM of a document. Some experiments on artificial documents and Understanding Maps are conducted to test their outcomes. In addition, its ability of vectorization of documents and capturing sequential information are tested. We also compared UM-S-TM with probabilistic topic models like Latent Dirichlet Allocation (LDA) and probabilistic Latent Semantic Analysis (pLSA).