CL LG MLAug 19, 2020

HeteGCN: Heterogeneous Graph Convolutional Networks for Text Classification

Rahul Ragesh, Sundararajan Sellamanickam, Arun Iyer, Ram Bairi, Vijay Lingam

arXiv:2008.12842v13.2100 citations

Originality Incremental advance

AI Analysis

This work addresses text classification challenges for researchers and practitioners by offering a more efficient and flexible graph-based method, though it appears incremental as it builds upon and simplifies existing approaches like TextGCN.

The paper tackled the problem of improving graph convolutional networks for text classification by addressing limitations in predictive performance, scalability, and inductive capability of existing methods like PTE and TextGCN, resulting in a proposed HeteGCN approach that reduces model parameters significantly for faster training and improved performance in small labeled training sets.

We consider the problem of learning efficient and inductive graph convolutional networks for text classification with a large number of examples and features. Existing state-of-the-art graph embedding based methods such as predictive text embedding (PTE) and TextGCN have shortcomings in terms of predictive performance, scalability and inductive capability. To address these limitations, we propose a heterogeneous graph convolutional network (HeteGCN) modeling approach that unites the best aspects of PTE and TextGCN together. The main idea is to learn feature embeddings and derive document embeddings using a HeteGCN architecture with different graphs used across layers. We simplify TextGCN by dissecting into several HeteGCN models which (a) helps to study the usefulness of individual models and (b) offers flexibility in fusing learned embeddings from different models. In effect, the number of model parameters is reduced significantly, enabling faster training and improving performance in small labeled training set scenario. Our detailed experimental studies demonstrate the efficacy of the proposed approach.

View on arXiv PDF

Similar