Molecular Graph Contrastive Learning with Line Graph
This addresses label scarcity in molecular property prediction and drug design, offering an incremental improvement over existing contrastive learning approaches.
The paper tackles label scarcity in molecular property prediction by proposing LEMON, a graph contrastive learning method that contrasts molecular graphs with their line graphs to preserve semantics, achieving superior performance compared to state-of-the-art methods.
Trapped by the label scarcity in molecular property prediction and drug design, graph contrastive learning (GCL) came forward. Leading contrastive learning works show two kinds of view generators, that is, random or learnable data corruption and domain knowledge incorporation. While effective, the two ways also lead to molecular semantics altering and limited generalization capability, respectively. To this end, we relate the \textbf{L}in\textbf{E} graph with \textbf{MO}lecular graph co\textbf{N}trastive learning and propose a novel method termed \textit{LEMON}. Specifically, by contrasting the given graph with the corresponding line graph, the graph encoder can freely encode the molecular semantics without omission. Furthermore, we present a new patch with edge attribute fusion and two local contrastive losses enhance information transmission and tackle hard negative samples. Compared with state-of-the-art (SOTA) methods for view generation, superior performance on molecular property prediction suggests the effectiveness of our proposed framework.