CLAIMay 2, 2020

Contrastive Self-Supervised Learning for Commonsense Reasoning

arXiv:2005.00669v11032 citations
Originality Incremental advance
AI Analysis

This addresses commonsense reasoning in NLP, offering a more efficient self-supervised alternative to supervised methods.

The paper tackles pronoun disambiguation and Winograd Schema Challenge problems by proposing a self-supervised method using contrastive learning, achieving performance gains that alleviate limitations of current supervised approaches.

We propose a self-supervised method to solve Pronoun Disambiguation and Winograd Schema Challenge problems. Our approach exploits the characteristic structure of training corpora related to so-called "trigger" words, which are responsible for flipping the answer in pronoun disambiguation. We achieve such commonsense reasoning by constructing pair-wise contrastive auxiliary predictions. To this end, we leverage a mutual exclusive loss regularized by a contrastive margin. Our architecture is based on the recently introduced transformer networks, BERT, that exhibits strong performance on many NLP benchmarks. Empirical results show that our method alleviates the limitation of current supervised approaches for commonsense reasoning. This study opens up avenues for exploiting inexpensive self-supervision to achieve performance gain in commonsense reasoning tasks.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes