AI CLJun 2, 2019

Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation

Cunxiang Wang, Shuailong Liang, Yue Zhang, Xiaonan Li, Tian Gao

arXiv:1906.00363v260.31154 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the need for better evaluation of commonsense reasoning in AI systems, though it is incremental as it builds on existing benchmarks.

The authors tackled the problem of evaluating sense-making capabilities in natural language understanding systems by releasing a benchmark that directly tests if systems can differentiate sensible from non-sensible statements and identify reasons for non-sense, showing different challenges compared to human performance.

Introducing common sense to natural language understanding systems has received increasing research attention. It remains a fundamental question on how to evaluate whether a system has a sense making capability. Existing benchmarks measures commonsense knowledge indirectly and without explanation. In this paper, we release a benchmark to directly test whether a system can differentiate natural language statements that make sense from those that do not make sense. In addition, a system is asked to identify the most crucial reason why a statement does not make sense. We evaluate models trained over large-scale language modeling tasks as well as human performance, showing that there are different challenges for system sense making.

View on arXiv PDF Code

Similar