CL AINov 27, 2022

X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications

Junyuan Shang, Shuohuan Wang, Yu Sun, Yanjun Yu, Yue Zhou, Li Xiang, Guixiu Yang

arXiv:2211.14734v131.7627 citationsh-index: 22

Originality Synthesis-oriented

AI Analysis

This work addresses a domain-specific NLP task for instructional text understanding, with incremental improvements over existing methods.

The paper tackled the problem of identifying plausible clarifications in instructional texts by using a replaced token detection pre-trained model with pattern-aware ensembling, achieving 68.90% accuracy and 0.8070 Spearman's rank correlation, surpassing second place by 2.7 and 2.2 percentage points.

This paper describes our winning system on SemEval 2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts. A replaced token detection pre-trained model is utilized with minorly different task-specific heads for SubTask-A: Multi-class Classification and SubTask-B: Ranking. Incorporating a pattern-aware ensemble method, our system achieves a 68.90% accuracy score and 0.8070 spearman's rank correlation score surpassing the 2nd place with a large margin by 2.7 and 2.2 percent points for SubTask-A and SubTask-B, respectively. Our approach is simple and easy to implement, and we conducted ablation studies and qualitative and quantitative analyses for the working strategies used in our system.

View on arXiv PDF

Similar