X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
This work addresses a domain-specific NLP task for instructional text understanding, with incremental improvements over existing methods.
The paper tackled the problem of identifying plausible clarifications in instructional texts by using a replaced token detection pre-trained model with pattern-aware ensembling, achieving 68.90% accuracy and 0.8070 Spearman's rank correlation, surpassing second place by 2.7 and 2.2 percentage points.
This paper describes our winning system on SemEval 2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts. A replaced token detection pre-trained model is utilized with minorly different task-specific heads for SubTask-A: Multi-class Classification and SubTask-B: Ranking. Incorporating a pattern-aware ensemble method, our system achieves a 68.90% accuracy score and 0.8070 spearman's rank correlation score surpassing the 2nd place with a large margin by 2.7 and 2.2 percent points for SubTask-A and SubTask-B, respectively. Our approach is simple and easy to implement, and we conducted ablation studies and qualitative and quantitative analyses for the working strategies used in our system.