CL LGJun 2, 2021

Unsupervised Out-of-Domain Detection via Pre-trained Transformers

Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng, Caiming Xiong

arXiv:2106.00948v231.8719 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses safety issues in real-world ML applications by enabling out-of-domain detection without task labels, though it is incremental as it builds on pre-trained transformers.

The paper tackles the problem of detecting out-of-domain samples using only unsupervised in-domain data, achieving improved detection ability in general scenarios as validated on two datasets.

Deployed real-world machine learning applications are often subject to uncontrolled and even potentially malicious inputs. Those out-of-domain inputs can lead to unpredictable outputs and sometimes catastrophic safety issues. Prior studies on out-of-domain detection require in-domain task labels and are limited to supervised classification scenarios. Our work tackles the problem of detecting out-of-domain samples with only unsupervised in-domain data. We utilize the latent representations of pre-trained transformers and propose a simple yet effective method to transform features across all layers to construct out-of-domain detectors efficiently. Two domain-specific fine-tuning approaches are further proposed to boost detection accuracy. Our empirical evaluations of related methods on two datasets validate that our method greatly improves out-of-domain detection ability in a more general scenario.

View on arXiv PDF Code

Similar