LG MLJun 6, 2024

Meta-learning for Positive-unlabeled Classification

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

arXiv:2406.03680v14.61 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of insufficient data in PU learning for applications like outlier detection and information retrieval, though it is incremental as it builds on existing PU learning methods.

The authors tackled the problem of positive-unlabeled (PU) classification by proposing a meta-learning method that improves binary classifier performance on unseen target tasks using only PU data, and they empirically demonstrated that it outperforms existing methods on one synthetic and three real-world datasets.

We propose a meta-learning method for positive and unlabeled (PU) classification, which improves the performance of binary classifiers obtained from only PU data in unseen target tasks. PU learning is an important problem since PU data naturally arise in real-world applications such as outlier detection and information retrieval. Existing PU learning methods require many PU data, but sufficient data are often unavailable in practice. The proposed method minimizes the test classification risk after the model is adapted to PU data by using related tasks that consist of positive, negative, and unlabeled data. We formulate the adaptation as an estimation problem of the Bayes optimal classifier, which is an optimal classifier to minimize the classification risk. The proposed method embeds each instance into a task-specific space using neural networks. With the embedded PU data, the Bayes optimal classifier is estimated through density-ratio estimation of PU densities, whose solution is obtained as a closed-form solution. The closed-form solution enables us to efficiently and effectively minimize the test classification risk. We empirically show that the proposed method outperforms existing methods with one synthetic and three real-world datasets.

View on arXiv PDF

Similar