LG MLOct 20, 2020

RDIS: Random Drop Imputation with Self-Training for Incomplete Time Series Data

arXiv:2010.10075v28.527 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of imputing missing values in time-series data for fields like healthcare and meteorology, but it is incremental as it builds on existing imputation models.

The paper tackles the problem of training imputation models for incomplete time series data by proposing RDIS, a method that generates extra missing values through random drops and uses self-training with pseudo values, achieving competitive results on two real-world datasets.

Time-series data with missing values are commonly encountered in many fields, such as healthcare, meteorology, and robotics. The imputation aims to fill the missing values with valid values. Most imputation methods trained the models implicitly because missing values have no ground truth. In this paper, we propose Random Drop Imputation with Self-training (RDIS), a novel training method for time-series data imputation models. In RDIS, we generate extra missing values by applying a random drop on the observed values in incomplete data. We can explicitly train the imputation models by filling in the randomly dropped values. In addition, we adopt self-training with pseudo values to exploit the original missing values. To improve the quality of pseudo values, we set the threshold and filter them by calculating the entropy. To verify the effectiveness of RDIS on the time series imputation, we test RDIS to various imputation models and achieve competitive results on two real-world datasets.

View on arXiv PDF

Similar