IMGASRLGDec 4, 2019

Streaming Classification of Variable Stars

arXiv:1912.02235v112 citations
Originality Incremental advance
AI Analysis

This work addresses the need for real-time, incremental classification systems in astronomy for handling daily data streams from surveys like LSST, representing an incremental improvement over existing offline methods.

The authors tackled the problem of classifying variable stars in streaming data from upcoming surveys like LSST by proposing a streaming probabilistic classification model with incremental features, achieving high classification performance and being an order of magnitude faster than traditional methods.

In the last years, automatic classification of variable stars has received substantial attention. Using machine learning techniques for this task has proven to be quite useful. Typically, machine learning classifiers used for this task require to have a fixed training set, and the training process is performed offline. Upcoming surveys such as the Large Synoptic Survey Telescope (LSST) will generate new observations daily, where an automatic classification system able to create alerts online will be mandatory. A system with those characteristics must be able to update itself incrementally. Unfortunately, after training, most machine learning classifiers do not support the inclusion of new observations in light curves, they need to re-train from scratch. Naively re-training from scratch is not an option in streaming settings, mainly because of the expensive pre-processing routines required to obtain a vector representation of light curves (features) each time we include new observations. In this work, we propose a streaming probabilistic classification model; it uses a set of newly designed features that work incrementally. With this model, we can have a machine learning classifier that updates itself in real time with new observations. To test our approach, we simulate a streaming scenario with light curves from CoRot, OGLE and MACHO catalogs. Results show that our model achieves high classification performance, staying an order of magnitude faster than traditional classification approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes