IMLGJun 25, 2024

A review of unsupervised learning in astronomy

arXiv:2406.17316v130 citations
Originality Synthesis-oriented
AI Analysis

It provides an overview for astronomers, but is incremental as it reviews existing methods without introducing new findings.

This review summarizes popular unsupervised learning methods and their applications in astronomy, focusing on organizing datasets to extract knowledge through techniques like dimensionality reduction and clustering, with recent trends towards self-supervised and semi-supervised approaches.

This review summarizes popular unsupervised learning methods, and gives an overview of their past, current, and future uses in astronomy. Unsupervised learning aims to organise the information content of a dataset, in such a way that knowledge can be extracted. Traditionally this has been achieved through dimensionality reduction techniques that aid the ranking of a dataset, for example through principal component analysis or by using auto-encoders, or simpler visualisation of a high dimensional space, for example through the use of a self organising map. Other desirable properties of unsupervised learning include the identification of clusters, i.e. groups of similar objects, which has traditionally been achieved by the k-means algorithm and more recently through density-based clustering such as HDBSCAN. More recently, complex frameworks have emerged, that chain together dimensionality reduction and clustering methods. However, no dataset is fully unknown. Thus, nowadays a lot of research has been directed towards self-supervised and semi-supervised methods that stand to gain from both supervised and unsupervised learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes