IMGACVLGSep 28, 2014

Combining human and machine learning for morphological analysis of galaxy images

arXiv:1409.7935v145 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for scalable morphological analysis in astronomy, but it is incremental as it builds on existing citizen science methods.

The paper tackles the problem of analyzing large galaxy image databases by using citizen science data to train machine learning systems, showing that performance depends on data quality, which can be improved by using samples with high agreement among citizen scientists.

The increasing importance of digital sky surveys collecting many millions of galaxy images has reinforced the need for robust methods that can perform morphological analysis of large galaxy image databases. Citizen science initiatives such as Galaxy Zoo showed that large datasets of galaxy images can be analyzed effectively by non-scientist volunteers, but since databases generated by robotic telescopes grow much faster than the processing power of any group of citizen scientists, it is clear that computer analysis is required. Here we propose to use citizen science data for training machine learning systems, and show experimental results demonstrating that machine learning systems can be trained with citizen science data. Our findings show that the performance of machine learning depends on the quality of the data, which can be improved by using samples that have a high degree of agreement between the citizen scientists. The source code of the method is publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes