MMJan 27, 2021

An Ultra-Specific Image Dataset for Automated Insect Identification

D. L. Abeywardhana, C. D. Dangalle, Anupiya Nugaliyadde, Yashas Mallawarachchi

arXiv:2101.11463v12.31 citations

Originality Synthesis-oriented

AI Analysis

This provides a new benchmark for classification algorithms in a domain-specific task of insect identification, but it is incremental as it focuses on a narrow dataset without broad methodological advances.

The paper tackles the problem of automated insect identification by introducing an ultra-specific image dataset of tiger beetles, which is limited, imbalanced, and diverse, and evaluates it with classification algorithms, finding that transfer learning models performed well.

Automated identification of insects is a tough task where many challenges like data limitation, imbalanced data count, and background noise needs to be overcome for better performance. This paper describes such an image dataset which consists of a limited, imbalanced number of images regarding six genera of subfamily Cicindelinae (tiger beetles) of order Coleoptera. The diversity of image collection is at a high level as the images were taken from different sources, angles and on different scales. Thus, the salient regions of the images have a large variation. Therefore, one of the main intentions in this process was to get an idea about the image dataset while comparing different unique patterns and features in images. The dataset was evaluated on different classification algorithms including deep learning models based on different approaches to provide a benchmark. The dynamic nature of the dataset poses a challenge to the image classification algorithms. However transfer learning models using softmax classifier performed well on current dataset. The tiger beetle classification can be challenging even to a trained human eye, therefore, this dataset opens a new avenue for the classification algorithms to develop, to identify features which human eyes have not identified.

View on arXiv PDF

Similar