CVAIAug 31, 2022

Temporal Flow Mask Attention for Open-Set Long-Tailed Recognition of Wild Animals in Camera-Trap Images

arXiv:2208.14625v13 citationsh-index: 40
Originality Incremental advance
AI Analysis

This addresses data distribution challenges in wildlife monitoring, but it is incremental as it builds on existing techniques for specific domain issues.

The paper tackles the open-set long-tailed recognition problem in camera-trap images of wild animals by proposing the Temporal Flow Mask Attention Network, which uses optical flow, attention residuals, and meta-embedding to achieve robust performance on a Korean DMZ dataset.

Camera traps, unmanned observation devices, and deep learning-based image recognition systems have greatly reduced human effort in collecting and analyzing wildlife images. However, data collected via above apparatus exhibits 1) long-tailed and 2) open-ended distribution problems. To tackle the open-set long-tailed recognition problem, we propose the Temporal Flow Mask Attention Network that comprises three key building blocks: 1) an optical flow module, 2) an attention residual module, and 3) a meta-embedding classifier. We extract temporal features of sequential frames using the optical flow module and learn informative representation using attention residual blocks. Moreover, we show that applying the meta-embedding technique boosts the performance of the method in open-set long-tailed recognition. We apply this method on a Korean Demilitarized Zone (DMZ) dataset. We conduct extensive experiments, and quantitative and qualitative analyses to prove that our method effectively tackles the open-set long-tailed recognition problem while being robust to unknown classes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes