CVLGIVFeb 1, 2025

A framework for river connectivity classification using temporal image processing and attention based neural networks

arXiv:2502.00474v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work addresses a domain-specific problem for water resource managers and stream ecologists by providing an automated, low-cost alternative to manual image curation, though it is incremental as it builds on existing vision transformer and image processing techniques.

The paper tackles the problem of automating river connectivity classification from trail camera images, which is costly and time-consuming when done manually, and achieves a 90% accuracy on unseen site images, up from a 75% base accuracy.

Measuring the connectivity of water in rivers and streams is essential for effective water resource management. Increased extreme weather events associated with climate change can result in alterations to river and stream connectivity. While traditional stream flow gauges are costly to deploy and limited to large river bodies, trail camera methods are a low-cost and easily deployed alternative to collect hourly data. Image capturing, however requires stream ecologists to manually curate (select and label) tens of thousands of images per year. To improve this workflow, we developed an automated instream trail camera image classification system consisting of three parts: (1) image processing, (2) image augmentation and (3) machine learning. The image preprocessing consists of seven image quality filters, foliage-based luma variance reduction, resizing and bottom-center cropping. Images are balanced using variable amount of generative augmentation using diffusion models and then passed to a machine learning classification model in labeled form. By using the vision transformer architecture and temporal image enhancement in our framework, we are able to increase the 75% base accuracy to 90% for a new unseen site image. We make use of a dataset captured and labeled by staff from the Connecticut Department of Energy and Environmental Protection between 2018-2020. Our results indicate that a combination of temporal image processing and attention-based models are effective at classifying unseen river connectivity images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes