CVMay 29, 2025

MangoLeafViT: Leveraging Lightweight Vision Transformer with Runtime Augmentation for Efficient Mango Leaf Disease Classification

arXiv:2505.23961v17 citationsh-index: 122024 27th International Conference on Computer and Information Technology (ICCIT)
Originality Incremental advance
AI Analysis

This work addresses the problem of computational efficiency for mango disease classification in agriculture, offering a solution compatible with low-end devices, though it is incremental as it builds on existing Vision Transformer methods.

The paper tackled efficient mango leaf disease classification by proposing a lightweight Vision Transformer pipeline with runtime augmentation, achieving 99.43% accuracy on the MangoLeafBD dataset while reducing model size, parameters, and FLOPs.

Ensuring food safety is critical due to its profound impact on public health, economic stability, and global supply chains. Cultivation of Mango, a major agricultural product in several South Asian countries, faces high financial losses due to different diseases, affecting various aspects of the entire supply chain. While deep learning-based methods have been explored for mango leaf disease classification, there remains a gap in designing solutions that are computationally efficient and compatible with low-end devices. In this work, we propose a lightweight Vision Transformer-based pipeline with a self-attention mechanism to classify mango leaf diseases, achieving state-of-the-art performance with minimal computational overhead. Our approach leverages global attention to capture intricate patterns among disease types and incorporates runtime augmentation for enhanced performance. Evaluation on the MangoLeafBD dataset demonstrates a 99.43% accuracy, outperforming existing methods in terms of model size, parameter count, and FLOPs count.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes