LG CV IVApr 18, 2025

MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework

Zhenkai Qin, Feng Zhu, Huan Zeng, Xunyi Nong

arXiv:2504.13574v1h-index: 5

Originality Incremental advance

AI Analysis

This provides a deployable solution for image classification in edge or real-time systems, though it appears incremental as it builds on existing attention mechanisms with optimizations.

The paper tackles the problem of high computational complexity in attention mechanisms for image classification on resource-constrained devices by proposing MAAM, a lightweight multi-agent aggregation module integrated with MindSpore, achieving 87.0% accuracy on CIFAR-10 with 30% training efficiency improvement.

The demand for lightweight models in image classification tasks under resource-constrained environments necessitates a balance between computational efficiency and robust feature representation. Traditional attention mechanisms, despite their strong feature modeling capability, often struggle with high computational complexity and structural rigidity, limiting their applicability in scenarios with limited computational resources (e.g., edge devices or real-time systems). To address this, we propose the Multi-Agent Aggregation Module (MAAM), a lightweight attention architecture integrated with the MindSpore framework. MAAM employs three parallel agent branches with independently parameterized operations to extract heterogeneous features, adaptively fused via learnable scalar weights, and refined through a convolutional compression layer. Leveraging MindSpore's dynamic computational graph and operator fusion, MAAM achieves 87.0% accuracy on the CIFAR-10 dataset, significantly outperforming conventional CNN (58.3%) and MLP (49.6%) models, while improving training efficiency by 30%. Ablation studies confirm the critical role of agent attention (accuracy drops to 32.0% if removed) and compression modules (25.5% if omitted), validating their necessity for maintaining discriminative feature learning. The framework's hardware acceleration capabilities and minimal memory footprint further demonstrate its practicality, offering a deployable solution for image classification in resource-constrained scenarios without compromising accuracy.

View on arXiv PDF

Similar