CVSep 12, 2024

MambaMIC: An Efficient Baseline for Microscopic Image Classification with State Space Models

arXiv:2409.07896v21 citationsh-index: 7Has Code
AI Analysis

This work addresses a domain-specific problem for researchers and practitioners in microscopic image analysis, offering an incremental improvement over existing CNN and Transformer-based methods.

The paper tackles the challenge of balancing global modeling and efficient computation in Microscopic Image Classification (MIC) by proposing MambaMIC, a vision backbone that achieves state-of-the-art performance across five datasets.

In recent years, CNN and Transformer-based methods have made significant progress in Microscopic Image Classification (MIC). However, existing approaches still face the dilemma between global modeling and efficient computation. While the Selective State Space Model (SSM) can simulate long-range dependencies with linear complexity, it still encounters challenges in MIC, such as local pixel forgetting, channel redundancy, and lack of local perception. To address these issues, we propose a simple yet efficient vision backbone for MIC tasks, named MambaMIC. Specifically, we introduce a Local-Global dual-branch aggregation module: the MambaMIC Block, designed to effectively capture and fuse local connectivity and global dependencies. In the local branch, we use local convolutions to capture pixel similarity, mitigating local pixel forgetting and enhancing perception. In the global branch, SSM extracts global dependencies, while Locally Aware Enhanced Filter reduces channel redundancy and local pixel forgetting. Additionally, we design a Feature Modulation Interaction Aggregation Module for deep feature interaction and key feature re-localization. Extensive benchmarking shows that MambaMIC achieves state-of-the-art performance across five datasets. code is available at https://zs1314.github.io/MambaMIC

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes