CVNov 18, 2024

KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling

arXiv:2411.11926v29 citationsh-index: 2
Originality Highly original
AI Analysis

This work addresses the need for accurate segmentation in medical images for applications like robotic surgeries and disease diagnosis, representing an incremental improvement by hybridizing existing methods.

The paper tackled the problem of medical image segmentation by proposing KAN-Mamba FusionNet, which combines Kolmogorov-Arnold Networks and Mamba to capture non-linearities and long-range dependencies, resulting in consistent outperformance of state-of-the-art methods in IoU and F1 scores on three datasets.

Medical image segmentation is essential for applications like robotic surgeries, disease diagnosis, and treatment planning. Recently, various deep-learning models have been proposed to enhance medical image segmentation. One promising approach utilizes Kolmogorov-Arnold Networks (KANs), which better capture non-linearity in input data. However, they are unable to effectively capture long-range dependencies, which are required to accurately segment complex medical images and, by that, improve diagnostic accuracy in clinical settings. Neural networks such as Mamba can handle long-range dependencies. However, they have a limited ability to accurately capture non-linearities in the images as compared to KANs. Thus, we propose a novel architecture, the KAN-Mamba FusionNet, which improves segmentation accuracy by effectively capturing the non-linearities from input and handling long-range dependencies with the newly proposed KAMBA block. We evaluated the proposed KAN-Mamba FusionNet on three distinct medical image segmentation datasets: BUSI, Kvasir-Seg, and GlaS - and found it consistently outperforms state-of-the-art methods in IoU and F1 scores. Further, we examined the effects of various components and assessed their contributions to the overall model performance via ablation studies. The findings highlight the effectiveness of this methodology for reliable medical image segmentation, providing a unique approach to address intricate visual data issues in healthcare.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes