CVAIMay 4, 2024

Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images

arXiv:2405.02686v14 citationsh-index: 10
Originality Incremental advance
AI Analysis

This addresses data scarcity in neuroscience for neuron reconstruction, though it is incremental as it adapts existing methods to a specific domain.

The paper tackles the challenge of 3D neuron segmentation by proposing a training paradigm that uses a 2D Vision Transformer pre-trained on natural images to initialize a 3D model, improving segmentation performance by 8.71% on the BigNeuron benchmark.

Neuron reconstruction, one of the fundamental tasks in neuroscience, rebuilds neuronal morphology from 3D light microscope imaging data. It plays a critical role in analyzing the structure-function relationship of neurons in the nervous system. However, due to the scarcity of neuron datasets and high-quality SWC annotations, it is still challenging to develop robust segmentation methods for single neuron reconstruction. To address this limitation, we aim to distill the consensus knowledge from massive natural image data to aid the segmentation model in learning the complex neuron structures. Specifically, in this work, we propose a novel training paradigm that leverages a 2D Vision Transformer model pre-trained on large-scale natural images to initialize our Transformer-based 3D neuron segmentation model with a tailored 2D-to-3D weight transferring strategy. Our method builds a knowledge sharing connection between the abundant natural and the scarce neuron image domains to improve the 3D neuron segmentation ability in a data-efficiency manner. Evaluated on a popular benchmark, BigNeuron, our method enhances neuron segmentation performance by 8.71% over the model trained from scratch with the same amount of training samples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes