Data Synthesis Improves 3D Myotube Instance Segmentation
For biomedical researchers studying muscle physiology, this work provides a method to perform instance segmentation of myotubes without requiring large annotated datasets, addressing a key bottleneck in the field.
The paper introduces a geometry-driven synthetic data pipeline to generate realistic 3D myotube volumes, enabling a compact 3D U-Net to achieve a mean IPQ of 0.22 on real data, outperforming three established zero-shot segmentation models.
Myotubes are multinucleated muscle fibers serving as key model systems for studying muscle physiology, disease mechanisms, and drug responses. Mechanistic studies and drug screening thereby rely on quantitative morphological readouts such as diameter, length, and branching degree, which in turn require precise three-dimensional instance segmentation. Yet established pretrained biomedical segmentation models fail to generalize to this domain due to the absence of large annotated myotube datasets. We introduce a geometry-driven synthesis pipeline that models individual myotubes via polynomial centerlines, locally varying radii, branching structures, and ellipsoidal end caps derived from real microscopy observations. Synthetic volumes are rendered with realistic noise, optical artifacts, and CycleGAN-based Domain Adaptation (DA). A compact 3D U-Net with self-supervised encoder pretraining, trained exclusively on synthetic data, achieves a mean IPQ of 0.22 on real data, significantly outperforming three established zero-shot segmentation models, demonstrating that biophysics-driven synthesis enables effective instance segmentation in annotation-scarce biomedical domains.