CVMay 12

Few-Shot Synthetic Data Generation with Diffusion Models for Downstream Vision Tasks

arXiv:2605.1189845.5
Predicted impact top 74% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners in safety-critical domains with class imbalance, this provides a simple and scalable method to augment rare classes using few real examples.

The paper proposes a lightweight synthetic data augmentation pipeline using LoRA-adapted diffusion models to generate synthetic samples from as few as 20-50 real images of rare classes. Across chest X-ray pathology classification and industrial surface crack detection, synthetic augmentation consistently improves rare-class recall and F1 compared to training with real data alone.

Class imbalance is a persistent challenge in visual recognition, particularly in safety-critical domains where collecting positive examples is expensive and rare events are inherently underrepresented. We propose a lightweight synthetic data augmentation pipeline that fine-tunes a LoRA adapter on as few as 20-50 real images of a rare class and uses a pretrained diffusion model to generate synthetic samples for training. We systematically vary the synthetic-to-real ratio and evaluate the approach across two structurally different domains: chest X-ray pathology classification (NIH ChestX-ray14) and industrial surface crack detection (Magnetic Tile Defect dataset). All evaluations are performed on held-out sets of real images only. Across both domains, synthetic augmentation consistently improves rare-class recall and F1 compared to training with real data alone. Performance improves with moderate synthetic augmentation and shows diminishing returns as the synthetic ratio increases. These results suggest that LoRA-adapted diffusion models provide a simple and scalable mechanism for augmenting rare classes, enabling effective learning in data-scarce scenarios across heterogeneous visual domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes