SynGen-Vision: Synthetic Data Generation for training industrial vision models
This work addresses the data scarcity problem for predictive maintenance in industry, though the contribution is incremental: it applies existing synthetic data generation methods to a specific domain.
The paper tackles the expensive and time-consuming curation of data for training computer vision models for industrial wear and tear detection. It generates synthetic data using a vision language model and 3D simulation, and the resulting model achieves a mAP50 score of 0.87 for rust detection.
We propose an approach for generating synthetic data to train computer vision (CV) models for industrial wear and tear detection. Wear and tear detection is an important CV problem for predictive maintenance across industries. However, curating data to train such models is expensive and time-consuming because datasets covering different wear and tear scenarios are unavailable. Our approach employs a vision language model together with a 3D simulation and rendering engine to generate synthetic data for varying rust conditions. We evaluate our approach by training a CV model for rust detection on the generated dataset and testing the trained model on real images of rusted industrial objects. The model trained on the synthetic data generated by our approach outperforms the other approaches, with a mAP50 score of 0.87. The approach is customizable and can be easily extended to other industrial wear and tear detection scenarios.
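
For readers unfamiliar with the reported metric, mAP50 is the mean, over object classes, of the average precision (AP) obtained when a predicted box counts as correct only if its intersection over union (IoU) with a ground-truth box is at least 0.5. With a single class such as rust, mAP50 reduces to that class's AP50. The following is a minimal sketch of one common way to compute AP50 for a single class; it is not the authors' evaluation code. The names `iou` and `ap50`, the box layout `(x_min, y_min, x_max, y_max)`, and the greedy matching convention are all assumptions made for illustration, and evaluation toolkits differ in such details.

```python
from typing import List, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def ap50(preds: List[Tuple[int, float, Box]], gts: List[Tuple[int, Box]]) -> float:
    """AP at IoU >= 0.5 for one class.

    preds: (image_id, confidence, box); gts: (image_id, box).
    Hypothetical helper: greedy matching in order of descending confidence.
    """
    if not gts:
        return 0.0
    matched = [False] * len(gts)
    tp_flags = []
    for img, _, box in sorted(preds, key=lambda p: -p[1]):
        best, best_j = 0.0, -1
        for j, (g_img, g_box) in enumerate(gts):
            if g_img != img or matched[j]:
                continue  # other image, or ground truth already claimed
            overlap = iou(box, g_box)
            if overlap > best:
                best, best_j = overlap, j
        if best >= 0.5:
            matched[best_j] = True
            tp_flags.append(1)  # true positive
        else:
            tp_flags.append(0)  # false positive

    # Precision and recall after each prediction in the ranking.
    precisions, recalls, tp = [], [], 0
    for k, flag in enumerate(tp_flags, start=1):
        tp += flag
        precisions.append(tp / k)
        recalls.append(tp / len(gts))

    # Precision envelope: make precision non-increasing as recall grows.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Area under the interpolated precision-recall curve.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap
```

In a multi-class setting, `ap50` would be called once per class and the results averaged to obtain mAP50.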