LGCVOct 1, 2020

Tabular GANs for uneven distribution

arXiv:2010.00638v13 citations
Originality Synthesis-oriented
AI Analysis

This addresses data imbalance issues in tabular machine learning, but it is incremental as it applies existing GAN methods to a specific data type without introducing new techniques.

The paper tackles the problem of uneven data distribution between train and test sets in tabular data by using GANs to generate data that aligns the distributions, showing that this approach can improve model performance as an alternative option.

GANs are well known for success in the realistic image generation. However, they can be applied in tabular data generation as well. We will review and examine some recent papers about tabular GANs in action. We will generate data to make train distribution bring closer to the test. Then compare model performance trained on the initial train dataset, with trained on the train with GAN generated data, also we train the model by sampling train by adversarial training. We show that using GAN might be an option in case of uneven data distribution between train and test data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes