LGCRNov 17, 2025

Beyond One-Size-Fits-All: Neural Networks for Differentially Private Tabular Data Synthesis

arXiv:2511.13893v1h-index: 5Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of generating high-quality synthetic data with privacy guarantees for data analysts, offering significant improvements in speed and accuracy, though it is incremental by integrating existing designs into neural networks.

The paper tackles the challenge of synthesizing differentially private tabular data, particularly for densely correlated datasets where neural networks outperform statistical models, and proposes MargNet, which achieves a 7x speedup on sparse datasets and reduces fidelity error by up to 26% on dense datasets compared to previous methods.

In differentially private (DP) tabular data synthesis, the consensus is that statistical models are better than neural network (NN)-based methods. However, we argue that this conclusion is incomplete and overlooks the challenge of densely correlated datasets, where intricate dependencies can overwhelm statistical models. In such complex scenarios, neural networks are more suitable due to their capacity to fit complex distributions by learning directly from samples. Despite this potential, existing NN-based algorithms still suffer from significant limitations. We therefore propose MargNet, incorporating successful algorithmic designs of statistical models into neural networks. MargNet applies an adaptive marginal selection strategy and trains the neural networks to generate data that conforms to the selected marginals. On sparsely correlated datasets, our approach achieves utility close to the best statistical method while offering an average 7$\times$ speedup over it. More importantly, on densely correlated datasets, MargNet establishes a new state-of-the-art, reducing fidelity error by up to 26\% compared to the previous best. We release our code on GitHub.\footnote{https://github.com/KaiChen9909/margnet}

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes