CRMay 1

Repurposing Image Diffusion Models for Adversarial Synthetic Structured Data: A Case Study of Ground Truth Drift

arXiv:2605.0078810.1
Predicted impact top 81% in CR · last 90 daysOriginality Incremental advance
AI Analysis

This work identifies a new security vulnerability for ML pipelines that rely on synthetic data, showing that attackers can repurpose off-the-shelf image diffusion models to produce adversarial tabular data.

The authors test whether an unmodified Stable Diffusion U-Net can generate adversarial synthetic structured data from the UCI Adult Income dataset by reshaping rows into pseudo-images, achieving successful attacks that induce ground truth drift—the silent reclassification of AI-generated outputs as authentic in provenance-unaware pipelines.

Public image diffusion models are now powerful enough that an attacker without the resources to train a tabular-specific generator may repurpose one off the shelf. This study tests that possibility directly. An unmodified Stable Diffusion U-Net is applied to the UCI Adult Income dataset by reshaping each row into a small single-channel pseudo-image. The architecture's inductive bias toward spatial locality makes feature placement a design variable, and several layouts are tested. However, this is only the beginning of the story, as this paper also draws two philosophical distinctions. One separates statistical from perceptual realism: whether synthetic content holds up to a machine's correlation audits or a human's sensory inspection. The other introduces synthetic evidence as a category alongside synthetic media: AI-generated material whose consumer is a machine in a closed evidentiary pipeline rather than a person in an open information system. An attacker succeeds with synthetic evidence by thinking like the machine that will receive it. And the more the attacker succeeds, the more they can induce ground truth drift: the silent reclassification of AI-generated outputs as authentic when reused in pipelines that do not interrogate their provenance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes