LGAICYMay 11

NaiAD: Initiate Data-Driven Research for LLM Advertising

arXiv:2605.0991827.4
Predicted impact top 12% in LG · last 90 daysOriginality Highly original
AI Analysis

This work provides foundational infrastructure for researchers and practitioners developing LLM-native advertising systems, addressing the lack of data-centric resources in this emerging domain.

NaiAD introduces the first comprehensive dataset for LLM-native advertising with 58,999 ad-embedded responses, enabling decoupled generation and variance-calibrated scoring. Models trained on NaiAD simultaneously improve user and commercial utility, with in-context learning allowing independent control over these objectives.

Reconciling platform revenue with user experience in LLM advertising motivates a data-centric foundation. We introduce NaiAD, the first comprehensive dataset for LLM-native advertising comprising 58,999 carefully constructed ad-embedded responses paired with user queries. NaiAD is organized around theoretically grounded evaluation metrics that separately and comprehensively capture user and commercial utility. To mitigate the dimensional collinearity of aligned LLMs, we propose a decoupled generation pipeline that produces structurally diverse samples, ranging from responses that explicitly disentangle stakeholder utilities to responses that are uniformly strong or weak across dimensions. We further provide score labels calibrated by a Variance-Calibrated Prediction-Powered Inference (VC-PPI) framework, aligning automated scoring with human annotations. Mechanistic analyses reveal that successful ad integration relies on reasoning paths that cluster into four distinct semantic strategies. Models leveraging NaiAD internalize these strategies to simultaneously improve user and commercial utility, while enabling independent control over these distinct objectives via in-context learning. Together, these results position NaiAD as a foundational infrastructure for developing future LLM-native ad systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes