LGApr 19

Towards a Data-Parameter Correspondence for LLMs: A Preliminary Discussion

arXiv:2604.1738455.91 citationsh-index: 1

AI Analysis

For LLM researchers, this provides a theoretical framework to unify and transfer methods across data and parameter optimization communities, but the work is preliminary with no empirical validation.

This paper establishes a unified data-parameter correspondence for LLMs, revealing that data-centric and model-centric optimizations are dual manifestations of the same geometric structure. It identifies three correspondences (geometric, low-rank, security-privacy) and suggests that cooperative optimization may outperform isolated approaches.

Large language model optimization has historically bifurcated into isolated data-centric and model-centric paradigms: the former manipulates involved samples through selection, augmentation, or poisoning, while the latter tunes model weights via masking, quantization, or low-rank adaptation. This paper establishes a unified \emph{data-parameter correspondence} revealing these seemingly disparate operations as dual manifestations of the same geometric structure on the statistical manifold $\mathcal{M}$. Grounded in the Fisher-Rao metric $g_{ij}(θ)$ and Legendre duality between natural ($θ$) and expectation ($η$) parameters, we identify three fundamental correspondences spanning the model lifecycle: 1. Geometric correspondence: data pruning and parameter sparsification equivalently reduce manifold volume via dual coordinate constraints; 2. Low-rank correspondence: in-context learning (ICL) and LoRA adaptation explore identical subspaces on the Grassmannian $\mathcal{G}(r,d)$, with $k$-shot samples geometrically equivalent to rank-$r$ updates; 3. Security-privacy correspondence: adversarial attacks exhibit cooperative amplification between data poisoning and parameter backdoors, whereas protective mechanisms follow cascading attenuation where data compression multiplicatively enhances parameter privacy. Extending from training through post-training compression to inference, this framework provides mathematical formalization for cross-community methodology transfer, demonstrating that cooperative optimization integrating data and parameter modalities may outperform isolated approaches across efficiency, robustness, and privacy dimensions.

View on arXiv PDF

Similar