LG NA ML | Feb 7, 2025

Discrepancies are Virtue: Weak-to-Strong Generalization through Lens of Intrinsic Dimension

Yijun Dong, Yicheng Li, Yunai Li, Jason D. Lee, Qi Lei

arXiv:2502.05075v5 | 17.99 citations | h-index: 7 | ICML

Originality: Incremental advance

AI Analysis

The paper offers theoretical insight for machine learning practitioners who finetune with pseudo-labels, though the advance is incremental because it builds on existing weak-to-strong generalization concepts.

The paper asks why a strong student finetuned on a weak teacher's pseudo-labels often outperforms that teacher. It analyzes the question through intrinsic dimensionality and variance reduction, and shows that discrepancy between the two models' feature subspaces helps: the student inherits the teacher's variance where the subspaces overlap, but in the subspace of discrepancy that variance shrinks by a factor of $\mathrm{dim}(\mathcal{V}_s)/N$, where $N$ is the number of pseudo-labels.

Weak-to-strong (W2S) generalization is a type of finetuning (FT) where a strong (large) student model is trained on pseudo-labels generated by a weak teacher. Surprisingly, W2S FT often outperforms the weak teacher. We seek to understand this phenomenon through the observation that FT often occurs in intrinsically low-dimensional spaces. Leveraging the low intrinsic dimensionality of FT, we analyze W2S in the ridgeless regression setting from a variance reduction perspective. For a strong student-weak teacher pair with sufficiently expressive low-dimensional feature subspaces $\mathcal{V}_s, \mathcal{V}_w$, we provide an exact characterization of the variance that dominates the generalization error of W2S. This unveils a virtue of discrepancy between the strong and weak models in W2S: the variance of the weak teacher is inherited by the strong student in $\mathcal{V}_s \cap \mathcal{V}_w$, while reduced by a factor of $\mathrm{dim}(\mathcal{V}_s)/N$ in the subspace of discrepancy $\mathcal{V}_w \setminus \mathcal{V}_s$ with $N$ pseudo-labels for W2S. Our analysis further casts light on the sample complexities and the scaling of performance gap recovery in W2S. The analysis is supported by experiments on synthetic regression problems, as well as real vision and NLP tasks.
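The variance claim in the abstract can be checked numerically with a small simulation. The sketch below is illustrative only and is not the authors' experimental setup. It assumes isotropic Gaussian features, axis-aligned subspaces standing in for $\mathcal{V}_s$ and $\mathcal{V}_w$, a target that lies in $\mathcal{V}_s \cap \mathcal{V}_w$, and ordinary least squares as the ridgeless estimator. The function name `simulate_w2s` and all of its parameters are hypothetical.

```python
import numpy as np


def simulate_w2s(d_s=10, d_w=20, d_overlap=5, n=100, N=2000,
                 sigma=1.0, trials=200, seed=0):
    """Average excess risks of a weak teacher and its W2S student.

    Assumptions (not from the paper): isotropic Gaussian features,
    coordinate subspaces, and a target inside the intersection.
    """
    rng = np.random.default_rng(seed)
    D = d_s + d_w - d_overlap            # ambient dimension
    lo = d_s - d_overlap                 # first shared coordinate
    S = np.arange(d_s)                   # student subspace V_s
    W = np.arange(lo, D)                 # teacher subspace V_w
    theta_star = np.zeros(D)
    theta_star[lo:d_s] = 1.0 / np.sqrt(d_overlap)  # target in V_s ∩ V_w

    teacher, student, predicted = [], [], []
    for _ in range(trials):
        # Weak teacher: least squares on n noisy labels, restricted to V_w.
        X = rng.standard_normal((n, D))
        y = X @ theta_star + sigma * rng.standard_normal(n)
        theta_w = np.zeros(D)
        theta_w[W] = np.linalg.lstsq(X[:, W], y, rcond=None)[0]

        # Strong student: least squares on N pseudo-labels, restricted to V_s.
        X_u = rng.standard_normal((N, D))
        pseudo = X_u @ theta_w
        theta_s = np.zeros(D)
        theta_s[S] = np.linalg.lstsq(X_u[:, S], pseudo, rcond=None)[0]

        # With identity feature covariance, excess risk is a squared
        # parameter error.
        err_w = theta_w - theta_star
        err_s = theta_s - theta_star
        teacher.append(err_w @ err_w)
        student.append(err_s @ err_s)

        # Teacher error inherited in V_s ∩ V_w, scaled by d_s / N in V_w \ V_s.
        shared, discrepancy = err_w[lo:d_s], err_w[d_s:]
        predicted.append(shared @ shared
                         + (d_s / N) * (discrepancy @ discrepancy))

    return (float(np.mean(teacher)), float(np.mean(student)),
            float(np.mean(predicted)))


if __name__ == "__main__":
    t, s, p = simulate_w2s()
    print(f"teacher risk {t:.4f}, student risk {s:.4f}, predicted {p:.4f}")
```

Under these assumptions the student's average excess risk should fall well below the teacher's and track the predicted value closely. The gap widens as a larger share of the teacher's subspace lies outside $\mathcal{V}_s$, or as $N$ grows relative to $\mathrm{dim}(\mathcal{V}_s)$.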
