LGAIApr 28

Knowledge Distillation Must Account for What It Loses

arXiv:2604.2511018.4
Predicted impact top 20% in LG · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners deploying distilled models from large frontier systems, the paper highlights that current evaluation hides critical losses in reliability and safety, urging a more comprehensive assessment framework.

This position paper argues that knowledge distillation must evaluate student models not only by retained task scores but also by whether they preserve teacher capabilities such as uncertainty, boundary behavior, safety, and diversity. It proposes a taxonomy of off-metric losses and a Distillation Loss Statement for accountable reporting.

This position paper argues that knowledge distillation must account for what it loses: student models should be judged not only by retained task scores, but by whether they preserve the teacher capabilities that make those scores reliable. This matters because distillation is increasingly used to turn large, often frontier models into deployable systems, yet headline metrics can hide losses in uncertainty, boundary behavior, process reliability, on-policy stability, grounding, privacy, safety, and diversity. We identify the retention assumption behind current evaluation and reframe distillation as a lossy projection of teacher behavior rather than a faithful copy. We then synthesize existing evidence into a taxonomy of off-metric distillation losses, showing that these losses are concrete, recurring, and measurable. To make the position actionable, we propose scenario-specific preservation targets and a Distillation Loss Statement that reports what was preserved, what was lost, and why the remaining losses are acceptable. The goal is not lossless distillation, but accountable distillation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes