LG · AI · Jun 2

Bayes-Sufficient Representations in Supervised Learning

arXiv:2606.04045 · 21.9

Predicted impact: top 81% in LG · last 90 days
Originality: Incremental advance

AI Analysis

For machine learning practitioners, this paper provides a theoretical framework for identifying and learning loss-dependent minimal sufficient representations, which could improve efficiency and interpretability in supervised learning.

This work defines Bayes-sufficient representations for supervised learning, showing that the minimal information required for Bayes-optimal prediction is determined by the loss function and data distribution via a Bayes quotient. Experiments on synthetic and real data (iNaturalist) illustrate the trade-offs between sufficiency, minimality, and retained non-required information.

Representation learning is often described as preserving the information in an input that is relevant for prediction. This work asks what relevance means for a fixed supervised decision problem. A representation is defined to be Bayes-sufficient for a joint distribution and loss if some prediction head can use it to implement a Bayes-optimal action rule. This makes the target information loss-dependent. In the almost-surely unique Bayes-action case, the relevant object is a Bayes quotient, which identifies inputs that require the same Bayes-optimal action. A representation is sufficient when it refines this quotient, and Bayes-minimal when it is informationally equivalent to it. The framework connects naturally to property elicitation: zero-one loss requires the Bayes class, squared loss the conditional mean, Brier loss the conditional probability in binary prediction, and log loss or strictly proper scoring rules the predictive distribution. Controlled finite experiments, learned neural bottleneck experiments, and a real-data iNaturalist taxonomic refinement experiment illustrate the distinction between sufficiency, minimality, and retained non-required information. For a fixed supervised problem, the distribution and the loss determine the Bayes action, the Bayes action determines the quotient, and the quotient determines the minimal information required for Bayes-optimal prediction.
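The chain described above (distribution and loss, then Bayes action, then quotient, then minimal information) can be illustrated on a small finite problem. The following is a minimal sketch, not the paper's code: the toy conditional distribution `cond`, the action grid used for squared loss, and all function names are assumptions made for illustration. It also assumes every input has positive probability and that the Bayes action is unique for each input.

```python
from collections import defaultdict

def bayes_actions(cond, loss, actions):
    """Map each input x to the action minimizing expected loss under p(y|x).

    Assumes the minimizer is unique for every x; ties are not handled.
    """
    out = {}
    for x, p_y in cond.items():
        risk = {a: sum(p * loss(a, y) for y, p in p_y.items()) for a in actions}
        out[x] = min(risk, key=risk.get)
    return out

def partition(labeling):
    """Group the keys of `labeling` into cells that share the same value."""
    cells = defaultdict(set)
    for x, v in labeling.items():
        cells[v].add(x)
    return {frozenset(c) for c in cells.values()}

def is_sufficient(rep, action_of):
    """Sufficient: the representation refines the Bayes quotient, i.e.
    inputs with the same code always share the same Bayes action."""
    return all(len({action_of[x] for x in cell}) == 1 for cell in partition(rep))

def is_minimal(rep, action_of):
    """Bayes-minimal: the representation induces exactly the quotient partition."""
    return partition(rep) == partition(action_of)

# Hypothetical toy problem: four inputs, binary label, p(y | x) given directly.
cond = {
    0: {0: 0.9, 1: 0.1},
    1: {0: 0.6, 1: 0.4},
    2: {0: 0.3, 1: 0.7},
    3: {0: 0.1, 1: 0.9},
}

zero_one = lambda a, y: float(a != y)
squared = lambda a, y: (a - y) ** 2

act_01 = bayes_actions(cond, zero_one, actions=[0, 1])
# Squared loss elicits the conditional mean; a 0.1-spaced grid stands in
# for the real line here because every conditional mean lies on the grid.
act_sq = bayes_actions(cond, squared, actions=[k / 10 for k in range(11)])

identity = {x: x for x in cond}                 # keeps everything
coarse = {0: "lo", 1: "lo", 2: "hi", 3: "hi"}   # keeps only the Bayes class
mixed = {0: "a", 1: "b", 2: "b", 3: "c"}        # merges inputs 1 and 2

print("zero-one quotient:", sorted(map(sorted, partition(act_01))))
print("squared quotient: ", sorted(map(sorted, partition(act_sq))))
for name, rep in [("identity", identity), ("coarse", coarse), ("mixed", mixed)]:
    print(name,
          "| 0-1 sufficient:", is_sufficient(rep, act_01),
          "| 0-1 minimal:", is_minimal(rep, act_01),
          "| squared sufficient:", is_sufficient(rep, act_sq))
```

In this toy setting the same representation can be Bayes-minimal for one loss and insufficient for another: `coarse` matches the zero-one quotient exactly but merges inputs whose conditional means differ, so it is not sufficient for squared loss. The `identity` representation is sufficient for both losses but retains non-required information under zero-one loss.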

