LGMLJul 8, 2023

On Regularization and Inference with Label Constraints

arXiv:2307.03886v16 citationsh-index: 98
Originality Synthesis-oriented
AI Analysis

This work addresses how to effectively incorporate prior knowledge into structured prediction, which is incremental as it builds on existing strategies.

The paper tackles the problem of encoding label constraints in machine learning by comparing regularization and constrained inference, showing that regularization narrows the generalization gap but introduces bias, while constrained inference reduces population risk by correcting violations.

Prior knowledge and symbolic rules in machine learning are often expressed in the form of label constraints, especially in structured prediction problems. In this work, we compare two common strategies for encoding label constraints in a machine learning pipeline, regularization with constraints and constrained inference, by quantifying their impact on model performance. For regularization, we show that it narrows the generalization gap by precluding models that are inconsistent with the constraints. However, its preference for small violations introduces a bias toward a suboptimal model. For constrained inference, we show that it reduces the population risk by correcting a model's violation, and hence turns the violation into an advantage. Given these differences, we further explore the use of two approaches together and propose conditions for constrained inference to compensate for the bias introduced by regularization, aiming to improve both the model complexity and optimal risk.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes