LGAIGTOCMLSep 11, 2018

Optimization with Non-Differentiable Constraints with Applications to Fairness, Recall, Churn, and Other Goals

arXiv:1809.04198v1183 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of enforcing diverse policy goals like fairness and recall in machine learning models, though it is an incremental improvement over existing constrained optimization methods.

The paper tackles the problem of training non-convex models with non-differentiable constraints, such as fairness metrics, by introducing a proxy-Lagrangian formulation and algorithm that produces a stochastic classifier with theoretical guarantees for approximate feasibility and optimality.

We show that many machine learning goals, such as improved fairness metrics, can be expressed as constraints on the model's predictions, which we call rate constraints. We study the problem of training non-convex models subject to these rate constraints (or any non-convex and non-differentiable constraints). In the non-convex setting, the standard approach of Lagrange multipliers may fail. Furthermore, if the constraints are non-differentiable, then one cannot optimize the Lagrangian with gradient-based methods. To solve these issues, we introduce the proxy-Lagrangian formulation. This new formulation leads to an algorithm that produces a stochastic classifier by playing a two-player non-zero-sum game solving for what we call a semi-coarse correlated equilibrium, which in turn corresponds to an approximately optimal and feasible solution to the constrained optimization problem. We then give a procedure which shrinks the randomized solution down to one that is a mixture of at most $m+1$ deterministic solutions, given $m$ constraints. This culminates in algorithms that can solve non-convex constrained optimization problems with possibly non-differentiable and non-convex constraints with theoretical guarantees. We provide extensive experimental results enforcing a wide range of policy goals including different fairness metrics, and other goals on accuracy, coverage, recall, and churn.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes