LGOCMLOct 28, 2019

Differentiable Convex Optimization Layers

arXiv:1910.12430v1858 citations
Originality Incremental advance
AI Analysis

This lowers the barrier for using convex optimization in differentiable programs, benefiting researchers and practitioners in machine learning and control, though it is incremental as it builds on existing differentiable optimization frameworks.

The paper tackles the rigidity of existing differentiable optimization layers by proposing a method to differentiate through disciplined convex programs, enabling end-to-end analytical differentiation and implementing it in CVXPY, PyTorch, and TensorFlow, with competitive execution times compared to specialized solvers.

Recent work has shown how to embed differentiable optimization problems (that is, problems whose solutions can be backpropagated through) as layers within deep learning architectures. This method provides a useful inductive bias for certain problems, but existing software for differentiable optimization layers is rigid and difficult to apply to new settings. In this paper, we propose an approach to differentiating through disciplined convex programs, a subclass of convex optimization problems used by domain-specific languages (DSLs) for convex optimization. We introduce disciplined parametrized programming, a subset of disciplined convex programming, and we show that every disciplined parametrized program can be represented as the composition of an affine map from parameters to problem data, a solver, and an affine map from the solver's solution to a solution of the original problem (a new form we refer to as affine-solver-affine form). We then demonstrate how to efficiently differentiate through each of these components, allowing for end-to-end analytical differentiation through the entire convex program. We implement our methodology in version 1.1 of CVXPY, a popular Python-embedded DSL for convex optimization, and additionally implement differentiable layers for disciplined convex programs in PyTorch and TensorFlow 2.0. Our implementation significantly lowers the barrier to using convex optimization problems in differentiable programs. We present applications in linear machine learning models and in stochastic control, and we show that our layer is competitive (in execution time) compared to specialized differentiable solvers from past work.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes