ML LG OCJul 27, 2022

Learning with Combinatorial Optimization Layers: a Probabilistic Approach

Guillaume Dalle, Léo Baty, Louis Bouvier, Axel Parmentier

arXiv:2207.13513v221.247 citationsh-index: 10Has Code

Originality Incremental advance

AI Analysis

This work addresses a practical bottleneck for researchers and practitioners in ML/operations research by providing a unified implementation framework for differentiable combinatorial optimization layers.

The paper tackles the challenge of integrating combinatorial optimization layers into machine learning pipelines by introducing a probabilistic perspective that enables approximate differentiation and structured losses, resulting in an open-source Julia package (InferOpt.jl) that works with arbitrary optimization algorithms and is demonstrated on four applications including pathfinding on video game maps.

Combinatorial optimization (CO) layers in machine learning (ML) pipelines are a powerful tool to tackle data-driven decision tasks, but they come with two main challenges. First, the solution of a CO problem often behaves as a piecewise constant function of its objective parameters. Given that ML pipelines are typically trained using stochastic gradient descent, the absence of slope information is very detrimental. Second, standard ML losses do not work well in combinatorial settings. A growing body of research addresses these challenges through diverse methods. Unfortunately, the lack of well-maintained implementations slows down the adoption of CO layers. In this paper, building upon previous works, we introduce a probabilistic perspective on CO layers, which lends itself naturally to approximate differentiation and the construction of structured losses. We recover many approaches from the literature as special cases, and we also derive new ones. Based on this unifying perspective, we present InferOpt.jl, an open-source Julia package that 1) allows turning any CO oracle with a linear objective into a differentiable layer, and 2) defines adequate losses to train pipelines containing such layers. Our library works with arbitrary optimization algorithms, and it is fully compatible with Julia's ML ecosystem. We demonstrate its abilities using a pathfinding problem on video game maps as guiding example, as well as three other applications from operations research.

View on arXiv PDF Code

Similar