AILGMAMar 1, 2023

Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks

Harvard
arXiv:2303.00799v14 citationsh-index: 17
Originality Highly original
AI Analysis

This work addresses fairness and resource allocation in real-world applications like machine repair and patrol scheduling, offering a novel extension to existing restless bandit models.

The paper tackles the problem of intervention planning with resource constraints by introducing a multi-worker restless bandit setting with heterogeneous workers, aiming to maximize reward while ensuring fairness in workload distribution. The result is a new index-based policy that significantly improves fairness without substantial loss in reward, as demonstrated in evaluations with various cost structures.

Motivated by applications such as machine repair, project monitoring, and anti-poaching patrol scheduling, we study intervention planning of stochastic processes under resource constraints. This planning problem has previously been modeled as restless multi-armed bandits (RMAB), where each arm is an intervention-dependent Markov Decision Process. However, the existing literature assumes all intervention resources belong to a single uniform pool, limiting their applicability to real-world settings where interventions are carried out by a set of workers, each with their own costs, budgets, and intervention effects. In this work, we consider a novel RMAB setting, called multi-worker restless bandits (MWRMAB) with heterogeneous workers. The goal is to plan an intervention schedule that maximizes the expected reward while satisfying budget constraints on each worker as well as fairness in terms of the load assigned to each worker. Our contributions are two-fold: (1) we provide a multi-worker extension of the Whittle index to tackle heterogeneous costs and per-worker budget and (2) we develop an index-based scheduling policy to achieve fairness. Further, we evaluate our method on various cost structures and show that our method significantly outperforms other baselines in terms of fairness without sacrificing much in reward accumulated.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes