LGMay 29

Constrained Multi-Objective Reinforcement Learning with Max-Min Criterion

arXiv:2605.3138852.1
AI Analysis

This work addresses the problem of integrating fairness (max-min criterion) and constraint satisfaction in multi-objective reinforcement learning, which is important for applications requiring both balanced performance and adherence to safety or operational limits.

This paper proposes a multi-objective reinforcement learning framework that combines the max-min criterion with explicit constraint satisfaction. The method was validated through convergence analysis and experiments in tabular settings, and demonstrated practical relevance in simulated building thermal control, multi-objective locomotion control, and greenhouse-gas-emission-aware traffic management.

Multi-Objective Reinforcement Learning (MORL) extends standard RL by optimizing policies with respect to multiple, often conflicting, objectives. While max-min MORL has emerged as an effective approach for promoting fairness, its applicability remains limited, particularly when constraints must be incorporated. In this paper, we propose a MORL framework that integrates the max-min criterion with explicit constraint satisfaction. We establish a theoretical foundation for the proposed framework and validate the resulting algorithm through convergence analysis and experiments in tabular settings. We further demonstrate the practical relevance of our approach in simulated building thermal control, multi-objective locomotion control, and greenhouse-gas-emission-aware traffic management. Across these domains, our method effectively balances fairness and constraint satisfaction in multi-objective decision-making.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes