AIJul 26, 2025

Reinforcement Learning for Multi-Objective Multi-Echelon Supply Chain Optimisation

arXiv:2507.19788v11 citationsh-index: 2
Originality Incremental advance
AI Analysis

It addresses supply chain optimization for industries needing to manage trade-offs between competing objectives, though it is incremental as it builds on existing multi-objective RL methods.

This study tackled multi-objective supply chain optimization by developing a reinforcement learning model that balances economic, environmental, and social goals, achieving up to 75% higher hypervolume and eleven times denser solutions than benchmark methods.

This study develops a generalised multi-objective, multi-echelon supply chain optimisation model with non-stationary markets based on a Markov decision process, incorporating economic, environmental, and social considerations. The model is evaluated using a multi-objective reinforcement learning (RL) method, benchmarked against an originally single-objective RL algorithm modified with weighted sum using predefined weights, and a multi-objective evolutionary algorithm (MOEA)-based approach. We conduct experiments on varying network complexities, mimicking typical real-world challenges using a customisable simulator. The model determines production and delivery quantities across supply chain routes to achieve near-optimal trade-offs between competing objectives, approximating Pareto front sets. The results demonstrate that the primary approach provides the most balanced trade-off between optimality, diversity, and density, further enhanced with a shared experience buffer that allows knowledge transfer among policies. In complex settings, it achieves up to 75\% higher hypervolume than the MOEA-based method and generates solutions that are approximately eleven times denser, signifying better robustness, than those produced by the modified single-objective RL method. Moreover, it ensures stable production and inventory levels while minimising demand loss.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes