Solving multi-echelon inventory problems using deep reinforcement learning with centralized control (CORS)

Abstract

Multi-echelon inventory models aim to minimize the system-wide total cost in a multi-stage supply chain by applying a proper ordering policy to each stage. The optimal solution is known only when several strict assumptions regarding the cost structure are made. To solve scenarios where those assumptions are relaxed, we apply and compare three deep reinforcement learning (DRL) algorithms, namely Deep Q-Network, Advantage Actor-Critic, and Twin Delayed Deep Deterministic Policy Gradient, to efficiently determine the inventory policy. We consider a serial supply chain as in the beer game, a classic multi-echelon inventory problem, and extend the application of DRL to the centralized decision-making setting, which is more complex due to its significantly larger state and action spaces. The experiments show that in both the decentralized and centralized settings, the DRL agents learn policies that yield significant cost savings compared to benchmark heuristics.
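To make the centralized setting concrete, the sketch below simulates a beer-game-style serial supply chain in which a single agent observes every stage and chooses one order quantity per stage each period, and compares against a simple order-up-to (base-stock) benchmark. This is an illustrative toy, not the paper's experimental configuration: the stage count, lead time, cost parameters, demand distribution, zero information delay, and the base-stock target are all assumptions made for demonstration.

```python
import random

class SerialChainEnv:
    """Illustrative beer-game-style serial supply chain (all parameters
    are assumptions for demonstration, not the paper's configuration).

    A centralized agent observes every stage and picks one order
    quantity per stage each period. Stage 0 is the retailer facing
    random customer demand; the last stage orders from an
    unconstrained outside supplier.
    """

    def __init__(self, n_stages=4, lead_time=2, hold_cost=0.5,
                 backlog_cost=1.0, seed=0):
        self.n, self.L = n_stages, lead_time
        self.h, self.b = hold_cost, backlog_cost
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.on_hand = [10] * self.n                  # physical stock per stage
        self.backlog = [0] * self.n                   # unfilled downstream demand
        self.pipeline = [[0] * self.L for _ in range(self.n)]  # in-transit units
        return self._state()

    def _state(self):
        # Centralized state: every stage's stock, backlog, and pipeline,
        # which is why the state space grows quickly with the stage count.
        return (tuple(self.on_hand), tuple(self.backlog),
                tuple(tuple(p) for p in self.pipeline))

    def step(self, orders):
        # 1) Shipments that finished their lead time arrive.
        for i in range(self.n):
            self.on_hand[i] += self.pipeline[i].pop(0)
            self.pipeline[i].append(0)
        # 2) Customer demand hits the retailer; each stage's order is the
        #    demand seen by its upstream neighbour (zero information delay).
        demand = self.rng.randint(0, 8)
        demands = [demand] + list(orders[:-1])
        # 3) Each stage ships what stock allows toward backlog + new demand.
        for i in range(self.n):
            owed = self.backlog[i] + demands[i]
            shipped = min(self.on_hand[i], owed)
            self.on_hand[i] -= shipped
            self.backlog[i] = owed - shipped
            if i > 0:                                 # enters downstream pipeline
                self.pipeline[i - 1][-1] += shipped
        # 4) The outside supplier always fills the top stage's order.
        self.pipeline[self.n - 1][-1] += orders[-1]
        # 5) Per-period system-wide cost; reward is its negative.
        cost = sum(self.h * oh + self.b * bl
                   for oh, bl in zip(self.on_hand, self.backlog))
        return self._state(), -cost

# A simple order-up-to (base-stock) benchmark: each stage orders the
# gap between a target S and its current inventory position.
def base_stock_policy(env, target=16):
    orders = []
    for i in range(env.n):
        position = env.on_hand[i] - env.backlog[i] + sum(env.pipeline[i])
        orders.append(max(0, target - position))
    return orders

env = SerialChainEnv()
total = 0.0
for _ in range(100):
    _, reward = env.step(base_stock_policy(env))
    total += reward
print(f"average per-period cost: {-total / 100:.2f}")
```

A DRL agent in the centralized setting would replace `base_stock_policy` with a learned mapping from the full state to a vector of order quantities; the joint action space (one quantity per stage) is what makes this setting harder than the decentralized one.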

Date
May 29, 2023 9:00 AM — May 31, 2023 5:00 PM
Location
Montréal, Canada
