multi-armed bandit

Learning implicit multiple time windows in the traveling salesman problem

Classically, researchers working in vehicle routing problems (VRPs) assume that the structure of the problem is known (i.e., objective function, constraints, parameters). However, recent studies have highlighted the gap between the routes offered by …