Using Deep Q-Learning (Alpha-Go algorithm) to solve routing problems in warehouse logistics
|Title||Using Deep Q-Learning (Alpha-Go algorithm) to solve routing problems in warehouse logistics|
|Summary||Solving the famous vehicle-routing problem using Deep Q-Learning|
|Keywords||vehicle routing problem, Reinforcement learning|
|Supervisor||Jennifer David, Thorsteinn Rögnvaldsson|
Sony Mobile in Lund offers a Master's thesis project on routing a group of robots or forklifts operating in a warehouse. The aim is to generate paths/routes for all vehicles according to a pre-determined schedule, without any path conflicts or schedule conflicts. This is an NP-hard problem known as the vehicle routing problem, and it has typically been tackled with heuristic approaches. Here, we try to use Deep Q-learning to solve it.
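To give a rough idea of the learning loop involved, the sketch below trains a single vehicle to find a route on a tiny grid. It is a deliberate simplification and not the project's method: it uses tabular Q-learning (a lookup table in place of the neural network that Deep Q-learning would use), handles one vehicle only, and ignores schedules and conflicts between vehicles. The grid size, shelf positions, rewards, and all function names are assumptions made for illustration.

```python
import random

# Hypothetical 4x4 warehouse grid; every constant here is an illustrative assumption.
ROWS, COLS = 4, 4
START, GOAL = (0, 0), (3, 3)
BLOCKED = {(1, 1), (2, 1)}                      # shelf cells the vehicle cannot enter
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]    # up, down, left, right


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < ROWS and 0 <= c < COLS) or (r, c) in BLOCKED:
        return state, -1.0, False               # invalid move: stay put, pay the step cost
    if (r, c) == GOAL:
        return (r, c), 10.0, True
    return (r, c), -1.0, False


def train(episodes=2000, alpha=0.5, gamma=0.95, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {}                                      # (state, action_index) -> estimated return
    n = len(ACTIONS)
    for _ in range(episodes):
        state = START
        for _ in range(100):                    # cap the episode length
            if rng.random() < epsilon:
                a = rng.randrange(n)            # explore
            else:
                a = max(range(n), key=lambda i: q.get((state, i), 0.0))  # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            best_next = 0.0 if done else max(q.get((nxt, i), 0.0) for i in range(n))
            old = q.get((state, a), 0.0)
            # Q-learning update: move the estimate toward reward + gamma * max_a' Q(s', a')
            q[(state, a)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q


def greedy_route(q, max_steps=50):
    """Follow the highest-valued action from START until GOAL or the step cap."""
    state, route = START, [START]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q.get((state, i), 0.0))
        state, _, done = step(state, ACTIONS[a])
        route.append(state)
        if done:
            break
    return route


if __name__ == "__main__":
    print(greedy_route(train()))
```

In a Deep Q-learning variant, the table `q` would be replaced by a network that maps a state to one value per action, which is what makes larger state spaces (many vehicles, many pick-up locations) tractable. Multi-vehicle routing would also need a state and reward design that penalises path and schedule conflicts, which this sketch does not attempt.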