Skip to Main content Skip to Navigation
Conference papers

SULFR: Simulation of Urban Logistic For Reinforcement

Guillaume Bono 1, 2 Jilles Dibangoye 1 Laëtitia Matignon 1, 3 Florian Pereyron 4 Olivier Simonin 1
1 CHROMA - Robots coopératifs et adaptés à la présence humaine en environnements dynamiques
Inria Grenoble - Rhône-Alpes, CITI - CITI Centre of Innovation in Telecommunications and Integration of services
3 SyCoSMA - Systèmes Cognitifs et Systèmes Multi-Agents
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In urban logistics, various sources of uncertainty can invalidate pre-planned routes. In this context, a routing strategy that uses available information from the environment could help improve the overall performance of the routing process by dynamically choosing the next client at the online execution time. While static and deterministic testbeds for vehicle routing exist, their stochastic and dynamic counterparts are still missing. This paper proposes an interface to the microtraffic simulation package SUMO that implement a generative model of stochastic and dynamic vehicle routing problems. We formalize the latter using a reinforcement learning framework for semi-Markov decision processes. The resulting testbeds make it possible to compare single- and multi-agent reinforcement learning algorithms in customizable routing environments. We report our preliminary tests to evaluate a hand-crafted policy on some basic scenarios.
Complete list of metadata

Cited literature [21 references]  Display  Hide  Download
Contributor : Guillaume Bono Connect in order to contact the contributor
Submitted on : Friday, September 7, 2018 - 1:21:33 PM
Last modification on : Tuesday, January 11, 2022 - 8:52:35 AM


Files produced by the author(s)


  • HAL Id : hal-01847773, version 2


Guillaume Bono, Jilles Dibangoye, Laëtitia Matignon, Florian Pereyron, Olivier Simonin. SULFR: Simulation of Urban Logistic For Reinforcement. Workshop on Prediction and Generative Modeling in Reinforcement Learning, Jul 2018, Stockholm, Sweden. pp.1-5. ⟨hal-01847773v2⟩



Les métriques sont temporairement indisponibles