Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

An important goal of research in Deep Reinforcement Learning in mobile robotics is to train agents capable of solving complex tasks that require a high level of scene understanding and reasoning from an egocentric perspective. When agents are trained in simulation, ideal environments should satisfy a currently unobtainable combination of high-fidelity photographic observations, a massive number of different environment configurations, and fast simulation speeds. In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling it from the requirement of high-fidelity photographic observations. We present a suite of tasks requiring complex reasoning and exploration in continuous, partially observable 3D environments. The objective is to provide challenging scenarios and a robust baseline agent architecture that can be trained on mid-range consumer hardware in under 24 hours. Our scenarios combine two key advantages: (i) they are based on a simple but highly efficient 3D environment (ViZDoom), which allows high-speed simulation (12,000 fps); (ii) they provide the user with a range of difficulty settings, in order to identify the limitations of current state-of-the-art algorithms and network architectures. We aim to increase accessibility to the field of Deep-RL by providing baselines for challenging scenarios where new ideas can be iterated on quickly. We argue that the community should be able to address challenging problems in the reasoning of mobile agents without the need for a large compute infrastructure. Code for the generation of scenarios and training of baselines is available online.

Domains

Computer Science [cs]

Main file

1904.01806.pdf (4.98 MB)

Origin: Files produced by the author(s)

Contributor: Edward Beeching

https://inria.hal.science/hal-02934130

Submitted on: Wednesday, September 9, 2020, 08:52:17

Last modified on: Wednesday, July 5, 2023, 15:28:04

Long-term archiving on: Friday, December 4, 2020, 19:37:04

Dates and versions

hal-02934130, version 1 (09-09-2020)

Identifiers

HAL Id: hal-02934130, version 1
DOI: 10.1109/ICPR48806.2021.9412212

Cite

Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin. Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer. ICPR 2020 - 25th International Conference on Pattern Recognition, Dec 2020, Milan, Italy. pp.1-16, ⟨10.1109/ICPR48806.2021.9412212⟩. ⟨hal-02934130⟩


Collections

CNRS INRIA UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INRIA2 CITI INSA-GROUPE UDL
