Inferring DQN structure for high-dimensional continuous control - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Inferring DQN structure for high-dimensional continuous control

Résumé

Despite recent advancements in the field of Deep Reinforcement Learning, Deep Q-network (DQN) models still show lackluster performance on problems with high-dimensional action spaces. The problem is even more pronounced for cases with high-dimensional continuous action spaces due to combinatorial increase in the number of the outputs. Recent works approach the problem by dividing the network into multiple parallel or sequential (action) modules responsible for different discretized actions. However there are drawbacks to both the parallel and the sequential approaches, i.e. parallel module architectures lack coordination between action modules, leading to extra complexity in the task, while a sequential structure can result in the vanishing gradients problem and exploding parameter space. In this work we show that the compositional structure of the action modules has a significant impact on the model performance, we propose a novel approach to infer the network structure for DQN models operating with high-dimensional continuous actions. Our method is based on uncertainty estimation techniques and yields substantially higher scores for MuJoCo environments with high-dimensional continuous action spaces, as well as a realistic AAA sailing simulator game.
Fichier principal
Vignette du fichier
6024-Paper.pdf (2.91 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02984307 , version 1 (30-10-2020)

Identifiants

  • HAL Id : hal-02984307 , version 1

Citer

Andrey Sakryukin, Chedy Raïssi, Mohan S Kankanhalli. Inferring DQN structure for high-dimensional continuous control. 2020 International Conference on Machine Learning, Jul 2020, Vienna, Austria. ⟨hal-02984307⟩

Collections

INRIA INRIA2
35 Consultations
275 Téléchargements

Partager

Gmail Facebook X LinkedIn More