Inferring DQN structure for high-dimensional continuous control - Archive ouverte HAL Access content directly
Conference Papers Year :

Inferring DQN structure for high-dimensional continuous control

(1) , (2) , (1)
1
2

Abstract

Despite recent advancements in the field of Deep Reinforcement Learning, Deep Q-network (DQN) models still show lackluster performance on problems with high-dimensional action spaces. The problem is even more pronounced for cases with high-dimensional continuous action spaces due to combinatorial increase in the number of the outputs. Recent works approach the problem by dividing the network into multiple parallel or sequential (action) modules responsible for different discretized actions. However there are drawbacks to both the parallel and the sequential approaches, i.e. parallel module architectures lack coordination between action modules, leading to extra complexity in the task, while a sequential structure can result in the vanishing gradients problem and exploding parameter space. In this work we show that the compositional structure of the action modules has a significant impact on the model performance, we propose a novel approach to infer the network structure for DQN models operating with high-dimensional continuous actions. Our method is based on uncertainty estimation techniques and yields substantially higher scores for MuJoCo environments with high-dimensional continuous action spaces, as well as a realistic AAA sailing simulator game.
Fichier principal
Vignette du fichier
6024-Paper.pdf (2.91 Mo) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02984307 , version 1 (30-10-2020)

Identifiers

  • HAL Id : hal-02984307 , version 1

Cite

Andrey Sakryukin, Chedy Raïssi, Mohan S Kankanhalli. Inferring DQN structure for high-dimensional continuous control. 2020 International Conference on Machine Learning, Jul 2020, Vienna, Austria. ⟨hal-02984307⟩

Collections

INRIA INRIA2
34 View
260 Download

Share

Gmail Facebook Twitter LinkedIn More