inria-00187997, version 2
Feature Discovery in Reinforcement Learning using Genetic Programming
Sertan Girgin
1Philippe Preux 1
(2007)
Résumé : The goal of reinforcement learning is to find a policy that maximizes the expected reward accumulated by an agent over time based on its interactions with the environment; to this end, a function of the state of the agent has to be learned. It is often the case that states are better characterized by a set of features. However, finding a ''good'' set of features is generally a tedious task which requires a good domain knowledge. In this paper, we propose a genetic programming based approach for feature discovery in reinforcement learning. A population of individuals, each representing a set of features is evolved, and individuals are evaluated by their average performance on short reinforcement learning trials. The results of experiments conducted on several benchmark problems demonstrate that the resulting features allow the agent to learn better policies in a reduced amount of episodes.
- 1 : SEQUEL (INRIA Futurs)
- INRIA – CNRS : UMR8022 – CNRS : UMR8146 – Université Lille 1 - Sciences et Technologies – Université Charles de Gaulle - Lille III – Ecole Centrale de Lille
- Collaboration : Grid'5000
- Domaine : Informatique/Apprentissage
- Mots-clés : feature discovery – reinforcement learning – genetic programming
- Versions disponibles : v1 (15-11-2007) v2 (19-11-2007)
- inria-00187997, version 2
- http://hal.inria.fr/inria-00187997
- oai:hal.inria.fr:inria-00187997
- Contributeur : Rapport De Recherche Inria
- Soumis le : Lundi 19 Novembre 2007, 10:25:48
- Dernière modification le : Lundi 23 Avril 2012, 15:28:05






Documents associés

Exporter