inria-00356262, version 1
Incremental Basis Function Expansion in Reinforcement Learning using Cascade-Correlation Networks
Sertan Girgin 1, 2, Philippe Preux 1, 2, 3
8th International Conference on Machine Learning and Applications (2008)
Abstract: In reinforcement learning, it is common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data in a more informative form that facilitates and improves subsequent steps. Because a "good" set of basis functions results in better solutions, and defining such functions becomes harder as problem complexity increases, it is beneficial to be able to generate them automatically. In this paper, we propose a new approach based on the Bellman residual for constructing basis functions using the cascade-correlation learning architecture. We show how this approach can be applied to the Least Squares Policy Iteration algorithm in order to obtain a better approximation of the value function, and consequently improve the performance of the resulting policies. We also demonstrate the effectiveness of the method empirically on some benchmark problems.
- 1 : SEQUEL (INRIA Futurs)
- INRIA – CNRS : UMR8146 – Université Lille I - Sciences et technologies – Université Lille III - Sciences humaines et sociales – Ecole Centrale de Lille
- 2 : Laboratoire d'Informatique Fondamentale de Lille (LIFL)
- CNRS : UMR8022 – Université Lille I - Sciences et technologies – Université Lille III - Sciences humaines et sociales – INRIA
- 3 : GRAPPA (LIFL)
- CNRS : UMR8022 – Université Lille III - Sciences humaines et sociales – Université Lille I - Sciences et technologies
- Domain: Statistics/Machine Learning
Computer Science/Neural Networks
Computer Science/Artificial Intelligence
- http://hal.inria.fr/inria-00356262
- oai:hal.inria.fr:inria-00356262
- Contributor: Philippe Preux
- Submitted on: Thursday, 8 November 2012, 15:33:10
- Last modified on: Friday, 9 November 2012, 08:17:00





