Abstract : Some agents have to face multiple objectives simultaneously. In such cases, and considering partially observable environments, classical Reinforcement Learning (RL) is prone to fall in pretty low local optima, only learning straightforward behaviors. We present here a method that tries to identify and learn independent ``basic'' behaviors solving separate tasks the agent has to face. Using a combination of these behaviors (an action-selection algorithm), the agent is then able to efficiently deal with various complex goals in complex environments.
https://hal.inria.fr/inria-00099828
Contributeur : Publications Loria
<>
Soumis le : mardi 26 septembre 2006 - 09:41:36
Dernière modification le : jeudi 11 janvier 2018 - 06:19:51
Olivier Buffet, Alain Dutech. A Self-Made Agent Based on Action-Selection. Sixth European Workshop on Reinforcement Learning - EWRL-6 2003, 2003, Nancy, France, pp.47-48, 2003. 〈inria-00099828〉