Automatic Generation of an Agent's Basic Behaviors

Olivier Buffet 1 Alain Dutech 1 François Charpillet 1
1 MAIA - Autonomous intelligent machine
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The agent approach, as seen by \cite{Russell95}, intends to design ``intelligent'' behaviors. Yet, Reinforcement Learning (RL) methods often fail when confronted with complex tasks. We are therefore trying to develop a methodology for the automated design of agents (in the framework of Markov Decision Processes) in the case where the global task can be decomposed into simpler -possibly concurrent- sub-tasks. Our main idea is to automatically combine basic behaviors using RL methods. This led us to propose two complementary mechanisms presented in the current paper. The first mechanism builds a global policy using a weighted combination of basic policies (which are reusable), the weights being learned by the agent (using Simulated Annealing in our case). An agent designed this way is highly scalable as, without further refinement of the global behavior, it can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask. The second mechanism aims at creating new basic behaviors for combination. It is based on an incremental learning method that builds on the approximate solution obtained through the combination of older behaviors.
Type de document :
Communication dans un congrès
Rosenschein, Sandholm, Wooldridge and Yokoo. Second International Joint Conference on Autonomous Agents and Multi-Agent Systems - AAMAS'03, 2003, Melbourne, Victoria, Australie, ACM press, pp.875-882, 2003
Liste complète des métadonnées

https://hal.inria.fr/inria-00099817
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 09:41:31
Dernière modification le : jeudi 11 janvier 2018 - 06:19:51

Identifiants

  • HAL Id : inria-00099817, version 1

Collections

Citation

Olivier Buffet, Alain Dutech, François Charpillet. Automatic Generation of an Agent's Basic Behaviors. Rosenschein, Sandholm, Wooldridge and Yokoo. Second International Joint Conference on Autonomous Agents and Multi-Agent Systems - AAMAS'03, 2003, Melbourne, Victoria, Australie, ACM press, pp.875-882, 2003. 〈inria-00099817〉

Partager

Métriques

Consultations de la notice

170