Adaptive Replication of Large-Scale Multi-agent Systems - Towards a Fault-Tolerant Multi-agent Platform

Abstract : In order to construct and deploy large-scale multi-agent systems, we must address one of the fundamental issues of distributed systems, the possibility of partial failures. This means that fault-tolerance is an inevitable issue for large-scale multi-agent systems. In this paper, we discuss the issues and propose an approach for supporting fault-tolerance of multi-agent systems. The starting idea is the application of replication strategies to agents, the most critical agents being replicated to prevent failures. As criticality of agents may evolve during the course of computation and problem solving, and as resources are bounded, we need to dynamically and automatically adapt the number of replicas of agents, in order to maximize their reliability and availability. We will describe our approach and related mechanisms for evaluating the criticality of a given agent (based on application-level semantic information, e.g. interdependences, and also system-level statistical information, e.g., communication load) and for deciding what strategy to apply (e.g., active or passive replication) and how to parameterize it (e.g., number of replicas). We also will report on experiments conducted with our prototype architecture (named DimaX).
Type de document :
Chapitre d'ouvrage
Alessandro Garcia and Ricardo Choren and Carlos Lucena and Paolo Giorgini and Tom Holvoet and Alexander Romanovsky. Software Engineering for Multi-Agent Systems IV. Research Issues and Practical Applications, 3914, Springer, pp.238-253, 2006, Lecture Notes in Computer Science, 〈10.1007/11738817_15〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00697432
Contributeur : Ist Rennes <>
Soumis le : mardi 15 mai 2012 - 12:39:11
Dernière modification le : mercredi 30 août 2017 - 01:12:06

Identifiants

Collections

Citation

Zahia Guessoum, Nora Faci, Jean-Pierre Briot. Adaptive Replication of Large-Scale Multi-agent Systems - Towards a Fault-Tolerant Multi-agent Platform. Alessandro Garcia and Ricardo Choren and Carlos Lucena and Paolo Giorgini and Tom Holvoet and Alexander Romanovsky. Software Engineering for Multi-Agent Systems IV. Research Issues and Practical Applications, 3914, Springer, pp.238-253, 2006, Lecture Notes in Computer Science, 〈10.1007/11738817_15〉. 〈hal-00697432〉

Partager

Métriques

Consultations de la notice

148