The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions

Anis Yazidi; Xuan Zhang; Lei Jiao; B. John Oommen

doi:10.1007/978-3-319-92007-8_38

Conference paper. Year: 2018

Anis Yazidi

Role: Author
PersonId : 1031963

Oslo and Akershus University College of Applied Sciences [Oslo]

Xuan Zhang

Role: Author
PersonId : 1033480

University of Agder

Lei Jiao

Role: Author
PersonId : 1016420

University of Agder

B. John Oommen

Role: Author
PersonId : 1033481

Carleton University

Abstract

Although the field of Learning Automata (LA) has made significant progress over the last four decades, LA-based methods for environments with a large number of actions remain relatively unresolved. Traditional LA (fixed structure, variable structure, discretized, and pursuit) cannot be easily extended to this domain when the number of actions is very large. This is because the dimensionality of the action probability vector is correspondingly large; consequently, most components of the vector will, after a relatively short time, have values smaller than machine accuracy permits, implying that the corresponding actions will never be chosen. This paper pioneers a solution that extends the continuous pursuit paradigm to such large-actioned problem domains. The solution is hierarchical: all the actions offered by the environment reside as leaves of the hierarchy. Further, at every level, we merely require a two-action LA, which automatically resolves the problem of dealing with arbitrarily small action probabilities. Additionally, since all the LA invoke the pursuit paradigm, the best action at every level trickles up towards the root. Thus, by invoking the property of the “max” operator, in which the maximum of numerous maxima is the overall maximum, the hierarchy of LA converges to the optimal action. Apart from reporting the theoretical properties of the scheme, the paper contains extensive experimental results that demonstrate the power of the scheme and its computational advantages. As far as we know, there are no comparable results in the field of LA.
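
The abstract describes the scheme only in words, so the following Python sketch is one possible reading of it rather than the authors' algorithm. It assumes a complete binary tree whose leaves are the actions, a two-action pursuit automaton at every internal node, reward estimates that propagate upward through `max`, and a reward-inaction probability update. The class name `HierarchicalPursuitSketch`, its methods, and the `learning_rate` parameter are hypothetical names introduced here for illustration.

```python
import random


class HierarchicalPursuitSketch:
    """Illustrative binary tree of two-action pursuit automata (not the paper's exact rules)."""

    def __init__(self, depth, learning_rate=0.01, seed=None):
        self.n_actions = 2 ** depth
        self.lam = learning_rate
        self.rng = random.Random(seed)
        # Heap layout: internal nodes are 1 .. n_actions-1,
        # leaves are n_actions .. 2*n_actions-1 (index 0 is unused).
        # p_left[node] is the probability that the node's automaton picks its left child.
        self.p_left = [0.5] * self.n_actions
        self.rewards = [0] * self.n_actions  # per-action reward counts
        self.pulls = [0] * self.n_actions    # per-action selection counts
        # est[node] is the largest reward estimate among the leaves below node.
        self.est = [0.0] * (2 * self.n_actions)

    def select(self):
        """Walk from the root to a leaf, sampling one two-action automaton per level."""
        node = 1
        while node < self.n_actions:
            go_left = self.rng.random() < self.p_left[node]
            node = 2 * node if go_left else 2 * node + 1
        return node - self.n_actions

    def update(self, action, rewarded):
        """Refresh the leaf estimate, then update every automaton on the path to the root."""
        leaf = action + self.n_actions
        self.pulls[action] += 1
        self.rewards[action] += 1 if rewarded else 0
        self.est[leaf] = self.rewards[action] / self.pulls[action]
        node = leaf // 2
        while node >= 1:
            left, right = 2 * node, 2 * node + 1
            # The maximum of the children's maxima is the maximum of the subtree.
            self.est[node] = max(self.est[left], self.est[right])
            if rewarded:  # reward-inaction: probabilities move only on a reward (assumption)
                target = 1.0 if self.est[left] >= self.est[right] else 0.0
                # Pursue the child whose subtree currently holds the best estimate.
                self.p_left[node] += self.lam * (target - self.p_left[node])
            node //= 2

    def best_action(self):
        """Follow the more probable child at every level."""
        node = 1
        while node < self.n_actions:
            node = 2 * node if self.p_left[node] >= 0.5 else 2 * node + 1
        return node - self.n_actions
```

In use, one would call `select()` to obtain an action, query the environment for a binary reward, and pass both to `update()`. Because each automaton holds a single probability over two children, no stored value has to represent a share of one very long probability vector, which is the numerical issue the abstract raises.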

Keywords

Learning Automata (LA); Pursuit LA; Estimator-based LA; Hierarchical LA; LA with large number of actions

Domains

Computer Science [cs]

Main file

467708_1_En_38_Chapter.pdf (154.59 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01821050

Submitted on: Friday, 22 June 2018, 11:45:06

Last modified on: Friday, 5 June 2020, 17:10:10

Long-term archiving on: Tuesday, 25 September 2018, 12:40:24

Dates and versions

hal-01821050 , version 1 (22-06-2018)

License

Attribution

Identifiers

HAL Id : hal-01821050 , version 1
DOI : 10.1007/978-3-319-92007-8_38

Cite

Anis Yazidi, Xuan Zhang, Lei Jiao, B. John Oommen. The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions. 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2018, Rhodes, Greece. pp.451-461, ⟨10.1007/978-3-319-92007-8_38⟩. ⟨hal-01821050⟩

Collections

IFIP IFIP-AICT IFIP-TC IFIP-WG IFIP-TC12 IFIP-AIAI IFIP-WG12-5 IFIP-AICT-519

59 views

28 downloads
