SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits

Radu Ciucanu; Pascal Lafourcade; Gael Marcadet; Marta Soare

doi:10.1613/jair.1.13163

Journal article. Journal of Artificial Intelligence Research. Year: 2022


Radu Ciucanu

Role: Author
PersonId: 176966
IdHAL: radu-ciucanu
IdRef: 189245735

Laboratoire d'Informatique de Grenoble

Pascal Lafourcade

Role: Author
PersonId: 5561
IdHAL: pascalafourcade
ORCID: 0000-0002-4459-511X
IdRef: 109895355

Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes

Gael Marcadet

Role: Author
PersonId: 1125083
IdHAL: gamarcad
ORCID: 0000-0003-1194-1343

Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes

Marta Soare

Role: Author
PersonId: 488
IdHAL: marta-soare
IdRef: 191904015

Laboratoire d'Informatique de Grenoble

Abstract

The multi-armed bandit is a reinforcement learning model where a learning agent repeatedly chooses an action (pulling a bandit arm) and the environment responds with a stochastic outcome (reward) drawn from an unknown distribution associated with the chosen arm. Bandits have a wide range of applications, such as Web recommendation systems. We address the cumulative reward maximization problem in a secure federated learning setting, where multiple data owners keep their data stored locally and collaborate under the coordination of a central orchestration server. We rely on cryptographic schemes and propose SAMBA, a generic framework for Secure federAted Multi-armed BAndits. Each data owner has data associated with a bandit arm, and the bandit algorithm has to sequentially select which data owner is solicited at each time step. We instantiate SAMBA for five bandit algorithms. We show that SAMBA returns the same cumulative reward as the non-secure versions of bandit algorithms, while satisfying formally proven security properties. We also show that the overhead due to cryptographic primitives is linear in the size of the input, which is confirmed by our proof-of-concept implementation.

https://www.jair.org/index.php/jair/article/view/13163
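To make the bandit model in the abstract concrete, here is a minimal sketch of a plain, non-secure bandit loop in Python. It is not SAMBA and contains no cryptography or federation. It uses UCB1, a standard bandit algorithm picked here purely for illustration; this record does not list which five algorithms SAMBA instantiates. The function name `ucb1`, the Bernoulli reward model, and the `arm_means` parameter are assumptions introduced for this example. In the federated reading of the abstract, each arm index would correspond to one data owner.

```python
import math
import random


def ucb1(arm_means, horizon, seed=0):
    """Simulate plain (non-secure) UCB1 on Bernoulli arms.

    arm_means: hypothetical success probability of each arm (unknown to the agent).
    horizon:   number of time steps (must be at least the number of arms).
    Returns (cumulative_reward, pulls_per_arm).
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    pulls = [0] * n_arms      # how many times each arm was chosen
    sums = [0.0] * n_arms     # total reward observed per arm
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Initialization: pull every arm once.
            arm = t - 1
        else:
            # Choose the arm with the highest upper confidence bound:
            # empirical mean plus an exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda i: sums[i] / pulls[i]
                + math.sqrt(2.0 * math.log(t) / pulls[i]),
            )
        # The environment answers with a stochastic reward for the chosen arm.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        pulls[arm] += 1
        sums[arm] += reward
        total += reward

    return total, pulls


if __name__ == "__main__":
    reward, counts = ucb1([0.2, 0.5, 0.8], horizon=1000)
    print("cumulative reward:", reward)
    print("pulls per arm:", counts)
```

The cumulative reward returned by such a loop is the quantity the abstract says SAMBA preserves relative to the non-secure algorithm.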

Keywords

machine learning; reinforcement learning; distributed AI

Domains

Artificial Intelligence [cs.AI]; Machine Learning [cs.LG]; Cryptography and Security [cs.CR]; Databases [cs.DB]


https://inria.hal.science/hal-03553894

Submitted on: Thursday, February 3, 2022, 09:22:47

Last modified on: Thursday, April 4, 2024, 21:41:38

Dates and versions

hal-03553894 , version 1 (03-02-2022)

Identifiers

HAL Id: hal-03553894, version 1
DOI: 10.1613/jair.1.13163

Cite

Radu Ciucanu, Pascal Lafourcade, Gael Marcadet, Marta Soare. SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits. Journal of Artificial Intelligence Research, 2022, 73, pp. 737-765. ⟨10.1613/jair.1.13163⟩. ⟨hal-03553894⟩


Collections

UGA PRES_CLERMONT CNRS LIG LIMOS MIAI ANR LIG_SIDCH CLERMONT-AUVERGNE-INP
