D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

Aurélien Bellet; Anne-Marie Kermarrec; Erick Lavoie

Pré-Publication, Document De Travail Année : 2021

D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

(1) , (2) , (2)

1
2

Aurélien Bellet

Fonction : Auteur
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Machine Learning in Information Networks

Anne-Marie Kermarrec

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Erick Lavoie

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Résumé

The convergence speed of machine learning models trained with Federated Learning is significantly affected by heterogeneous data partitions, even more so in a fully decentralized setting without a central server. In this paper, we show that the impact of label distribution skew, an important type of data heterogeneity, can be significantly reduced by carefully designing the underlying communication topology. We present D-Cliques, a novel topology that reduces gradient bias by grouping nodes in sparsely interconnected cliques such that the label distribution in a clique is representative of the global label distribution. We also show how to adapt the updates of decentralized SGD to obtain unbiased gradients and implement an effective momentum with D-Cliques. Our extensive empirical evaluation on MNIST and CIFAR10 demonstrates that our approach provides similar convergence speed as a fully-connected topology, which provides the best convergence in a data heterogeneous setting, with a significant reduction in the number of edges and messages. In a 1000-node topology, D-Cliques require 98% less edges and 96% less total messages, with further possible gains using a small-world topology across cliques.

Domaines

Apprentissage [cs.LG] Machine Learning [stat.ML]

Fichier principal

2104.07365.pdf (2.76 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Aurélien Bellet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03498160

Soumis le : lundi 20 décembre 2021-20:05:55

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-03498160 , version 1 (20-12-2021)

Identifiants

HAL Id : hal-03498160 , version 1
ARXIV : 2104.07365

Citer

Aurélien Bellet, Anne-Marie Kermarrec, Erick Lavoie. D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning. 2021. ⟨hal-03498160⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 INRIA-EPFL CRISTAL-MAGNET UNIV-LILLE ANR

85 Consultations

98 Téléchargements

D-Cliques: Compensating for Data Heterogeneity with Topology in Decentralized Federated Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager