Decentralized gradient methods: does topology matter?

Abstract: Consensus-based distributed optimization methods have recently been advocated as alternatives to the parameter-server and ring all-reduce paradigms for large-scale training of machine learning models. In this setting, each worker maintains a local estimate of the optimal parameter vector and iteratively updates it by averaging the estimates obtained from its neighbors and applying a correction on the basis of its local dataset. While theoretical results suggest that the worker communication topology should have a strong impact on the number of epochs needed to converge, previous experiments have suggested the opposite conclusion. This paper sheds light on this apparent contradiction and shows how sparse topologies can lead to faster convergence even in the absence of communication delays.
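The update rule described in the abstract (neighbor averaging followed by a local gradient correction) can be sketched in a few lines. The snippet below is a minimal, illustrative sketch only: the ring topology, uniform mixing weights, quadratic local losses, and all function names are assumptions made for this example and are not the paper's experimental setup.

    # Illustrative sketch of a consensus-based decentralized gradient method
    # (ring topology, uniform weights, and quadratic losses are assumptions
    # made for this example; they are not the authors' setup).
    import numpy as np

    def ring_mixing_matrix(n):
        # Doubly stochastic mixing matrix for a ring: each worker averages
        # itself and its two neighbors with equal weight 1/3.
        W = np.zeros((n, n))
        for i in range(n):
            for j in (i - 1, i, i + 1):
                W[i, j % n] = 1.0 / 3.0
        return W

    def decentralized_gd(grads, x0, W, lr=0.1, steps=200):
        # grads[i](x) returns worker i's gradient computed on its local data.
        n = len(grads)
        X = np.tile(x0, (n, 1))      # one row per worker's local estimate
        for _ in range(steps):
            X = W @ X                # consensus step: average neighbors' estimates
            G = np.stack([grads[i](X[i]) for i in range(n)])
            X -= lr * G              # local correction from each worker's dataset
        return X

    # Toy example: worker i holds f_i(x) = 0.5 * ||x - b_i||^2, so the
    # minimizer of the average loss is the mean of the b_i.
    rng = np.random.default_rng(0)
    n_workers, dim = 8, 5
    B = rng.normal(size=(n_workers, dim))
    grads = [lambda x, b=B[i]: x - b for i in range(n_workers)]
    X_final = decentralized_gd(grads, np.zeros(dim), ring_mixing_matrix(n_workers))
    print(np.allclose(X_final.mean(axis=0), B.mean(axis=0), atol=1e-3))  # True

The communication topology enters only through the mixing matrix W; its spectral properties are what the theoretical results mentioned in the abstract tie to the number of epochs needed to converge.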

https://hal.inria.fr/hal-02430485
Contributor: Giovanni Neglia
Submitted on: Tuesday, January 7, 2020 - 12:54:45 PM
Last modification on: Wednesday, January 22, 2020 - 4:43:17 PM

File

AISTATS2020.pdf (files produced by the author(s))

Identifiers

  • HAL Id: hal-02430485, version 1

Citation

Giovanni Neglia, Chuan Xu, Don Towsley, Gianmarco Calbi. Decentralized gradient methods: does topology matter?. AISTATS 2020 - 23rd International Conference on Artificial Intelligence and Statistics, Jun 2020, Palermo, Italy. ⟨hal-02430485⟩
