A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Wenjie Zheng; Aurélien Bellet; Patrick Gallinari

doi:10.1007/s10994-018-5713-5

Rapport (Rapport De Recherche) Année : 2017

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

(1) , (2) , (1)

1
2

Wenjie Zheng

Fonction : Auteur

Machine Learning and Information Access

Aurélien Bellet

Fonction : Auteur
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Machine Learning in Information Networks

Patrick Gallinari

Fonction : Auteur
PersonId : 751615
IdHAL : patrick-gallinari
ORCID : 0000-0001-9060-9001
IdRef : 070709076

Machine Learning and Information Access

Résumé

We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank-Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted from a deep neural network.

Mots clés

Frank–Wolfe algorithm Low-rank learning Distributed optimization Trace norm Multi-task learning Multinomial logistic regression

Domaines

Apprentissage [cs.LG] Machine Learning [stat.ML] Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

main.pdf (533.01 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Aurélien Bellet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01672066

Soumis le : samedi 23 décembre 2017-07:18:23

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Archivage à long terme le : samedi 24 mars 2018-12:32:01

Dates et versions

hal-01672066 , version 1 (23-12-2017)

Identifiants

HAL Id : hal-01672066 , version 1
ARXIV : 1712.07495
DOI : 10.1007/s10994-018-5713-5

Citer

Wenjie Zheng, Aurélien Bellet, Patrick Gallinari. A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm. [Research Report] Inria Lille. 2017, pp.1-19. ⟨hal-01672066⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC UNIV-LILLE3 CNRS INRIA LIP6 CRISTAL INRIA2 CRISTAL-MAGNET LARA UNIV-LILLE SORBONNE-UNIVERSITE SU-SCIENCES ANR

461 Consultations

134 Téléchargements

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager