Skip to Main content Skip to Navigation

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Wenjie Zheng 1 Aurélien Bellet 2 Patrick Gallinari 1
1 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
2 MAGNET - Machine Learning in Information Networks
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189
Abstract : We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank-Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted from a deep neural network.
Complete list of metadata

Cited literature [49 references]  Display  Hide  Download
Contributor : Aurélien Bellet Connect in order to contact the contributor
Submitted on : Saturday, December 23, 2017 - 7:18:23 AM
Last modification on : Friday, January 21, 2022 - 3:19:54 AM
Long-term archiving on: : Saturday, March 24, 2018 - 12:32:01 PM


Files produced by the author(s)



Wenjie Zheng, Aurélien Bellet, Patrick Gallinari. A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm. [Research Report] Inria Lille. 2017, pp.1-19. ⟨hal-01672066⟩



Les métriques sont temporairement indisponibles