Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance

Giulia Luise; Alessandro Rudi; Massimiliano Pontil; Carlo Ciliberto

Communication Dans Un Congrès Année : 2018

Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance

(1) , (2, 3, 4) , (1, 5) , (1)

1
2
3
4
5

Giulia Luise

Fonction : Auteur

University College of London [London]

Alessandro Rudi

Fonction : Auteur
PersonId : 21784
IdHAL : alessandro-rudi
ORCID : 0000-0002-3879-7794
IdRef : 240218043

Université Paris Sciences et Lettres

Département d'informatique - ENS Paris

Statistical Machine Learning and Parsimony

Massimiliano Pontil

Fonction : Auteur

University College of London [London]

Istituto Italiano di Tecnologia

Carlo Ciliberto

Fonction : Auteur

University College of London [London]

Résumé

Applications of optimal transport have recently gained remarkable attention thanks to the computational advantages of entropic regularization. However, in most situations the Sinkhorn approximation of the Wasserstein distance is replaced by a regularized version that is less accurate but easy to differentiate. In this work we characterize the differential properties of the original Sinkhorn distance, proving that it enjoys the same smoothness as its regularized version and we explicitly provide an efficient algorithm to compute its gradient. We show that this result benefits both theory and applications: on one hand, high order smoothness confers statistical guarantees to learning with Wasserstein approximations. On the other hand, the gradient formula allows us to efficiently solve learning and optimization problems in practice. Promising preliminary experiments complement our analysis.

Domaines

Apprentissage [cs.LG] Optimisation et contrôle [math.OC]

Fichier principal

arxivG3.pdf (413.75 Ko)

5wlambda.pdf (7.3 Ko)

5wtilde.pdf (6.32 Ko)

barywlambda.pdf (9.93 Ko)

barywtilde.pdf (6.7 Ko)

datasetcrop.png (10.45 Ko)

deltasbary2.pdf (6.83 Ko)

ellipinput.jpg (111.43 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alessandro Rudi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01958887

Soumis le : mercredi 19 décembre 2018-00:10:15

Dernière modification le : vendredi 19 avril 2024-16:18:56

Archivage à long terme le : mercredi 20 mars 2019-15:29:03

Dates et versions

hal-01958887 , version 1 (19-12-2018)

Identifiants

HAL Id : hal-01958887 , version 1
ARXIV : 1805.11897

Citer

Giulia Luise, Alessandro Rudi, Massimiliano Pontil, Carlo Ciliberto. Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance. NIPS 2018 - Advances in Neural Information Processing Systems, Dec 2018, Montreal, Canada. pp.5864-5874. ⟨hal-01958887⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA INSMI INRIA2 TDS-MACS PSL

81 Consultations

277 Téléchargements

Differential Properties of Sinkhorn Approximation for Learning with Wasserstein Distance

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager