On the Consistency of Ordinal Regression Methods

Fabian Pedregosa 1, 2 Francis Bach 1, 2 Alexandre Gramfort 3
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, ENS Paris - École normale supérieure - Paris, CNRS - Centre National de la Recherche Scientifique, Inria de Paris
Abstract : Many of the ordinal regression models that have been proposed in the literature can be seen as methods that minimize a convex surrogate of the zero-one, absolute, or squared loss functions. A key property that allows to study the statistical implications of such approximations is that of Fisher consistency. Fisher consistency is a desirable property for surrogate loss functions and implies that in the population setting, i.e., if the probability distribution that generates the data were available, then optimization of the surrogate would yield the best possible model. In this paper we will characterize the Fisher consistency of a rich family of surrogate loss functions used in the context of ordinal regression, including support vector ordinal regression, ORBoosting and least absolute deviation. We will see that, for a family of surrogate loss functions that subsumes support vector ordinal regression and ORBoosting, consistency can be fully characterized by the derivative of a real-valued function at zero, as happens for convex margin-based surrogates in binary classification. We also derive excess risk bounds for a surrogate of the absolute error that generalize existing risk bounds for binary classification. Finally, our analysis suggests a novel surrogate of the squared error loss. We compare this novel surrogate with competing approaches on 9 different datasets. Our method shows to be highly competitive in practice, outperforming the least squares loss on 7 out of 9 datasets.
Type de document :
Article dans une revue
Journal of Machine Learning Research, Journal of Machine Learning Research, 2017, 18, pp.1 - 35
Liste complète des métadonnées

https://hal.inria.fr/hal-01054942
Contributeur : Fabian Pedregosa <>
Soumis le : lundi 19 juin 2017 - 20:29:14
Dernière modification le : jeudi 11 janvier 2018 - 06:28:04
Document(s) archivé(s) le : vendredi 15 décembre 2017 - 20:17:00

Fichiers

15-495.pdf
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

  • HAL Id : hal-01054942, version 4
  • ARXIV : 1408.2327

Collections

Citation

Fabian Pedregosa, Francis Bach, Alexandre Gramfort. On the Consistency of Ordinal Regression Methods. Journal of Machine Learning Research, Journal of Machine Learning Research, 2017, 18, pp.1 - 35. 〈hal-01054942v4〉

Partager

Métriques

Consultations de la notice

297

Téléchargements de fichiers

69