Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Fast imbalanced binary classification: a moment-based approach

Edouard Grave 1, 2, 3 Laurent El Ghaoui 1 
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique - ENS Paris, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : In this paper, we consider the problem of imbalanced binary classification in which the number of negative examples is much larger than the number of positive examples. The two mainstream methods to deal with such problems are to assign different weights to negative and positive points or to subsample points from the negative class. In this paper, we propose a different approach: we represent the negative class by the two first moments of its probability distribution (the mean and the covariance), while still modeling the positive class by individual examples. Therefore, our formulation does not depend on the number of negative examples, making it suitable to highly imbalanced problems and scalable to large datasets. We demonstrate empirically, on a protein classification task and a text classification task, that our approach achieves similar statistical performance than the two mainstream approaches to imbalanced classification problems, while being more computationally efficient.
Document type :
Preprints, Working Papers, ...
Complete list of metadata
Contributor : Edouard Grave Connect in order to contact the contributor
Submitted on : Wednesday, November 26, 2014 - 8:01:21 PM
Last modification on : Thursday, March 17, 2022 - 10:08:44 AM
Long-term archiving on: : Friday, February 27, 2015 - 11:06:26 AM


Files produced by the author(s)


  • HAL Id : hal-01087452, version 1



Edouard Grave, Laurent El Ghaoui. Fast imbalanced binary classification: a moment-based approach. 2014. ⟨hal-01087452⟩



Record views


Files downloads