G. E. Batista, A. L. Bazzan, and M. C. Monard, Balancing training data for automated annotation of keywords: a case study, WOB, pp.10-18, 2003.

N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, SMOTE: synthetic minority over-sampling technique, Journal of artificial intelligence research, pp.321-357, 2002.

A. Dal-pozzolo, O. Caelen, S. Waterschoot, and G. Bontempi, Racing for Unbalanced Methods Selection, International Conference on Intelligent Data Engineering and Automated Learning, pp.24-31, 2013.
DOI : 10.1007/978-3-642-41278-3_4

H. Han, W. Wang, and B. Mao, Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning, International Conference on Intelligent Computing, pp.878-887, 2005.
DOI : 10.1007/11538059_91

P. Hart, The condensed nearest neighbor rule Information Theory, IEEE Transactions on, vol.14, issue.3, pp.515-516, 1968.
DOI : 10.1109/tit.1968.1054155

H. He and E. Garcia, Learning from imbalanced data. Knowledge and Data Engineering, IEEE Transactions on, vol.21, issue.9, pp.1263-1284, 2009.

H. He, Y. Bai, A. Edwardo, S. Garcia, and . Li, Adasyn: Adaptive synthetic sampling approach for imbalanced learning, IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), pp.1322-1328, 2008.

M. Kubat and S. Matwin, Addressing the curse of imbalanced training sets: one-sided selection, International Conference in Machine Learning, pp.179-186, 1997.

M. Kuhn, Caret: classification and regression training, Astrophysics Source Code Library, vol.1, p.5003, 2015.

J. Laurikkala, Improving Identification of Difficult Small Classes by Balancing Class Distribution, 2001.
DOI : 10.1007/3-540-48229-6_9

X. Liu, J. Wu, and Z. Zhou, Exploratory undersampling for class-imbalance learning, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol.39, issue.2, pp.539-550, 2009.

I. Mani and I. Zhang, knn approach to unbalanced data distributions: a case study involving information extraction, Proceedings of Workshop on Learning from Imbalanced Datasets, 2003.

H. M. Nguyen, E. W. Cooper, and K. Kamei, Borderline over-sampling for imbalanced data classification, International Journal of Knowledge Engineering and Soft Data Paradigms, vol.3, issue.1, pp.4-21, 2011.
DOI : 10.1504/IJKESDP.2011.039875

URL : http://ir.lib.hiroshima-u.ac.jp/files/public/2/28413/20141016164340528778/A1005.pdf

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

R. C. Prati, G. E. Batista, and M. C. Monard, Data mining with imbalanced class distributions: concepts and methods, Indian International Conference Artificial Intelligence, pp.359-376, 2009.

M. Rastgoo, G. Lemaitre, J. Massich, O. Morel, F. Marzani et al., Tackling the problem of data imbalancing for melanoma classification, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01250949

M. R. Smith, C. Martinez, and . Giraud-carrier, An instance level analysis of data complexity, Machine Learning, vol.8, issue.7, pp.225-256, 2014.
DOI : 10.1007/s10994-013-5422-z

S. C. Sonnenburg, S. Henschel, C. Widmer, J. Behr, A. Zien et al., The SHOGUN machine learning toolbox, Journal of Machine Learning Research, vol.11, pp.1799-1802, 2010.

I. Tomek, Two modifications of CNN. Systems, Man, and Cybernetics, IEEE Transactions on, vol.6, pp.769-772, 1976.

L. Torgo, Data mining with R: learning with case studies, 2010.
DOI : 10.1201/b10328

D. L. Wilson, Asymptotic properties of nearest neighbor rules using edited data. Systems, Man and Cybernetics, IEEE Transactions on, issue.3, pp.408-421, 1972.

Q. Yang and X. Wu, 10 CHALLENGING PROBLEMS IN DATA MINING RESEARCH, International Journal of Information Technology & Decision Making, vol.05, issue.04, pp.597-604, 2006.
DOI : 10.1142/S0219622006002258