Skip to Main content Skip to Navigation
Journal articles

Kernel-Based Ensemble Learning in Python

Benjamin Guedj 1, 2, 3, 4 Bhargav Srinivasa Desikan 5 
3 MODAL - MOdel for Data Analysis and Learning
LPP - Laboratoire Paul Painlevé - UMR 8524, Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, METRICS - Evaluation des technologies de santé et des pratiques médicales - ULR 2694, Polytech Lille - École polytechnique universitaire de Lille
Abstract : We propose a new supervised learning algorithm, for classification and regression problems where two or more preliminary predictors are available. We introduce \texttt{KernelCobra}, a non-linear learning strategy for combining an arbitrary number of initial predictors. \texttt{KernelCobra} builds on the COBRA algorithm introduced by \citet{biau2016cobra}, which combined estimators based on a notion of proximity of predictions on the training data. While the COBRA algorithm used a binary threshold to declare which training data were close and to be used, we generalize this idea by using a kernel to better encapsulate the proximity information. Such a smoothing kernel provides more representative weights to each of the training points which are used to build the aggregate and final predictor, and \texttt{KernelCobra} systematically outperforms the COBRA algorithm. While COBRA is intended for regression, \texttt{KernelCobra} deals with classification and regression. \texttt{KernelCobra} is included as part of the open source Python package \texttt{Pycobra} (0.2.4 and onward), introduced by \citet{guedj2018pycobra}. Numerical experiments assess the performance (in terms of pure prediction and computational complexity) of \texttt{KernelCobra} on real-life and synthetic datasets.
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download
Contributor : Benjamin Guedj Connect in order to contact the contributor
Submitted on : Tuesday, February 18, 2020 - 7:58:57 AM
Last modification on : Friday, July 8, 2022 - 10:09:47 AM


Files produced by the author(s)



Benjamin Guedj, Bhargav Srinivasa Desikan. Kernel-Based Ensemble Learning in Python. Information, MDPI, 2020, 11 (2), pp.63. ⟨10.3390/info11020063⟩. ⟨hal-02443097v2⟩



Record views


Files downloads