Journal articles

Kernel-Based Ensemble Learning in Python

Benjamin Guedj 1, 2, 3, 4 Bhargav Srinivasa Desikan 5
3 MODAL - MOdel for Data Analysis and Learning
Inria Lille - Nord Europe, LPP - Laboratoire Paul Painlevé - UMR 8524, METRICS - Evaluation des technologies de santé et des pratiques médicales - ULR 2694, Polytech Lille - École polytechnique universitaire de Lille, Université de Lille, Sciences et Technologies
Abstract: We propose a new supervised learning algorithm for classification and regression problems where two or more preliminary predictors are available. We introduce KernelCobra, a non-linear learning strategy for combining an arbitrary number of initial predictors. KernelCobra builds on the COBRA algorithm introduced by Biau et al. (2016), which combines estimators based on a notion of proximity of predictions on the training data. Whereas COBRA uses a binary threshold to decide which training points are close enough to be used, we generalize this idea by using a kernel to better capture the proximity information. Such a smoothing kernel assigns more representative weights to each of the training points used to build the final aggregate predictor, and KernelCobra systematically outperforms the COBRA algorithm. While COBRA is intended for regression, KernelCobra handles both classification and regression. KernelCobra is included in the open source Python package Pycobra (0.2.4 and onward), introduced by Guedj and Srinivasa Desikan (2018). Numerical experiments assess the performance (in terms of pure prediction and computational complexity) of KernelCobra on real-life and synthetic datasets.
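
To illustrate the difference between the binary threshold and the smoothing kernel described in the abstract, here is a minimal NumPy sketch for the regression case. It is not the Pycobra API: the function names, the Gaussian-type kernel, the `eps` and `bandwidth` parameters, the mean fallback, and the toy data are all assumptions chosen for illustration, and the exact kernel used in the paper may differ.

```python
import numpy as np


def threshold_weights(train_preds, query_preds, eps):
    """Binary (COBRA-style) weights.

    train_preds: array of shape (n, M), predictions of M preliminary
                 predictors on n training points.
    query_preds: array of shape (M,), predictions of the same predictors
                 on the query point.
    A training point gets weight 1 only if every predictor's output on it
    lies within eps of that predictor's output on the query point.
    """
    close = np.abs(train_preds - query_preds) <= eps
    return close.all(axis=1).astype(float)


def kernel_weights(train_preds, query_preds, bandwidth):
    """Smooth weights from a Gaussian-type kernel (an illustrative choice).

    The weight decays continuously with the squared distance between the
    prediction vectors instead of jumping from 1 to 0 at a threshold.
    """
    sq_dist = ((train_preds - query_preds) ** 2).sum(axis=1)
    return np.exp(-sq_dist / (2.0 * bandwidth ** 2))


def aggregate(weights, y_train):
    """Weighted average of the training responses."""
    total = weights.sum()
    if total == 0.0:
        # Assumed fallback when no training point is retained.
        return float(y_train.mean())
    return float(weights @ y_train / total)


# Toy data (hypothetical): three training points, two preliminary predictors.
train_preds = np.array([[1.0, 1.1],
                        [1.2, 0.9],
                        [3.0, 3.2]])
y_train = np.array([1.0, 1.2, 3.0])
query_preds = np.array([1.1, 1.0])

w_binary = threshold_weights(train_preds, query_preds, eps=0.15)
w_smooth = kernel_weights(train_preds, query_preds, bandwidth=0.5)

pred_binary = aggregate(w_binary, y_train)
pred_smooth = aggregate(w_smooth, y_train)

print("binary weights:", w_binary, "prediction:", pred_binary)
print("kernel weights:", w_smooth, "prediction:", pred_smooth)
```

With the binary rule, the third training point is discarded outright. With the kernel it keeps a small positive weight, so every training point contributes in proportion to how closely the predictors agree on it and on the query point.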

https://hal.inria.fr/hal-02443097
Contributor: Benjamin Guedj
Submitted on: Tuesday, February 18, 2020 - 7:58:57 AM
Last modification on: Tuesday, February 2, 2021 - 3:31:17 AM

File

information-11-00063.pdf
Files produced by the author(s)

Citation

Benjamin Guedj, Bhargav Srinivasa Desikan. Kernel-Based Ensemble Learning in Python. Information, MDPI, 2020, 11 (2), pp.63. ⟨10.3390/info11020063⟩. ⟨hal-02443097v2⟩
