Skip to Main content Skip to Navigation
Conference papers

Feature Subset Selection for Learning Huge Configuration Spaces: The case of Linux Kernel Size

Abstract : Linux kernels are used in a wide variety of appliances, many of them having strong requirements on the kernel size due to constraints such as limited memory or instant boot. With more than nine thousands of configuration options to choose from, developers and users of Linux actually spend significant effort to document, understand, and eventually tune (combinations of) options for meeting a kernel size. In this paper, we describe a large-scale endeavour automating this task and predicting a given Linux kernel binary size out of unmeasured configurations. We first experiment that state-of-theart solutions specifically made for configurable systems such as performance-influence models cannot cope with that number of options, suggesting that software product line techniques may need to be adapted to such huge configuration spaces. We then show that tree-based feature selection can learn a model achieving low prediction errors over a reduced set of options. The resulting model, trained on 95 854 kernel configurations, is fast to compute, simple to interpret and even outperforms the accuracy of learning without feature selection.
Complete list of metadata
Contributor : Luc Lesoil Connect in order to contact the contributor
Submitted on : Monday, July 11, 2022 - 6:27:25 PM
Last modification on : Friday, August 5, 2022 - 2:54:52 PM


Files produced by the author(s)



Mathieu Acher, Hugo Martin, Juliana Alves Pereira, Luc Lesoil, Arnaud Blouin, et al.. Feature Subset Selection for Learning Huge Configuration Spaces: The case of Linux Kernel Size. SPLC 2022 - 26th ACM International Systems and Software Product Line Conference, Sep 2022, Graz, Austria. pp.1-12, ⟨10.1145/3546932.3546997⟩. ⟨hal-03720273⟩



Record views


Files downloads