
Learning Very Large Configuration Spaces: What Matters for Linux Kernel Sizes

Abstract : Linux kernels are used in a wide variety of appliances, many of which place strong requirements on kernel size because of constraints such as limited memory or instant boot. With more than ten thousand configuration options to choose from, finding a suitable trade-off between kernel size and functionality is an extremely hard problem. Developers, contributors, and users spend significant effort documenting, understanding, and eventually tuning options (and combinations of options) to meet a target kernel size. In this paper, we investigate how machine learning can help explain what matters for predicting a given Linux kernel size. Unveiling what matters in such a very large configuration space is challenging for two reasons: (1) however much time we spend, we can only build and measure a tiny fraction of the possible kernel configurations; (2) the prediction model should be both accurate and interpretable. We compare different machine learning algorithms and demonstrate the benefits of specific feature encoding and selection methods for learning an accurate model that is fast to compute and simple to interpret. Our results are validated over 95,854 kernel configurations and show that we can achieve low prediction errors over a reduced set of options. We also show that we can extract interpretable information for refining documentation and experts' knowledge of Linux, or even for assigning more sensible default values to options.
Document type :
Reports (Research report)

Cited literature [88 references]
Contributor : Mathieu Acher
Submitted on : Sunday, October 13, 2019 - 11:24:46 PM
Last modification on : Wednesday, October 26, 2022 - 8:14:36 AM




  • HAL Id : hal-02314830, version 1


Mathieu Acher, Hugo Martin, Juliana Alves Pereira, Arnaud Blouin, Jean-Marc Jézéquel, et al. Learning Very Large Configuration Spaces: What Matters for Linux Kernel Sizes. [Research Report] Inria Rennes - Bretagne Atlantique. 2019. ⟨hal-02314830⟩


