Skip to Main content Skip to Navigation
Conference papers

The Impact of Generic Data Structures: Decoding the Role of Lists in the Linux Kernel

Abstract : The increasing adoption of the Linux kernel has been sustained by a large and constant maintenance effort, performed by a wide and heterogeneous base of contributors. One important problem that maintainers face in any code base is the rapid understanding of complex data structures. The Linux kernel is written in the C language, which enables the definition of arbitrarily uninformative datatypes, via the use of casts and pointer arithmetic, of which doubly linked lists are a prominent example. In this paper, we explore the advantages and disadvantages of such lists, for expressivity, for code understanding, and for code reliability. Based on our observations, we have developed a toolset that includes inference of descriptive list types and a tool for list visualization. Our tools identify more than 10,000 list fields and variables in recent Linux kernel releases and succeeds in typing 90%. We show how these tools could have been used to detect previously fixed bugs and identify 6 new ones.
Complete list of metadata

Cited literature [24 references]  Display  Hide  Download
Contributor : Eugene Volanschi Connect in order to contact the contributor
Submitted on : Thursday, September 10, 2020 - 10:10:22 AM
Last modification on : Thursday, September 29, 2022 - 4:53:59 AM



Nic Volanschi, Julia Lawall. The Impact of Generic Data Structures: Decoding the Role of Lists in the Linux Kernel. ASE 2020 - 35th IEEE/ACM International Conference on Automated Software Engineering, Sep 2020, Melbourne / Virtual, Australia. ⟨10.1145/3324884.3416635⟩. ⟨hal-02931554v2⟩



Record views


Files downloads