Skip to Main content Skip to Navigation
Conference papers

The Impact of Generic Data Structures: Decoding the Role of Lists in the Linux Kernel

Abstract : The increasing adoption of the Linux kernel has been sustained by a large and constant maintenance effort, performed by a wide and heterogeneous base of contributors. One important problem that maintainers face in any code base is the rapid understanding of complex data structures. The Linux kernel is written in the C language, which enables the definition of arbitrarily uninformative datatypes, via the use of casts and pointer arithmetic, of which doubly linked lists are a prominent example. In this paper, we explore the advantages and disadvantages of such lists, for expressivity, for code understanding, and for code reliability. Based on our observations, we have developed a toolset that includes inference of descriptive list types and a tool for list visualization. Our tools identify more than 10,000 list fields and variables in recent Linux kernel releases and succeeds in typing 90%. We show how these tools could have been used to detect previously fixed bugs and identify 6 new ones.
Complete list of metadatas

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/hal-02931554
Contributor : Eugene Volanschi <>
Submitted on : Thursday, September 10, 2020 - 10:10:22 AM
Last modification on : Wednesday, September 23, 2020 - 5:02:39 PM

Identifiers

Collections

Citation

Nic Volanschi, Julia Lawall. The Impact of Generic Data Structures: Decoding the Role of Lists in the Linux Kernel. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE ’20), Sep 2020, Virtual Event, Australia. ⟨10.1145/3324884.3416635⟩. ⟨hal-02931554v2⟩

Share

Metrics

Record views

39

Files downloads

69