Static Analysis for Data Scientists - Inria - Institut national de recherche en sciences et technologies du numérique
Book chapter, Year: 2023

Static Analysis for Data Scientists

Abstract

Big data analytics has revolutionized the world of software development in the past decade. Every day, data scientists develop computer programs to gather, triage, and process data, in order to ultimately help us make data-driven decisions. As we rely more and more on such data-manipulating software, we become increasingly vulnerable to poor choices, wrong assumptions, or other (programming or technical) mistakes made during software development. Mistakes that do not cause software failures can have serious consequences, since they give no indication that something went wrong along the way. In safety-critical applications, such mistakes can be deadly. In this chapter, we present ongoing work to develop an abstract interpretation-based static analysis framework for data scientists. In particular, we focus on issues arising from unexpected data and describe the challenges involved in designing and developing a practical static analysis that infers necessary expectations on the data read and manipulated in Jupyter notebooks, an increasingly popular development environment among data scientists.
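To make the idea of "inferring expectations on the data" concrete, here is a minimal illustrative sketch. It is not the chapter's framework and does not use abstract interpretation: it is a purely syntactic toy that walks the syntax tree of a notebook cell and collects the column names the code reads from a data frame, without running the code. The function name `expected_columns`, the variable name `df`, and the sample cell are all hypothetical, and the sketch assumes Python 3.9 or later.

```python
import ast


def expected_columns(source: str, frame_name: str = "df") -> set[str]:
    """Return the string keys that `source` reads from the variable `frame_name`.

    Hypothetical helper: a syntactic approximation only. It ignores aliasing,
    control flow, and columns that the code itself created earlier.
    """
    columns = set()
    for node in ast.walk(ast.parse(source)):
        if (
            isinstance(node, ast.Subscript)
            and isinstance(node.ctx, ast.Load)           # reads only, not assignments
            and isinstance(node.value, ast.Name)
            and node.value.id == frame_name
            and isinstance(node.slice, ast.Constant)     # Python 3.9+ AST layout
            and isinstance(node.slice.value, str)
        ):
            columns.add(node.slice.value)
    return columns


# A made-up notebook cell; it is only parsed, never executed.
notebook_cell = '''
df = read_csv("grades.csv")
df["total"] = df["midterm"] + df["final"]
'''

print(sorted(expected_columns(notebook_cell)))
```

For this cell the sketch reports that the input is expected to contain the columns `final` and `midterm`. A sound analysis of the kind the abstract describes would also need to track value ranges, types, and data flow across cells, which this toy does not attempt.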
Main file
csv2023.pdf (252.02 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04249957, version 1 (19-10-2023)

License

Attribution


Cite

Caterina Urban. Static Analysis for Data Scientists. Challenges of Software Verification, 238, Springer Nature Singapore, pp.77-91, 2023, Intelligent Systems Reference Library, ⟨10.1007/978-981-19-9601-6_5⟩. ⟨hal-04249957⟩