How to select predictive models for decision making or causal inference? - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Preprints, Working Papers, ... Year : 2023

Wie wählt man Vorhersagemodelle für die Entscheidungsfindung oder für kausale Schlussfolgerungen aus?

How to select predictive models for decision making or causal inference?

Comment sélectionner des models prédictifs pour la prise de décision ou l'inférence causale ?

Abstract

Objective: We investigate which procedure selects the predictive model most trustworthy to reason on the effect of an intervention and support decision making. Methods: We study such a variety of model selection procedures in practical settings: finite sam- ples settings and without theoretical assumption of well-specified models. Beyond standard cross- validation or internal validation procedures, we also study elaborate causal risks. These build proxies of the causal error using “nuisance” re-weighting to compute it on the observed data. We evaluate whether empirically estimated nuisances, which are necessarily noisy, add noise to model selection. We compare different metrics for causal model selection in an extensive empirical study based on a simulation and three healthcare datasets based on real covariates. Results: Among all metrics, the mean squared error, classically used to evaluate predictive modes, is worse. Re-weighting it with propensity score does not bring much improvements. The R-risk, which uses as nuisances a model of mean outcome and propensity scores, leads to the best performances. Nuisance corrections are best estimated with flexible estimators such as a super learner. Conclusions: When predictive models are used to reason on the effect of an intervention, they must be evaluated with different procedures than standard predictive settings; using the R-risk from causal inference.
Fichier principal
Vignette du fichier
how_to_select_causal_models.pdf (11.77 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03946902 , version 1 (19-01-2023)
hal-03946902 , version 2 (24-05-2023)
hal-03946902 , version 3 (16-06-2023)
hal-03946902 , version 4 (08-01-2024)

Licence

Open licence - etalab

Identifiers

Cite

Matthieu Doutreligne, Gaël Varoquaux. How to select predictive models for decision making or causal inference?. 2023. ⟨hal-03946902v4⟩
236 View
279 Download

Altmetric

Share

Gmail Facebook X LinkedIn More