How to select predictive models for decision making or causal inference? - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2023

Wie wählt man Vorhersagemodelle für die Entscheidungsfindung oder für kausale Schlussfolgerungen aus?

How to select predictive models for decision making or causal inference?

Comment sélectionner des models prédictifs pour la prise de décision ou l'inférence causale ?

Résumé

Objective: We investigate which procedure selects the predictive model most trustworthy to reason on the effect of an intervention and support decision making. Methods: We study such a variety of model selection procedures in practical settings: finite sam- ples settings and without theoretical assumption of well-specified models. Beyond standard cross- validation or internal validation procedures, we also study elaborate causal risks. These build proxies of the causal error using “nuisance” re-weighting to compute it on the observed data. We evaluate whether empirically estimated nuisances, which are necessarily noisy, add noise to model selection. We compare different metrics for causal model selection in an extensive empirical study based on a simulation and three healthcare datasets based on real covariates. Results: Among all metrics, the mean squared error, classically used to evaluate predictive modes, is worse. Re-weighting it with propensity score does not bring much improvements. The R-risk, which uses as nuisances a model of mean outcome and propensity scores, leads to the best performances. Nuisance corrections are best estimated with flexible estimators such as a super learner. Conclusions: When predictive models are used to reason on the effect of an intervention, they must be evaluated with different procedures than standard predictive settings; using the R-risk from causal inference.
Fichier principal
Vignette du fichier
how_to_select_causal_models.pdf (11.77 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03946902 , version 1 (19-01-2023)
hal-03946902 , version 2 (24-05-2023)
hal-03946902 , version 3 (16-06-2023)
hal-03946902 , version 4 (08-01-2024)

Licence

Licence Ouverte - etalab

Identifiants

Citer

Matthieu Doutreligne, Gaël Varoquaux. How to select predictive models for decision making or causal inference?. 2023. ⟨hal-03946902v4⟩
234 Consultations
259 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More