Explaining Predictive Models with Mixed Features Using Shapley Values and Conditional Inference Trees - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Explaining Predictive Models with Mixed Features Using Shapley Values and Conditional Inference Trees

Annabelle Redelmeier
  • Fonction : Auteur
  • PersonId : 1115801
Martin Jullum
  • Fonction : Auteur
  • PersonId : 1115802
Kjersti Aas
  • Fonction : Auteur
  • PersonId : 1115803

Résumé

It is becoming increasingly important to explain complex, black-box machine learning models. Although there is an expanding literature on this topic, Shapley values stand out as a sound method to explain predictions from any type of machine learning model. The original development of Shapley values for prediction explanation relied on the assumption that the features being described were independent. This methodology was then extended to explain dependent features with an underlying continuous distribution. In this paper, we propose a method to explain mixed (i.e. continuous, discrete, ordinal, and categorical) dependent features by modeling the dependence structure of the features using conditional inference trees. We demonstrate our proposed method against the current industry standards in various simulation studies and find that our method often outperforms the other approaches. Finally, we apply our method to a real financial data set used in the 2018 FICO Explainable Machine Learning Challenge and show how our explanations compare to the FICO challenge Recognition Award winning team.
Fichier principal
Vignette du fichier
497121_1_En_7_Chapter.pdf (430.89 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03414718 , version 1 (04-11-2021)

Licence

Paternité

Identifiants

Citer

Annabelle Redelmeier, Martin Jullum, Kjersti Aas. Explaining Predictive Models with Mixed Features Using Shapley Values and Conditional Inference Trees. 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2020, Dublin, Ireland. pp.117-137, ⟨10.1007/978-3-030-57321-8_7⟩. ⟨hal-03414718⟩
43 Consultations
53 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More