Model Interpretability through the Lens of Computational Complexity

Pablo Barceló; Mikaël Monet; Jorge A. Perez; Bernardo Subercaseaux

Communication Dans Un Congrès Année : 2020

Model Interpretability through the Lens of Computational Complexity

(1) , (2) , (3, 1) , (3, 1)

1
2
3

Pablo Barceló

Fonction : Auteur

Millennium Institute for Foundational Research on Data

Mikaël Monet

Fonction : Auteur
PersonId : 1085321

Linking Dynamic Data

Jorge A. Perez

Fonction : Auteur
PersonId : 998096

Department of Computer Science

Millennium Institute for Foundational Research on Data

Bernardo Subercaseaux

Fonction : Auteur

Department of Computer Science

Millennium Institute for Foundational Research on Data

Résumé

In spite of several claims stating that some models are more interpretable than others -- e.g., "linear models are more interpretable than deep neural networks" -- we still lack a principled notion of interpretability to formally compare among different classes of models. We make a step towards such a notion by studying whether folklore interpretability claims have a correlate in terms of computational complexity theory. We focus on local post-hoc explainability queries that, intuitively, attempt to answer why individual inputs are classified in a certain way by a given model. In a nutshell, we say that a class $\mathcal{C}_1$ of models is more interpretable than another class $\mathcal{C}_2$, if the computational complexity of answering post-hoc queries for models in $\mathcal{C}_2$ is higher than for those in $\mathcal{C}_1$. We prove that this notion provides a good theoretical counterpart to current beliefs on the interpretability of models; in particular, we show that under our definition and assuming standard complexity-theoretical assumptions (such as P$\neq$NP), both linear and tree-based models are strictly more interpretable than neural networks. Our complexity analysis, however, does not provide a clear-cut difference between linear and tree-based models, as we obtain different results depending on the particular post-hoc explanations considered. Finally, by applying a finer complexity analysis based on parameterized complexity, we are able to prove a theoretical result suggesting that shallow neural networks are more interpretable than deeper ones.

Domaines

Informatique [cs] Intelligence artificielle [cs.AI] Complexité [cs.CC]

Fichier principal

2010.12265.pdf (681.7 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Mikaël Monet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03052508

Soumis le : jeudi 10 décembre 2020-16:12:54

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-03052508 , version 1 (10-12-2020)

Identifiants

HAL Id : hal-03052508 , version 1
ARXIV : 2010.12265

Citer

Pablo Barceló, Mikaël Monet, Jorge A. Perez, Bernardo Subercaseaux. Model Interpretability through the Lens of Computational Complexity. NeurIPS 2020, Dec 2020, Held online, United States. ⟨hal-03052508⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRIA-CHILE CRISTAL INRIA2 CRISTAL-LINKS UNIV-LILLE

60 Consultations

50 Téléchargements

Model Interpretability through the Lens of Computational Complexity

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager