Faithful and Robust Local Interpretability for Textual Predictions

Gianluigi Lopardo; Frederic Precioso; Damien Garreau

Preprint, Working Paper. Year: 2023



Gianluigi Lopardo

Role: Author
PersonId : 1239249

Modèles et algorithmes pour l’intelligence artificielle

Université Côte d'Azur

Frederic Precioso

Role: Author

Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis

Modèles et algorithmes pour l’intelligence artificielle

Damien Garreau

Role: Author

Modèles et algorithmes pour l’intelligence artificielle

Université Côte d'Azur

Laboratoire Jean Alexandre Dieudonné

Abstract

Interpretability is essential for machine learning models to be trusted and deployed in critical domains. However, existing methods for interpreting text models are often complex, lack solid mathematical foundations, and their performance is not guaranteed. In this paper, we propose FRED (Faithful and Robust Explainer for textual Documents), a novel method for interpreting predictions over text. FRED identifies key words in a document that significantly impact the prediction when removed. We establish the reliability of FRED through formal definitions and theoretical analyses on interpretable classifiers. Additionally, our empirical evaluation against state-of-the-art methods demonstrates the effectiveness of FRED in providing insights into text models.
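The idea stated in the abstract, scoring words by how much the prediction changes when they are removed, can be illustrated with a minimal leave-one-out sketch. This is not the FRED algorithm from the paper, only a simplified illustration of removal-based importance. The names `toy_classifier` and `rank_words_by_removal`, and the keyword-based model itself, are hypothetical and chosen for this example.

```python
import math
from typing import Callable, List, Tuple


def toy_classifier(words: List[str]) -> float:
    """Hypothetical stand-in model: returns a positive-sentiment score in (0, 1)."""
    positive = {"great", "excellent", "love"}
    negative = {"boring", "terrible", "hate"}
    raw = sum(w in positive for w in words) - sum(w in negative for w in words)
    return 1.0 / (1.0 + math.exp(-raw))  # logistic squashing of the keyword count


def rank_words_by_removal(
    words: List[str], model: Callable[[List[str]], float]
) -> List[Tuple[str, float]]:
    """Score each word by the drop in the model output when that word is removed.

    Assumption for this sketch: words are removed one at a time (leave-one-out).
    """
    base = model(words)
    drops = []
    for i, word in enumerate(words):
        reduced = words[:i] + words[i + 1:]  # document without the i-th word
        drops.append((word, base - model(reduced)))
    # Largest drop first: these words support the prediction the most.
    return sorted(drops, key=lambda pair: pair[1], reverse=True)


document = "the film was great but the ending was boring".split()
ranking = rank_words_by_removal(document, toy_classifier)
```

In this toy setting, `ranking` lists each word with the change in score caused by its removal. A positive value means the word pushed the prediction up, and a negative value means it pushed the prediction down. Removing one word at a time ignores interactions between words, which is one reason a practical explainer may need to consider subsets of words instead.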

Domains

Computer Science [cs], Statistics [stat]


https://inria.hal.science/hal-04394149

Submitted on: Monday, January 15, 2024, 11:21:57

Last modified on: Tuesday, April 30, 2024, 13:41:35

Dates and versions

hal-04394149, version 1 (15-01-2024)

License

Attribution

Identifiers

HAL Id: hal-04394149, version 1
arXiv: 2311.01605

Cite

Gianluigi Lopardo, Frederic Precioso, Damien Garreau. Faithful and Robust Local Interpretability for Textual Predictions. 2024. ⟨hal-04394149⟩


Collections

CNRS INRIA I3S INSMI DIEUDONNE INRIA2 UNIV-COTEDAZUR ANR

21 views

0 downloads
