Perfectly Parallel Fairness Certification of Neural Networks

Caterina Urban; Maria Christakis; Valentin Wüstholz; Fuyuan Zhang

Preprint, Working Paper. Year: 2019


Caterina Urban

Role: Author
PersonId: 1061085
IdHAL: caterina

Département d'informatique - ENS Paris

Analyse Statique par Interprétation Abstraite

Maria Christakis

Role: Author

Max Planck Institute for Software Systems

Valentin Wüstholz

Role: Author
PersonId: 1060456

ConsenSys Diligence

Fuyuan Zhang

Role: Author
PersonId: 1060457

Max Planck Institute for Software Systems

Abstract

There is growing concern that machine-learning models, which currently assist or even automate decision making, reproduce, and in the worst case reinforce, the bias of their training data. The development of tools and techniques for certifying the fairness of these models, or for describing their biased behavior, is therefore critical. In this paper, we propose a perfectly parallel static analysis for certifying causal fairness of feed-forward neural networks used for classification tasks. When certification succeeds, our approach provides definite guarantees; otherwise, it describes and quantifies the biased behavior. We design the analysis to be sound, in practice also exact, and configurable in terms of scalability and precision, thereby enabling pay-as-you-go certification. We implement our approach in an open-source tool and demonstrate its effectiveness on models trained with popular datasets.
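
As a rough illustration of the property the abstract refers to, the sketch below checks whether the class predicted by a tiny feed-forward ReLU network changes when only a sensitive input is varied. Everything in it is an assumption made for illustration: the two toy networks, the choice of input 0 as the sensitive input, the grid of non-sensitive values, and the helper names `classify` and `find_bias`. It enumerates sample points, so it can expose bias but cannot certify fairness; it is not the static analysis proposed in the paper.

```python
def affine(weights, bias, v):
    # One dense layer: each row of `weights` produces one output value.
    return [sum(w * x for w, x in zip(row, v)) + b for row, b in zip(weights, bias)]


def classify(layers, v):
    """Run a feed-forward ReLU network; return the index of the largest output."""
    for i, (weights, bias) in enumerate(layers):
        v = affine(weights, bias, v)
        if i < len(layers) - 1:  # ReLU on hidden layers only
            v = [max(0.0, x) for x in v]
    return max(range(len(v)), key=lambda k: v[k])


def find_bias(layers, sensitive_values, other_grid):
    """Return the non-sensitive inputs whose predicted class depends on the sensitive input."""
    biased = []
    for rest in other_grid:
        classes = {classify(layers, [s, *rest]) for s in sensitive_values}
        if len(classes) > 1:
            biased.append(rest)
    return biased


# Hypothetical toy networks: input 0 is sensitive, input 1 is not.
identity_out = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
fair_net = [([[0.0, 1.0], [0.0, -1.0]], [0.0, 1.0]), identity_out]    # ignores input 0
biased_net = [([[0.5, 1.0], [0.0, -1.0]], [0.0, 1.0]), identity_out]  # uses input 0

sensitive_values = [0.0, 1.0]
other_grid = [(k / 10,) for k in range(11)]  # sampled non-sensitive values in [0, 1]

fair_result = find_bias(fair_net, sensitive_values, other_grid)
biased_result = find_bias(biased_net, sensitive_values, other_grid)
print("fair_net biased points:", fair_result)
print("biased_net biased points:", biased_result)
print("biased fraction of the grid:", len(biased_result) / len(other_grid))
```

On this grid, the first network shows no dependence on the sensitive input, while the second changes class at two of the eleven sampled points. Such a count only hints at what quantifying biased behavior might look like; a sound guarantee requires reasoning about all inputs rather than a finite sample.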

Domains

Programming Languages [cs.PL]; Logic in Computer Science [cs.LO]; Computers and Society [cs.CY]

Main file

fairness.pdf (1.11 MB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-02404036

Submitted on: Wednesday, 11 December 2019, 09:57:58

Last modified on: Friday, 19 April 2024, 16:18:56

Long-term archiving on: Thursday, 12 March 2020, 16:50:56

Dates and versions

hal-02404036, version 1 (11 December 2019)

Identifiers

HAL Id: hal-02404036, version 1

Cite

Caterina Urban, Maria Christakis, Valentin Wüstholz, Fuyuan Zhang. Perfectly Parallel Fairness Certification of Neural Networks. 2019. ⟨hal-02404036⟩


Collections

ENS-PARIS CNRS INRIA INRIA2 TDS-MACS PSL

