Conference paper, 2020

Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks

Abstract

Sensitive information present in training data poses a privacy concern for applications, since its unintended memorization during training can make models susceptible to membership inference and attribute inference attacks. In this paper, we investigate this problem for various pre-trained word embeddings (GloVe, ELMo and BERT) with the help of language models built on top of them. In particular, sequences containing sensitive information such as a single-word disease name or a 4-digit PIN are first randomly inserted into the training data, then a language model is trained using the word vectors as input features, and memorization is measured with a metric termed exposure. The embedding dimension, the number of training epochs, and the length of the secret information were observed to affect memorization in pre-trained embeddings. Finally, to address the problem, differentially private language models were trained to reduce the exposure of sensitive information.
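For context, the exposure metric referred to in the abstract is conventionally defined (following Carlini et al.'s "secret sharer" methodology) as the log-size of the candidate secret space minus the log-rank of the inserted secret under the trained model. The sketch below illustrates that rank-based computation only; it is not taken from the paper, and `model_log_perplexity` is a hypothetical placeholder for scoring a candidate sequence with the trained language model.

```python
import math

def exposure(secret_log_perplexity, candidate_log_perplexities):
    """Rank-based exposure of an inserted secret.

    exposure = log2(|R|) - log2(rank), where R is the space of candidate
    secrets (e.g. all 10^4 four-digit PINs) and rank is the position of
    the inserted secret when all candidates are sorted by the model's
    log-perplexity (lower log-perplexity = more likely under the model).
    """
    total = len(candidate_log_perplexities)
    # Rank 1 means the inserted secret has the lowest log-perplexity of
    # all candidates, i.e. the strongest sign of memorization.
    rank = 1 + sum(1 for lp in candidate_log_perplexities
                   if lp < secret_log_perplexity)
    return math.log2(total) - math.log2(rank)

# Hypothetical usage, assuming model_log_perplexity(text) scores a
# sequence with the trained language model:
# pins = [f"{i:04d}" for i in range(10_000)]
# scores = [model_log_perplexity("my PIN is " + p) for p in pins]
# print(exposure(model_log_perplexity("my PIN is 1234"), scores))
```

With this definition, exposure is maximal (log2 of the candidate-space size) when the inserted secret ranks first among all candidates, and close to zero when it is indistinguishable from a random candidate.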
Main file: ThomasA+20.pdf (347.1 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02880590, version 1 (25-06-2020)

Identifiers

  • HAL Id: hal-02880590, version 1

Cite

Aleena Thomas, David Ifeoluwa Adelani, Ali Davody, Aditya Mogadala, Dietrich Klakow. Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks. 23rd International Conference on Text, Speech and Dialogue, Sep 2020, Brno, Czech Republic. ⟨hal-02880590⟩
