Using privacy-transformed speech in the automatic speech recognition acoustic model training

Askars Salimbajevs

Conference paper, Year: 2020

1 Tilde
2 University of Latvia

Abstract

Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes, such as identity, that can be inferred and exploited for malicious purposes. Therefore, there is interest in collecting anonymized speech data that has been processed by a voice conversion method. In this paper, we evaluate one voice conversion method on Latvian speech data and investigate whether privacy-transformed data can be used to improve ASR acoustic models. The results show that voice conversion is effective against state-of-the-art speaker verification models on Latvian speech and that privacy-transformed data is effective in ASR training.

Keywords

automatic speech recognition, voice conversion, privacy, anonymization, evaluation, automatic speaker verification

Domains

Computation and Language [cs.CL], Machine Learning [cs.LG]

Main file

Voice_privacy_transform_ASR.pdf (161.43 KB)

Origin: Files produced by the author(s)

Contributor: Emmanuel Vincent

https://inria.hal.science/hal-02907056

Submitted on: Monday, 27 July 2020, 09:43:27

Last modified on: Tuesday, 28 July 2020, 08:53:12

Long-term archiving on: Tuesday, 1 December 2020, 07:12:57

Dates and versions

hal-02907056, version 1 (27-07-2020)

Identifiers

HAL Id: hal-02907056, version 1

Cite

Askars Salimbajevs. Using privacy-transformed speech in the automatic speech recognition acoustic model training. 9th International Conference on Human Language Technologies - the Baltic Perspective (Baltic HLT 2020), Sep 2020, Kaunas, Lithuania. ⟨hal-02907056⟩
