Skip to Main content Skip to Navigation
Conference papers

Using privacy-transformed speech in the automatic speech recognition acoustic model training

Abstract : Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is a interest in collection of the anonymized speech data that is processed by some voice conversion method. In this paper we evaluate one of voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and effectiveness of using privacy-transformed data in ASR training.
Complete list of metadatas

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-02907056
Contributor : Emmanuel Vincent <>
Submitted on : Monday, July 27, 2020 - 9:43:27 AM
Last modification on : Tuesday, July 28, 2020 - 8:53:12 AM

File

Voice_privacy_transform_ASR.pd...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02907056, version 1

Collections

Citation

Askars Salimbajevs. Using privacy-transformed speech in the automatic speech recognition acoustic model training. 9th International Conference on Human Language Technologies - the Baltic Perspective (Baltic HLT 2020), Sep 2020, Kaunas, Lithuania. ⟨hal-02907056⟩

Share

Metrics

Record views

30

Files downloads

73