A. Nautsch, C. Jasserand, E. Kindt, M. Todisco, I. Trancoso et al., The GDPR & speech data: Reflections of legal and technology communities, first steps towards a common understanding, pp.3695-3699, 2019.

A. Nautsch, A. Jimenez, A. Treiber, J. Kolberg, C. Jasserand et al., Preserving privacy in speaker and speech characterisation, Computer Speech and Language, vol.58, pp.441-480, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02307615

, Deliverable Nº5.1: Data protection and GDPR requirements

A. Cohen-hadria, M. Cartwright, B. Mcfee, and J. P. Bello, Voice anonymization in urban sound recordings, 2019 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp.1-6, 2019.

F. Gontier, M. Lagrange, C. Lavandier, and J. Petiot, Privacy aware acoustic scene synthesis using deep spectral feature inversion, 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, p.2020
URL : https://hal.archives-ouvertes.fr/hal-02478866

M. A. Pathak, B. Raj, S. D. Rane, and P. Smaragdis, Privacypreserving speech processing: cryptographic and string-matching frameworks show promise, IEEE Signal Processing Magazine, vol.30, issue.2, pp.62-74, 2013.

P. Smaragdis and M. Shashanka, A framework for secure speech recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.4, pp.1404-1413, 2007.

S. Zhang, Y. Gong, and D. Yu, Encrypted speech recognition using deep polynomial networks, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5691-5695, 2019.

F. Brasser, T. Frassetto, K. Riedhammer, A. Sadeghi, T. Schneider et al., VoiceGuard: Secure and private speech processing, in Interspeech, pp.1303-1307, 2018.

D. Leroy, A. Coucke, T. Lavril, T. Gisselbrecht, and J. Dureau, Federated learning for keyword spotting, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6341-6345, 2019.

J. Geiping, H. Bauermeister, H. Dröge, and M. Moeller, Inverting gradients -how easy is it to break privacy in federated learning, 2020.

K. Hashimoto, J. Yamagishi, and I. Echizen, Privacy-preserving sound to degrade automatic speaker verification performance, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5500-5504, 2016.

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Voicemask: Anonymize and sanitize voice input on mobile devices, 2017.

Q. Jin, A. R. Toth, T. Schultz, and A. W. Black, Speaker deidentification via voice transformation, 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.529-533, 2009.

M. Pobar and I. Ip?i?, Online speaker de-identification using voice transformation, 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp.1264-1267, 2014.

F. Bahmaninezhad, C. Zhang, and J. H. Hansen, Convolutional neural network based speaker de-identification, pp.255-260, 2018.

F. Fang, X. Wang, J. Yamagishi, I. Echizen, M. Todisco et al., Speaker anonymization using x-vector and neural waveform models, Speech Synthesis Workshop, pp.155-160, 2019.

Y. Han, S. Li, Y. Cao, Q. Ma, and M. Yoshikawa, Voiceindistinguishability: Protecting voiceprint in privacy-preserving speech data release, 2020.

B. M. Srivastava, A. Bellet, M. Tommasi, and E. Vincent, Privacy-preserving adversarial representation learning in ASR: Reality or illusion?, in Interspeech, pp.3700-3704, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02166434

N. Tomashenko, B. M. Srivastava, X. Wang, E. Vincent, A. Nautsch et al., The VoicePrivacy 2020 Challenge evaluation plan, 2020.

J. Qian, F. Han, J. Hou, C. Zhang, Y. Wang et al., Towards privacy-preserving speech data publishing, 2018 IEEE Conference on Computer Communications (INFOCOM), pp.1079-1087, 2018.

B. M. Srivastava, N. Vauquier, M. Sahidullah, A. Bellet, M. Tommasi et al., Evaluating voice conversion-based privacy protection against informed attackers, 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.2020
URL : https://hal.archives-ouvertes.fr/hal-02355115

A. Nagrani, J. S. Chung, and A. Zisserman, VoxCeleb: a largescale speaker identification dataset, pp.2616-2620, 2017.

J. S. Chung, A. Nagrani, and A. Zisserman, VoxCeleb2: Deep speaker recognition, in Interspeech, pp.1086-1090, 2018.

V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, Lib-riSpeech: an ASR corpus based on public domain audio books, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5206-5210, 2015.

H. Zen, V. Dang, R. Clark, Y. Zhang, R. J. Weiss et al., LibriTTS: A corpus derived from LibriSpeech for text-to-speech, pp.1526-1530, 2019.

C. Veaux, J. Yamagishi, and K. Macdonald, CSTR VCTK corpus: English multi-speaker corpus for CSTR voice cloning toolkit (version 0.92), 2019.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi speech recognition toolkit, Tech. Rep, 2011.

D. Snyder, D. Garcia-romero, G. Sell, D. Povey, and S. Khudanpur, X-vectors: Robust DNN embeddings for speaker recognition, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5329-5333, 2018.

N. Brümmer and J. D. Preez, Application-independent evaluation of speaker detection, Computer Speech and Language, vol.20, issue.2-3, pp.230-275, 2006.

D. Ramos and J. Gonzalez-rodriguez, Cross-entropy analysis of the information in forensic speaker recognition, 2008.

D. Povey, G. Cheng, Y. Wang, K. Li, H. Xu et al., Semi-orthogonal low-rank matrix factorization for deep neural networks, in Interspeech, pp.3743-3747, 2018.

V. Peddinti, D. Povey, and S. Khudanpur, A time delay neural network architecture for efficient modeling of long temporal contexts, pp.3214-3218, 2015.

J. Lorenzo-trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio et al., The Voice Conversion Challenge 2018: Promoting development of parallel and nonparallel methods, pp.195-202, 2018.

X. Wang and J. Yamagishi, Neural harmonic-plus-noise waveform model with trainable maximum voice frequency for text-tospeech synthesis, Speech Synthesis Workshop, pp.1-6, 2019.

B. M. Srivastava, N. Tomashenko, X. Wang, E. Vincent, J. Yamagishi et al., Design choices for x-vector based speaker anonymization

J. Patino, M. Todisco, A. Nautsch, and N. Evans, Speaker anonymisation using the McAdams coefficient, Eurecom, Tech. Rep, vol.6190, 2020.