A. Nautsch, A. Jimenez, A. Treiber, J. Kolberg, C. Jasserand et al., Preserving privacy in speaker and speech characterisation, Computer Speech and Language, vol.58, pp.441-480, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02307615

N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-end factor analysis for speaker verification, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, pp.788-798, 2010.

D. Snyder, D. Garcia-romero, G. Sell, D. Povey, and S. Khudanpur, X-vectors: Robust DNN embeddings for speaker recognition, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5329-5333, 2018.

K. Hashimoto, J. Yamagishi, and I. Echizen, Privacy-preserving sound to degrade automatic speaker verification performance, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5500-5504, 2016.

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Voicemask: Anonymize and sanitize voice input on mobile devices, 2017.

Q. Jin, A. R. Toth, T. Schultz, and A. W. Black, Speaker deidentification via voice transformation, 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.529-533, 2009.

M. Pobar and I. Ip?i?, Online speaker de-identification using voice transformation, 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp.1264-1267, 2014.

F. Bahmaninezhad, C. Zhang, and J. H. Hansen, Convolutional neural network based speaker de-identification, pp.255-260, 2018.

F. Fang, X. Wang, J. Yamagishi, I. Echizen, M. Todisco et al., Speaker anonymization using x-vector and neural waveform models, 10th ISCA Speech Synthesis Workshop, pp.155-160, 2019.

B. M. Srivastava, A. Bellet, M. Tommasi, and E. Vincent, Privacy-preserving adversarial representation learning in ASR: Reality or illusion?, in Interspeech, pp.3700-3704, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02166434

. Iso/iec, Information Technology -Biometric performance testing and reporting -Part 1: Principles and framework, 2006.

N. Brümmer and J. A. Du-preez, Application-independent evaluation of speaker detection, Computer Speech and Language, vol.20, issue.2-3, pp.230-275, 2006.

M. Gomez-barrero, J. Galbally, C. Rathgeb, and C. Busch, General framework to evaluate unlinkability in biometric template protection systems, IEEE Transactions on Information Forensics and Security, vol.13, issue.6, pp.1406-1420, 2017.

B. M. Srivastava, N. Vauquier, M. Sahidullah, A. Bellet, M. Tommasi et al., Evaluating voice conversion-based privacy protection against informed attackers, 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.2802-2806, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02355115

J. Qian, F. Han, J. Hou, C. Zhang, Y. Wang et al., Towards privacy-preserving speech data publishing, 2018 IEEE Conference on Computer Communications (INFOCOM), pp.1079-1087, 2018.

B. M. Srivastava, N. Vauquier, M. Sahidullah, A. Bellet, M. Tommasi et al., Evaluating voice conversion-based privacy protection against informed attackers, 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2802-2806, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02355115

D. A. Van-leeuwen and N. Brümmer, An introduction to application-independent evaluation of speaker recognition systems, Speaker Classification I: Fundamentals, Features, and Methods, pp.330-353, 2007.

B. M. Srivastava, N. Tomashenko, X. Wang, E. Vincent, J. Yamagishi et al., Design choices for x-vector based speaker anonymization
URL : https://hal.archives-ouvertes.fr/hal-02610447

V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, Librispeech: An ASR corpus based on public domain audio books, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5206-5210, 2015.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi speech recognition toolkit, 2011 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011.

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Hidebehind: Enjoy voice input with voiceprint unclonability and anonymity, 16th ACM Conference on Embedded Networked Sensor Systems (SenSys), pp.82-94, 2018.

D. Sundermann and H. Ney, VTLN-based voice conversion, 3rd IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp.556-559, 2003.

J. Chou and H. Lee, One-shot voice conversion by separating speaker and content representations with instance normalization, pp.664-668, 2019.

D. Ulyanov, A. Vedaldi, and V. Lempitsky, Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR, pp.6924-6932, 2017.

N. Tomashenko, B. M. Srivastava, X. Wang, E. Vincent, A. Nautsch et al., Introducing the VoicePrivacy initiative, Interspeech, submitted
URL : https://hal.archives-ouvertes.fr/hal-02562199

X. Wang and J. Yamagishi, Neural harmonic-plus-noise waveform model with trainable maximum voice frequency for textto-speech synthesis, 10th ISCA Speech Synthesis Workshop, pp.1-6, 2019.