É. Székely, G. E. Henter, J. Beskow, and J. Gustafson, Spontaneous conversational speech synthesis from found data, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019.

V. Vestman, T. Kinnunen, R. González Hautamäki, and M. Sahidullah, Voice Mimicry Attacks Assisted by Automatic Speaker Verification, Comput Speech Lang, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02161773

D. Snyder, D. Garcia-Romero, G. Sell, D. Povey, S. Khudanpur et al., ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018.

B. Srivastava, N. Vauquier, M. Sahidullah, A. Bellet, M. Tommasi et al., Evaluating Voice Conversion-based Privacy Protection against Informed Attackers, ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02355115

B. M. L. Srivastava, A. Bellet, M. Tommasi, and E. Vincent, Privacy-preserving adversarial representation learning in ASR: Reality or illusion?, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02166434

S. Ribaric, A. Ariyaeeinia, and N. Pavesic, De-identification for privacy protection in multimedia content: A survey, Signal Process Image Commun, 2016.

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Hidebehind: Enjoy voice input with voiceprint unclonability and anonymity, SenSys 2018 - Proceedings of the 16th Conference on Embedded Networked Sensor Systems, 2018.

N. Tomashenko, B. Srivastava, X. Wang, E. Vincent, A. Nautsch et al., The VoicePrivacy 2020 Challenge Evaluation Plan

J. Qian, H. Du, J. Hou, L. Chen, T. Jung et al., Anonymize and Sanitize Voice Input on Mobile Devices, 2017.

S. Ioffe, Probabilistic linear discriminant analysis, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2006.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi Speech Recognition Toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, 2011.

J. S. Chung, A. Nagrani, and A. Zisserman, VoxCeleb2: Deep speaker recognition, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018.

M. Pinnis, I. Auziņa, and K. Goba, Designing the Latvian Speech Recognition Corpus, Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC'14), pp.1547-1553, 2014.

A. Salimbajevs, Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data, in N. Calzolari, K. Choukri, C. Cieri, T. Declerck et al. (eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation, European Language Resources Association (ELRA), 2018.

D. Povey, G. Cheng, Y. Wang, K. Li, H. Xu et al., Semi-orthogonal low-rank matrix factorization for deep neural networks, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018.

D. Povey, V. Peddinti, D. Galvez, P. Ghahremani, V. Manohar et al., Purely sequence-trained neural networks for ASR based on lattice-free MMI, Proceedings of the Annual Conference of the International Speech Communication Association, pp.2751-2756, 2016.

H. Hadian, H. Sameti, D. Povey, and S. Khudanpur, End-to-end speech recognition using lattice-free MMI, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018.

D. B. Paul and J. M. Baker, The design for the Wall Street Journal-based CSR corpus, Proceedings of the workshop on Speech and Natural Language, pp.357-362, 1992.

P. Smit, S. Virpioja, and M. Kurimo, Improved Subword Modeling for WFST-Based Speech Recognition, Proceedings of the Annual Conference of the International Speech Communication Association, pp.2551-2556