Alexa vs. Siri vs. Cortana vs. Google Assistant: a comparison of speech-based natural user interfaces, International Conference on Applied Human Factors and Ergonomics, pp.241-250, 2017. ,
Next-generation of virtual personal assistants (Microsoft Cortana, Apple Siri, Amazon Alexa and Google Home), IEEE CCWC, pp.99-103, 2018. ,
The insecurity of home digital voice assistants -Amazon Alexa as a case study, 2017. ,
Alexa, can i trust you, Computer, vol.50, issue.9, pp.100-104, 2017. ,
Speaker identification and verification using Gaussian mixture speaker models, Speech Communication, vol.17, issue.1-2, pp.91-108, 1995. ,
Speech intention classification with multimodal deep learning, Canadian Conference on Artificial Intelligence, pp.260-271, 2017. ,
Prosody conveys speaker's intentions: Acoustic cues for speech act perception, Journal of Memory and Language, vol.88, pp.70-86, 2016. ,
Speech act classification: A study in the lexical analysis of English speech activity verbs, vol.8, 2013. ,
Dialog act modeling for conversational speech, AAAI Spring Symposium on Applying Machine Learning to Discourse Processing, pp.98-105, 1998. ,
Robust GMM based gender classification using pitch and RASTA-PLP parameters of speech, International Conference on Machine Learning and Cybernetics, pp.3376-3379, 2006. ,
Gender classification in two emotional speech databases, ICPR, pp.1-4, 2008. ,
Survey on speech emotion recognition: Features, classification schemes, and databases, Pattern Recognition, vol.44, issue.3, pp.572-587, 2011. ,
Automatic speech classification to five emotional states based on gender information, pp.341-344, 2004. ,
Emotion recognition by speech signals, EuroSpeech, 2003. ,
Feature analysis for automatic detection of pathological speech, 2nd Joint EMBS-BMES Conference, vol.1, pp.182-183, 2002. ,
Feature analysis of pathological speech signals using local discriminant bases technique, Medical and Biological Engineering and Computing, vol.43, issue.4, pp.457-464, 2005. ,
, The INTERSPEECH 2013 computational paralinguistics challenge: social signals, pp.148-152, 2013.
Computational paralinguistics: emotion, affect and personality in speech and language processing, 2013. ,
A survey on perceived speaker traits: Personality, likability, pathology, and the first challenge, Computer Speech & Language, vol.29, issue.1, pp.100-131, 2015. ,
Cultural and linguistic factors in audiovisual speech processing: The McGurk effect in Chinese subjects, Perception & Psychophysics, vol.59, issue.1, pp.73-80, 1997. ,
Social signal processing: Survey of an emerging domain, Image and Vision Computing, vol.27, issue.12, pp.1743-1759, 2009. ,
Privacy-preserving machine learning for speech processing, 2012. ,
Privacy preserving encrypted phonetic search of speech data, IEEE ICASSP, pp.6414-6418, 2017. ,
X-vectors: Robust DNN embeddings for speaker recognition, IEEE ICASSP, pp.5329-5333, 2018. ,
Espnet: End-to-end speech processing toolkit, pp.2207-2211, 2018. ,
Learning anonymized representations with adversarial neural networks, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01742447
Invariant representations for noisy speech recognition, 2016. ,
Speaker invariant feature extraction for zero-resource languages with adversarial learning, IEEE ICASSP, pp.2381-2385, 2018. ,
Speaker-invariant training via adversarial learning, IEEE ICASSP, pp.5969-5973, 2018. ,
To reverse the gradient or not: An empirical comparison of adversarial and multi-task learning in speech recognition, 2018. ,
Librispeech: an ASR corpus based on public domain audio books, IEEE ICASSP, pp.5206-5210, 2015. ,
Hybrid CTC/attention architecture for end-to-end speech recognition, IEEE Journal of Selected Topics in Signal Processing, vol.11, issue.8, pp.1240-1253, 2017. ,
Domainadversarial training of neural networks, JMLR, vol.17, issue.1, pp.2096-2030, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01624607
A study on data augmentation of reverberant speech for robust speech recognition, IEEE, pp.5220-5224, 2017. ,
MUSAN: A music, speech, and noise corpus, 2015. ,
The Kaldi speech recognition toolkit, Tech. Rep, 2011. ,
Attention-based models for speech recognition, NIPS, pp.577-585, 2015. ,
Fully convolutional speech recognition, 2018. ,