Techniques for Noise Robustness in Automatic Speech Recognition, 2012. ,
Robust Automatic Speech Recognition -A Bridge to Practical Applications. Elsevier, 2015. ,
New Era for Robust Speech Recognition -Exploiting Deep Learning, 2017. ,
Audio Source Separation and Speech Enhancement, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01881431
Audio Source Separation, 2018. ,
URL : https://hal.archives-ouvertes.fr/inria-00544199
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques, IEEE Signal Processing Magazine, vol.36, issue.6, pp.111-124, 2019. ,
SPEECHDAT-CAR. a large speech database for automotive environments, Proc. 2nd Int. Conf. on Language Resources and Evaluation (LREC), 2000. ,
CU-Move": Analysis & corpus development for interactive in-vehicle speech systems, Proc. Eurospeech, pp.2023-2026, 2001. ,
The translingual English database (TED), Proc. 3rd Int. Conf. on Spoken Language Processing (ICSLP), 1994. ,
Recognition of overlapping speech using digital MEMS microphone arrays, Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp.7068-7072, 2013. ,
The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes, Computer Speech and Language, vol.46, pp.605-626, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01382108
An analysis of environment, microphone and data simulation mismatches in robust speech recognition, Computer Speech and Language, vol.46, pp.535-557, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01399180
The ETAPE corpus for the evaluation of speechbased TV content processing in the French language, Proc. 8th ,
URL : https://hal.archives-ouvertes.fr/hal-00712591
, on Language Resources and Evaluation (LREC), pp.114-118, 2012.
The MGB challenge: Evaluating multi-genre broadcast media recognition, Proc. IEEE Automatic Speech Recognition and Understanding Workshop, pp.687-693, 2015. ,
The PASCAL CHiME speech separation and recognition challenge, Computer Speech and Language, vol.27, issue.3, pp.621-633, 2013. ,
URL : https://hal.archives-ouvertes.fr/inria-00584051
The second CHiME speech separation and recognition challenge: Datasets, tasks and baselines, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp.126-130, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00796625
WOZ acoustic data collection for interactive TV, Proc. 6th Int. Conf. on Language Resources and Evaluation (LREC), pp.2330-2334, 2008. ,
The Sweet-Home speech and multimodal corpus for home automation interaction, Proc. 9th Int. Conf. on Language Resources and Evaluation (LREC), pp.4499-4509, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00953006
The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments, Proc. IEEE Automatic Speech Recognition and Understanding Workshop, pp.275-282, 2015. ,
A French corpus for distant-microphone speech processing in real homes, Proc. Interspeech, pp.2781-2785, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01343060
VoiceHome-2, an extended corpus for multichannel speech processing in real homes, Speech Communication, vol.106, pp.68-78, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01923108
Toward human parity in conversational speech recognition, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.12, pp.2410-2423, 2017. ,
English conversational telephone speech recognition by humans and machines, Proc. Interspeech, pp.132-136, 2017. ,
SWITCHBOARD: Telephone speech corpus for research and development, Proc. IEEE International Conf. on Acoustics, Speech, and Signal Proc. (ICASSP), vol.1, pp.517-520, 1992. ,
The automatic speech recognition in reverberant environments (ASpIRE) challenge, Proc. IEEE Automatic Speech Recognition and Understanding Workshop, pp.547-554, 2015. ,
The ICSI meeting corpus, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp.364-367, 2003. ,
The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms, Language Resources and Evaluation, vol.41, issue.3-4, pp.389-407, 2007. ,
Interpretation of multiparty meetings: The AMI and AMIDA projects, Proc. 2nd Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), pp.115-118, 2008. ,
, Lincoln laboratory speech enhancement corpus, LLSEC, 1996.
The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments, Computer Speech and Language, vol.26, issue.1, pp.52-66, 2011. ,
The Sheffield wargames corpus, Proc. Interspeech, pp.1116-1120, 2013. ,
Voices obscured in complex environmental settings (VOiCES) corpus, Proc. Interspeech, pp.1566-1570, 2018. ,
, DiPCo-dinner party corpus, 2019.
The fifth 'CHiME' speech separation and recognition challenge: Dataset, task and baselines, Proc. Interspeech, pp.1561-1565, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01744021
The second DIHARD diarization challenge: Dataset, task, and baselines, Proc. Interspeech, pp.978-982, 2019. ,
The Kaldi speech recognition toolkit, Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2011. ,
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system, Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp.6665-6669, 2019. ,
Semi-orthogonal low-rank matrix factorization for deep neural networks, Proc. Interspeech, pp.3743-3747, 2018. ,
Front-end processing for the CHiME-5 dinner party scenario, Proc. 5th Int. Workshop on Speech Processing in Everyday Environments, pp.35-40, 2018. ,
Acoustic beamforming for speaker diarization of meetings, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.7, pp.2011-2021, 2007. ,
NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing, ITG Fachtagung Sprachkommunikation (ITG), 2018. ,
Speech dereverberation based on variance-normalized delayed linear prediction, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1717-1731, 2010. ,
Voxceleb: Large-scale speaker verification in the wild, Computer Speech and Language, vol.60, p.101027, 2020. ,
Diarization is hard: Some experiences and lessons learned for the JHU team in the inaugural DIHARD challenge, Proc. Interspeech, pp.2808-2812, 2018. ,
Bayesian speaker verification with heavy tailed priors, Proc. Odyssey, 2010. ,
Acoustic modelling from the signal domain using CNNs, Proc. Interspeech, pp.3434-3438, 2016. ,
X-vectors: Robust DNN embeddings for speaker recognition, Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp.5329-5333, 2018. ,
Speaker diarization: A review of recent research, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.356-370, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00733397
,
The USTC-iFlytek systems for CHiME-5 challenge, Proc. 5th Int. Workshop on Speech Processing in Everyday Environments, pp.11-15, 2018. ,