Spatially Robust Far-field Beamforming Using the von Mises(-Fisher) Distribution, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.12, pp.2189-2197, 2015. ,
DOI : 10.1109/TASLP.2015.2473684
Acoustic Beamforming for Speaker Diarization of Meetings, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.7, pp.2011-2023, 2007. ,
DOI : 10.1109/TASL.2007.902460
Equivalence between Frequency-Domain Blind Source Separation and Frequency-Domain Adaptive Beamforming for Convolutive Mixtures, EURASIP Journal on Advances in Signal Processing, vol.2003, issue.11, pp.1157-1166, 2003. ,
DOI : 10.1155/S1110865703305074
Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.496-503, 2015. ,
DOI : 10.1109/ASRU.2015.7404836
Developments and directions in speech recognition and understanding, Part 1 [DSP Education], IEEE Signal Processing Magazine, vol.26, issue.3, pp.75-80, 2009. ,
DOI : 10.1109/MSP.2009.932166
Robust coherence-based spectral enhancement for distant speech recognition, 2015. ,
The third ???CHiME??? speech separation and recognition challenge: Dataset, task and baselines, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.504-511, 2015. ,
DOI : 10.1109/ASRU.2015.7404837
URL : https://hal.archives-ouvertes.fr/hal-01211376
The third ???CHiME??? speech separation and recognition challenge: Analysis and outcomes, Computer Speech & Language, 2016. ,
DOI : 10.1016/j.csl.2016.10.005
URL : https://hal.archives-ouvertes.fr/hal-01382108
The PASCAL CHiME speech separation and recognition challenge, Computer Speech & Language, vol.27, issue.3, pp.621-633, 2013. ,
DOI : 10.1016/j.csl.2012.10.004
URL : https://hal.archives-ouvertes.fr/hal-00646370
The MGB challenge: Evaluating multi-genre broadcast media recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.687-693, 2015. ,
DOI : 10.1109/ASRU.2015.7404863
On the relationship between Early-to-Late Ratio of Room Impulse Responses and ASR performance in reverberant environments, Speech Communication, vol.76, pp.170-185, 2016. ,
DOI : 10.1016/j.specom.2015.09.004
Noise Perturbation Improves Supervised Speech Separation, Proc. 12th Int. Conf. on Latent Variable Analysis and Signal Separation, pp.83-90, 2015. ,
DOI : 10.1007/978-3-319-22482-4_10
Speech processing in modern communication: Challenges and perspectives, 2010. ,
DOI : 10.1007/978-3-642-11130-3
Robust adaptive beamforming, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.35, issue.10, pp.1365-1376, 1987. ,
DOI : 10.1109/TASSP.1987.1165054
Robust Localization in Reverberant Rooms, Microphone arrays: signal processing techniques and applications, pp.157-180, 2001. ,
DOI : 10.1007/978-3-662-04619-7_8
Superdirective Beamforming Robust Against Microphone Mismatch, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.2, pp.617-631, 2007. ,
DOI : 10.1109/TASL.2006.881676
Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010. ,
DOI : 10.1109/TASL.2010.2050716
URL : https://hal.archives-ouvertes.fr/inria-00435807
A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER), 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, pp.347-354, 1997. ,
DOI : 10.1109/ASRU.1997.659110
The Sheffield wargames corpus, Proc. Interspeech, pp.1116-1120, 2013. ,
Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.416-422, 2015. ,
DOI : 10.1109/ASRU.2015.7404825
Maximum likelihood linear transformations for HMM-based speech recognition, Computer Speech & Language, vol.12, issue.2, pp.75-98, 1998. ,
DOI : 10.1006/csla.1998.0043
Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001. ,
DOI : 10.1109/78.934132
Some statistical issues in the comparison of speech recognition algorithms, International Conference on Acoustics, Speech, and Signal Processing, pp.532-535, 1989. ,
DOI : 10.1109/ICASSP.1989.266481
CU-Move " : Analysis & corpus development for interactive in-vehicle speech systems, Proc. Eurospeech, pp.2023-2026, 2001. ,
The automatic speech recognition in reverberant environments (ASpIRE) challenge, Proc. 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp.547-554, 2015. ,
BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.444-451, 2015. ,
DOI : 10.1109/ASRU.2015.7404829
The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, Proc. ASR2000, pp.181-188, 2000. ,
The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.475-481, 2015. ,
DOI : 10.1109/ASRU.2015.7404833
Modelling non-stationary noise with spectral factorisation in automatic speech recognition, Computer Speech & Language, vol.27, issue.3, pp.763-779, 2013. ,
DOI : 10.1016/j.csl.2012.07.008
Elastic spectral distortion for low resource speech recognition with deep neural networks, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.309-314, 2013. ,
DOI : 10.1109/ASRU.2013.6707748
ivectorbased discriminative adaptation for automatic speech recognition, Proc. 2011 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp.152-157, 2011. ,
Adaptation of deep neural network acoustic models using factorised i-vectors, Proc. Interspeech, pp.2180-2184, 2014. ,
Adaptive Denoising Autoencoders: A Fine-Tuning Scheme to Learn from Test Mixtures, Proc. 12th Int. Conf. on Latent Variable Analysis and Signal Separation, pp.100-107, 2015. ,
DOI : 10.1007/978-3-319-22482-4_12
The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.1-4, 2013. ,
DOI : 10.1109/WASPAA.2013.6701894
Improved backing-off for M-gram language modeling, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.181-184, 1995. ,
DOI : 10.1109/ICASSP.1995.479394
Microphone array processing for distant speech recognition: Towards real-world deployment, Proc. APSIPA Annual Summit and Conf, pp.1-10, 2012. ,
The translingual English database (TED), Proc. 3rd Int. Conf. on Spoken Language Processing (ICSLP), 1994. ,
Robust Automatic Speech Recognition ? A Bridge to Practical Applications, 2015. ,
Scalable audio separation with light Kernel Additive Modelling, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.76-80, 2015. ,
DOI : 10.1109/ICASSP.2015.7177935
URL : https://hal.archives-ouvertes.fr/hal-01114890
Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010. ,
DOI : 10.1109/TASL.2009.2029711
Mutual benefits of auditory spectrotemporal Gabor features and deep learning for the 3rd CHiME challenge, 2015. ,
On diagonal loading for minimum variance beamformers, Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology (IEEE Cat. No.03EX795), pp.459-462, 2003. ,
DOI : 10.1109/ISSPIT.2003.1341157
Recurrent neural network based language model, Proc. Interspeech, pp.1045-1048, 2010. ,
Damped oscillator cepstral coefficients for robust speech recognition, Proc. Interspeech, pp.886-890, 2013. ,
Medium-duration modulation cepstral feature for robust speech recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.1749-1753, 2014. ,
DOI : 10.1109/ICASSP.2014.6853898
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.587.296
A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.468-474, 2015. ,
DOI : 10.1109/ASRU.2015.7404832
Multichannel Audio Source Separation With Deep Neural Networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.9, pp.1652-1664, 2016. ,
DOI : 10.1109/TASLP.2016.2580946
URL : https://hal.archives-ouvertes.fr/hal-01163369
Multichannel music separation with deep neural networks, 2016 24th European Signal Processing Conference (EUSIPCO), 2016. ,
DOI : 10.1109/EUSIPCO.2016.7760548
URL : https://hal.archives-ouvertes.fr/hal-01334614
Noise-robust ASR for the third 'CHiME' challenge exploiting time-frequency masking based multi-channel speech enhancement and recurrent neural network, 2015. ,
The kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011. ,
Adaptive beamforming and adaptive training of DNN acoustic models for enhanced multichannel noisy speech recognition, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.401-408, 2015. ,
DOI : 10.1109/ASRU.2015.7404823
The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.275-282, 2015. ,
DOI : 10.1109/ASRU.2015.7404805
Interpretation of Multiparty Meetings the AMI and Amida Projects, 2008 Hands-Free Speech Communication and Microphone Arrays, pp.115-118, 2008. ,
DOI : 10.1109/HSCMA.2008.4538700
Unbiased coherent-to-diffuse ratio estimation for dereverberation, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), pp.6-10, 2014. ,
DOI : 10.1109/IWAENC.2014.6953306
An investigation of deep neural networks for noise robust speech recognition, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7398-7402, 2013. ,
DOI : 10.1109/ICASSP.2013.6639100
Speaker adaptation techniques for automatic speech recognition, Proc. APSIPA ASC, 2011. ,
Suppression of coherent and incoherent noise using a microphone array, Annals of telecommunications, vol.78, pp.439-446, 1994. ,
Robust ASR using neural network based speech enhancement and feature simulation, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.482-489, 2015. ,
DOI : 10.1109/ASRU.2015.7404834
URL : https://hal.archives-ouvertes.fr/hal-01204553
Improvement of microphone array characteristics for speech capturing, Modern Applied Science, vol.9, issue.6, pp.310-319, 2015. ,
The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments, Computer Speech & Language, vol.26, issue.1, pp.52-66, 2011. ,
DOI : 10.1016/j.csl.2010.12.003
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models, 2014 IEEE Spoken Language Technology Workshop (SLT), pp.171-176, 2014. ,
DOI : 10.1109/SLT.2014.7078569
The overview of the MELCO ASR system for the third CHiME challenge, 2015. ,
Oracle estimators for the benchmarking of source separation algorithms, Signal Processing, vol.87, issue.8, pp.1933-1059, 2007. ,
DOI : 10.1016/j.sigpro.2007.01.016
URL : https://hal.archives-ouvertes.fr/inria-00544194
Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.423-429, 2015. ,
DOI : 10.1109/ASRU.2015.7404826
Noise robust IOA/CAS speech separation and recognition system for the third 'CHIME' challenge, 2015. ,
On Training Targets for Supervised Speech Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.1849-1858, 2014. ,
DOI : 10.1109/TASLP.2014.2352935
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR, Proc. 12th Int. Conf. on Latent Variable Analysis and Signal Separation, pp.91-99, 2015. ,
DOI : 10.1007/978-3-319-22482-4_11
URL : https://hal.archives-ouvertes.fr/hal-01163493
Distant Speech Recognition, 2009. ,
An Experimental Study on Speech Enhancement Based on Deep Neural Networks, IEEE Signal Processing Letters, vol.21, issue.1, pp.65-68, 2014. ,
DOI : 10.1109/LSP.2013.2291240
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.436-443, 2015. ,
DOI : 10.1109/ASRU.2015.7404828
Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.69-84, 2010. ,
DOI : 10.1109/TASL.2010.2045183
A microphone array with adaptive post-filtering for noise reduction in reverberant rooms, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing, pp.2578-2581, 1988. ,
DOI : 10.1109/ICASSP.1988.197172