Deep clustering: Discriminative embeddings for segmentation and separation, pp.31-35, 2016. ,
Listening to each speaker one by one with recurrent selective hearing networks, ICASSP, pp.5064-5068, 2018. ,
Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.10, pp.1901-1913, 2017. ,
Conv-TasNet: Surpassing ideal time-frequency magnitude masking for speech separation, Speech, and Language Processing, vol.27, pp.1256-1266, 2019. ,
, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, 2006.
Combining spectral and spatial features for deep learning based blind speaker separation, Speech, and Language Processing, vol.27, pp.457-468, 2019. ,
Deep clusteringbased beamforming for separation with unknown number of sources, pp.1183-1187, 2017. ,
Multichannel speech separation with recurrent neural networks from high-order ambisonics recordings, ICASSP, pp.36-40, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01699759
Multi-Channel overlapped speech recognition with location guided speech extraction network, IEEE Spoken Language Technology Workshop (SLT), pp.558-565, 2018. ,
Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition, EUSIPCO (Submitted), 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02355669
Adaptive blind separation of independent sources: A deflation approach, Signal Processing, vol.45, issue.1, pp.59-83, 1995. ,
Recursive speech separation for unknown number of speakers ,
A consolidated perspective on multimicrophone speech enhancement and source separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01414179
Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01817519
Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network," in EUSIPCO, pp.1462-1466, 2018. ,
Rank-1 constrained multichannel Wiener filter for speech recognition in noisy environments, Computer Speech & Language, vol.49, pp.37-51, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01634449
The fifth 'CHiME' speech separation and recognition challenge: Dataset, task and baselines, pp.1561-1565, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01744021
The second DIHARD diarization challenge: Dataset, task, and baselines, Interspeech, 2019. ,
RIR-Generator: Room impulse response generator, 2018. ,
Analysis of deep clustering as preprocessing for automatic speech recognition of sparsely overlapping speech, ICASSP, 2019. ,
The generalized correlation method for estimation of time delay, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.24, issue.4, pp.320-327, 1976. ,
Purely sequence-trained neural networks for ASR based on lattice-free MMI, pp.2751-2755, 2016. ,