Computational auditory scene analysis, Computer Speech & Language, vol.8, pp.297-336, 1994.
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, 2006.
Deep clustering: Discriminative embeddings for segmentation and separation, pp.31-35, 2016.
Deep attractor network for single-microphone speaker separation, pp.246-250, 2017.
2 The speech separation model and AM trained using true DOA values were used, since the corresponding models trained using estimated DOAs performed poorly.
ICASSP, pp.5064-5068, 2018.
Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.10, pp.1901-1913, 2017.
Microphone Arrays: Signal Processing Techniques and Applications, Digital Signal Processing, 2001.
A consolidated perspective on multimicrophone speech enhancement and source separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01414179
Combining spectral and spatial features for deep learning based blind speaker separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.27, pp.457-468, 2019.
Multichannel speech separation with recurrent neural networks from high-order ambisonics recordings, ICASSP, pp.36-40, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01699759
Multi-channel overlapped speech recognition with location guided speech extraction network, IEEE Spoken Language Technology Workshop (SLT), pp.558-565, 2018.
DOA-informed source extraction in the presence of competing talkers and background noise, EURASIP Journal on Advances in Signal Processing, vol.2017, issue.1, p.60, 2017.
Cracking the cocktail party problem by multi-beam deep attractor network, ASRU, pp.437-444, 2017.
On the impact of localization errors on HRTF-based robust least-squares beamforming, pp.1072-1075, 2016.
The fifth 'CHiME' speech separation and recognition challenge: Dataset, task and baselines, pp.1561-1565, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01744021
The generalized correlation method for estimation of time delay, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.24, issue.4, pp.320-327, 1976.
Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01817519
Distant Speech Recognition, 2009.
Blind acoustic beamforming based on generalized eigenvalue decomposition, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.5, pp.1529-1539, 2007.
Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction, Signal Processing, vol.84, issue.12, pp.2367-2387, 2004.
Rank-1 constrained multichannel Wiener filter for speech recognition in noisy environments, Computer Speech & Language, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01634449
RIR-Generator: Room impulse response generator, 2018.
The second DIHARD diarization challenge: Dataset, task, and baselines, 2019.
Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation, ICASSP, pp.1-5, 2018.
Purely sequence-trained neural networks for ASR based on lattice-free MMI, pp.2751-2755, 2016.