Audio Source Separation and Speech Enhancement, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01881431
The Flexible Audio Source Separation Toolbox Version 2.0, ICASSP Show & Tell, 2014. ,
An open source software system for robot audition HARK and its evaluation, Humanoids, pp.561-566, 2008. ,
The ManyEars open framework, Autonomous Robots, vol.34, pp.217-232, 2013. ,
Blind enhancement of the rhythmic and harmonic sections by nmf: Does it help, pp.361-364, 2009. ,
Deep clustering: discriminative embeddings for segmentation and separation, ICASSP, pp.31-35, 2016. ,
Permutation invariant training of deep models for speaker-independent multi-talker speech separation, pp.241-245, 2017. ,
TasNet: Time-domain audio separation network for real-time, single-channel speech separation, in ICASSP, pp.696-700, 2018. ,
Surpassing ideal time-frequency magnitude masking for speech separation, IEEE/ACM Trans. Audio, Speech, Lang. Process, vol.27, issue.8, pp.1256-1266, 2019. ,
Wavesplit: End-to-end speech separation by speaker clustering, 2020. ,
The Northwestern University Source Separation Library, in ISMIR, pp.297-305, 2018. ,
Onssen: an open-source speech separation and enhancement library, 2019. ,
Open-Unmix -a reference implementation for music source separation, J. Open Source Soft, vol.4, issue.41, p.1667, 2019. ,
Py-Torch: An imperative style, high-performance deep learning library, 2019. ,
Two-step sound source separation: Training on learned latent targets, ICASSP, pp.31-35, 2020. ,
Dual-path RNN: Efficient long sequence modeling for time-domain single-channel speech separation, ICASSP, pp.46-50, 2020. ,
Single-channel multi-speaker separation using deep clustering, pp.545-549, 2016. ,
Deep attractor network for single-microphone speaker separation, pp.246-250, 2017. ,
Demystifying TasNet: A dissecting approach, ICASSP, pp.6359-6363, 2020. ,
A comprehensive study of speech separation: Spectrogram vs waveform separation, pp.4574-4578, 2019. ,
Universal sound separation, WASPAA, pp.175-179, 2019. ,
Filterbank design for end-to-end speech separation, ICASSP, pp.6364-6368, 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02355623
A multi-phase gammatone filterbank for speech separation via TasNet, ICASSP, pp.36-40, 2020. ,
Speaker recognition from raw waveform with SincNet, in SLT, pp.1021-1028, 2018. ,
The NumPy array: A structure for efficient numerical computation, Computing in Science and Engineering, vol.13, issue.2, pp.22-30, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00564007
Signal estimation from modified shorttime Fourier transform, IEEE Trans. Acoust., Speech, Signal Process, vol.32, issue.2, pp.236-243, 1984. ,
A fast Griffin-Lim algorithm, WASPAA, pp.1-4, 2013. ,
Iterative phase estimation for the synthesis of separated sources from single-channel mixtures, IEEE Signal Process. Letters, vol.17, issue.5, pp.421-424, 2010. ,
SDRhalf-baked or well done, ICASSP, pp.626-630, 2019. ,
A deep learning loss function based on the perceptual evaluation of the speech quality, IEEE Signal Process. Letters, vol.25, issue.11, pp.1680-1684, 2018. ,
Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks, IEEE/ACM Trans. Audio, Speech, Lang. Process, vol.25, issue.10, pp.1901-1913, 2017. ,
WHAM!: extending speech separation to noisy environments, pp.1368-1372, 2019. ,
WHAMR!: Noisy and reverberant single-channel speech separation, ICASSP, pp.696-700, 2020. ,
LibriMix: An open-source dataset for generalizable speech separation, 2020. ,
What's all the fuss about free universal sound separation data?, 2020. ,
The Interspeech 2020 deep noise suppression challenge: Datasets, subjective speech quality and testing framework, 2020. ,
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition, 2019. ,
Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition, 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02355669
The MUSDB18 corpus for music separation, 2017. ,
Pytorch lightning, 2019. ,
Tight integration of spatial and spectral features for BSS with deep clustering embeddings, pp.2650-2654, 2017. ,
Performance measurement in blind audio source separation, IEEE/ACM Trans. Audio, Speech, Lang. Process, vol.14, issue.4, pp.1462-1469, 2006. ,
URL : https://hal.archives-ouvertes.fr/inria-00544230
Perceptual evaluation of speech quality (PESQ) -a new method for speech quality assessment of telephone networks and codecs, ICASSP, vol.2, pp.749-752, 2001. ,
An algorithm for intelligibility prediction of time-frequency weighted noisy speech, IEEE/ACM Trans. Audio, Speech, Lang. Process, vol.19, issue.7, pp.2125-2136, 2011. ,
The Kaldi speech recognition toolkit, 2011. ,
Alternative objective functions for deep clustering, in ICASSP, pp.686-690, 2018. ,