Deep clustering: Discriminative embeddings for segmentation and separation, ICASSP, pp.31-35, 2016. ,
Conv-tasnet: Surpassing ideal time-frequency magnitude masking for speech separation, IEEE/ACM transactions on audio, speech, and language processing, vol.27, issue.8, pp.1256-1266, 2019. ,
Universal sound separation, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.175-179, 2019. ,
Speech enhancement for binaural hearing aids based on blind source separation, ISCCSP, pp.1-6, 2010. ,
Front-end processing for the chime-5 dinner party scenario, 2018. ,
Analysis of deep clustering as preprocessing for automatic speech recognition of sparsely overlapping speech, 2019. ,
End-to-end training of time domain audio separation and recognition, ICASSP, pp.7004-7008, 2020. ,
All-neural online source separation, counting, and diarization for meeting analysis, ICASSP, pp.91-95, 2019. ,
Improving universal sound separation using sound classification, ICASSP, pp.96-100, 2020. ,
What's all the fuss about free universal sound separation data?, 2020. ,
Improving robustness of deep neural network acoustic models via speech separation and joint adaptive training, IEEE/ACM transactions on audio, vol.23, issue.1, pp.92-101, 2014. ,
End-to-end multi-speaker speech recognition, ICASSP. IEEE, pp.4819-4823, 2018. ,
Studentteacher network learning with enhanced features, ICASSP, pp.5275-5279, 2017. ,
Spectral feature mapping with mimic loss for robust speech recognition, ICASSP. IEEE, pp.5609-5613, 2018. ,
Sound event detection in domestic environments with weakly labeled data and soundscape synthesis, Workshop on Detection and Classification of Acoustic Scenes and Events, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02160855
The sins database for detection of daily activities in a home environment using an acoustic sensor network, 2017. ,
TUT database for acoustic scene classification and sound event detection, EUSIPCO, 2016. ,
Mean teacher with data agumentation for dcase 2019 task 4, 2009. ,
Mean teachers are better role models: Weight-averaged consistency targets improve semisupervised deep learning results, Advances in neural information processing systems, pp.1195-1204, 2017. ,
Sdrhalf-baked or well done?" in ICASSP, pp.626-630, 2019. ,
Bridging the gap between monaural speech enhancement and recognition with distortionindependent acoustic modeling, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.28, pp.39-48, 2019. ,
A purely end-to-end system for multi-speaker speech recognition, 2018. ,
End-to-end monaural multi-speaker asr system without pretraining, ICASSP, pp.6256-6260, 2019. ,
Permutation invariant training of deep models for speaker-independent multi-talker speech separation, ICASSP, pp.241-245, 2017. ,
Multitalker speech separation with utterance-level permutation invariant training of deep recurrent neural networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.10, pp.1901-1913, 2017. ,
Asteroid: the pytorch-based audio source separation toolkit for researchers, 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02962964
Perceptual losses for real-time style transfer and super-resolution, pp.694-711, 2016. ,
, Speech denoising with deep feature losses, 2018.
End-to-End Neural Speaker Diarization with Permutation-Free Objectives, pp.4300-4304, 2019. ,
Adam: A method for stochastic optimization, 2014. ,