T. Virtanen, M. D. Plumbley, and D. Ellis, Computational analysis of sound scenes and events, 2018.

E. Benetos, G. Lafay, M. Lagrange, and M. D. Plumbley, Detection of overlapping acoustic events using a temporallyconstrained probabilistic model, ICASSP
URL : https://hal.archives-ouvertes.fr/hal-01255074

J. Salamon and J. P. Bello, Feature learning with deep scattering for urban sound analysis, 2015 23rd European Signal Processing Conference (EUSIPCO), pp.724-728, 2015.

R. Serizel, N. Turpault, A. Shah, and J. Salamon, Sound event detection in synthetic domestic environments, Proc. ICASSP, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02355573

A. Mesaros, T. Heittola, and T. Virtanen, Tut database for acoustic scene classification and sound event detection, 2016 24th European Signal Processing Conference, pp.1128-1132, 2016.

E. Benetos, G. Lafay, M. Lagrange, and M. D. Plumbley, Detection of overlapping acoustic events using a temporallyconstrained probabilistic model, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6450-6454, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01255074

V. Bisot, S. Essid, and G. Richard, Overlapping sound event detection with supervised nonnegative matrix factorization, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.31-35, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02713341

S. Adavanne, A. Politis, and T. Virtanen, Multichannel sound event detection using 3d convolutional neural networks for learning inter-channel features, 2018 International Joint Conference on Neural Networks (IJCNN), pp.1-7, 2018.

T. Heittola, A. Mesaros, T. Virtanen, and M. Gabbouj, Supervised model training for overlapping sound events based on unsupervised source separation, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.8677-8681, 2013.

Q. Kong, Y. Wang, X. Song, Y. Cao, W. Wang et al., Source separation with weakly labelled data: An approach to computational auditory scene analysis, ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.101-105, 2020.

I. Kavalerov, S. Wisdom, H. Erdogan, B. Patton, K. Wilson et al., Universal sound separation, Proc. WASPAA, 2019.

E. Tzinis, S. Wisdom, J. R. Hershey, A. Jansen, and D. P. Ellis, Improving universal sound separation using sound classification, Proc. ICASSP, 2020.

M. Olvera, E. Vincent, R. Serizel, and G. Gasso, Foreground-Background Ambient Sound Scene Separation, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02567542

N. Turpault and R. Serizel, Training sound event detection on a heterogeneous dataset, 2020.
URL : https://hal.archives-ouvertes.fr/hal-02891665

N. Turpault, R. Serizel, A. P. Shah, and J. Salamon, Sound event detection in domestic environments with weakly labeled data and soundscape synthesis, Proc. DCASE Workshop, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02160855

S. Wisdom, E. Tzinis, H. Erdogan, R. J. Weiss, K. Wilson et al., Unsupervised sound separation using mixtures of mixtures, 2020.

J. F. Gemmeke, D. P. Ellis, D. Freedman, A. Jansen, W. Lawrence et al., Audio set: An ontology and human-labeled dataset for audio events, Proc. ICASSP, 2017.

J. Salamon, D. Macconnell, M. Cartwright, P. Li, and J. P. Bello, Scaper: A library for soundscape synthesis and augmentation, Proc. WASPAA, pp.344-348, 2017.

F. Font, G. Roma, and X. Serra, Freesound technical demo, Proc. ACM, pp.411-412, 2013.

E. Fonseca, X. Favory, J. Pons, F. Font, and X. Serra, FSD50k: an open dataset of human-labeled sound events, 2020.

G. Dekkers, S. Lauwereins, B. Thoen, M. W. Adhana, H. Brouckxon et al., The SINS database for detection of daily activities in a home environment using an acoustic sensor network, Proc. DCASE Workshop, pp.32-36, 2017.

A. Mesaros, T. Heittola, and T. Virtanen, TUT database for acoustic scene classification and sound event detection, 2016 24th European Signal Processing Conference, pp.1128-1132

S. Wisdom, H. Erdogan, D. P. Ellis, R. Serizel, N. Turpault et al., What's all the FUSS about free universal sound separation data?, 2020.

E. Fonseca, J. Pons, X. Favory, F. Font, D. Bogdanov et al., Freesound datasets: a platform for the creation of open audio datasets, Proc. ISMIR, pp.486-493, 2017.

Y. Luo and N. Mesgarani, Conv-TasNet: Surpassing ideal time-frequency magnitude masking for speech separation, vol.27, pp.1256-1266, 2019.

S. Wisdom, J. R. Hershey, K. Wilson, J. Thorpe, M. Chinen et al., Differentiable consistency constraints for improved deep speech enhancement, Proc. ICASSP, 2019.

J. L. Roux, S. Wisdom, H. Erdogan, and J. R. Hershey, SDR-half-baked or well done, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.626-630, 2019.

A. Mesaros, T. Heittola, and T. Virtanen, Metrics for polyphonic sound event detection, Applied Sciences, vol.6, issue.6, 2016.

C. Bilen, G. Ferroni, F. Tuveri, J. Azcarreta, and S. Krstulovic, A framework for the robust evaluation of sound event detection, Proc. ICASSP, 2020.

D. Yu, M. Kolbaek, Z. Tan, and J. Jensen, Permutation invariant training of deep models for speaker-independent multi-talker speech separation, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.241-245, 2017.