T. Virtanen, D. Mark, D. Plumbley, and . Ellis, Computational analysis of sound scenes and events, 2018.

R. Serizel, N. Turpault, H. Eghbalzadeh, and A. Shah, Large-scale weakly labeled semi-supervised sound event detection in domestic environments, Proc. DCASE2018, pp.19-23, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01850270

F. Jort, . Gemmeke, P. W. Daniel, D. Ellis, A. Freedman et al., Audio set: An ontology and human-labeled dataset for audio events, Proc. ICASSP, 2017.

J. Salamon and J. P. Bello, Unsupervised feature learning for urban sound classification, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015.

A. Jansen, M. Plakal, R. Pandya, D. Ellis, S. Hershey et al., Unsupervised learning of semantic audio representations, Proc. ICASSP, 2018.

Z. Zhang and B. Schuller, Semi-supervised learning helps in sound event classification, Proc. ICASSP, pp.333-336, 2012.

T. Komatsu, T. Toizumi, R. Kondo, and Y. Senda, Acoustic event detection method using semi-supervised non-negative matrix factorization with a mixture of local dictionaries, Proc. DCASE), pp.45-49, 2016.

B. Elizalde, A. Shah, S. Dalmia, M. H. Lee, R. Badlani et al., An approach for selftraining audio event detectors using web data, Proc. EUSIPCO, pp.1863-1867, 2017.

A. Kumar and B. Raj, Audio event detection using weakly labeled data, CoRR, 2016.

A. Kumar and B. Raj, Audio event and scene recognition: A unified approach using strongly and weakly labeled data, Proc. IJCNN. IEEE, pp.3475-3482, 2017.

A. Mesaros, T. Heittola, and T. Virtanen, Metrics for polyphonic sound event detection, Applied Sciences, vol.6, issue.6, p.162, 2016.

L. Jiakai, Mean teacher convolution system for dcase 2018 task 4, DCASE2018 Challenge, 2018.

Y. Liu-liu, J. Yan, Y. Song, and J. Du, Ustcnelslip system for dcase 2018 challenge task 4, DCASE2018 Challenge, 2018.

Q. Kong, X. Turab, W. Yong, M. D. Wang, and . Plumbley, DCASE 2018 challenge baseline with convolutional neural networks, DCASE2018 Challenge, 2018.

S. Kothinti, K. Imoto, D. Chakrabarty, S. Gregory, S. Watanabe et al., Joint acoustic and class inference for weakly supervised sound event detection, 2018.

R. Harb and F. Pernkopf, Sound event detection using weakly labeled semi-supervised data with gcrnns, vat and self-adaptive label refinement, 2018.

K. Koutini, H. Eghbal-zadeh, and G. Widmer, Iterative knowledge distillation in r-cnns for weakly-labeled semi-supervised sound event detection, 2018.

Y. Guo, M. Xu, J. Wu, Y. Wang, and K. Hoashi, Multi-scale convolutional recurrent neural network with ensemble method for weakly labeled sound event detection, 2018.

Y. Hou and S. Li, Semi-supervised sound event detection with convolutional recurrent neural network using weakly labelled data, DCASE2018 Challenge, 2018.

W. Lim, S. Suh, and Y. Jeong, Weakly labeled semi-supervised sound event detection using crnn with inception module, DCASE2018 Challenge, 2018.

A. Avdeeva and I. Agafonov, Sound event detection using weakly labeled dataset with convolutional recurrent neural network, DCASE2018 Challenge, 2018.

W. Jun and L. Shengchen, Self-attention mechanism based system for dcase2018 challenge task1 and task4, DCASE2018 Challenge, 2018.

T. Leo-cances, P. Pellegrini, and . Guyot, Sound event detection from weak annotations: Weighted gru versus multi-instance learning, DCASE2018 Challenge, 2018.

M. Hyeongi, B. Joon, and K. Bum-jun, End-to-end crnn architectures for weakly supervised sound event detection, 2018.

H. Dinkel, Y. Qiand, and K. Yu, A hybrid asr model approach on weakly labeled scene classification, DCASE2018 Challenge, 2018.

D. Wang, K. Xu, B. Zhu, L. Zhang, Y. Peng et al., A crnn-based system with mixup technique for large-scale weakly labeled sound event detection, 2018.

R. Raj, S. Waldekar, and G. Saha, Largescale weakly labelled semi-supervised cqt based sound event detection in domestic environments, DCASE2018 Challenge, 2018.

A. Tarvainen and H. Valpola, Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, Proc. NIPS, pp.1195-1204, 2017.

S. Sabour, N. Frosst, and G. E. Hinton, Dynamic routing between capsules, Proc. NIPS, pp.3856-3866, 2017.