Skip to Main content Skip to Navigation
New interface
Book sections

Audio source separation into the wild

Laurent Girin 1, 2 Sharon Gannot 3 Xiaofei Li 2 
2 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology, LJK - Laboratoire Jean Kuntzmann
Abstract : This review chapter is dedicated to multichannel audio source separation in real-life environment. We explore some of the major achievements in the field and discuss some of the remaining challenges. We will explore several important practical scenarios, e.g. moving sources and/or microphones, varying number of sources and sensors, high reverberation levels, spatially diffuse sources, and synchronization problems. Several applications such as smart assistants, cellular phones, hearing aids and robots, will be discussed. Our perspectives on the future of the field will be given as concluding remarks of this chapter.
Complete list of metadata
Contributor : Perception team Connect in order to contact the contributor
Submitted on : Monday, December 3, 2018 - 5:02:45 PM
Last modification on : Tuesday, October 25, 2022 - 4:17:50 PM
Long-term archiving on: : Monday, March 4, 2019 - 2:54:46 PM


Files produced by the author(s)



Laurent Girin, Sharon Gannot, Xiaofei Li. Audio source separation into the wild. Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, ⟨10.1016/B978-0-12-814601-9.00022-5⟩. ⟨hal-01943375⟩



Record views


Files downloads