Audio source separation into the wild

Laurent Girin 1, 2 Sharon Gannot 3 Xiaofei Li 2
1 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition
2 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : This review chapter is dedicated to multichannel audio source separation in real-life environments. We explore some of the major achievements in the field and discuss some of the remaining challenges. We consider several important practical scenarios, e.g. moving sources and/or microphones, a varying number of sources and sensors, high reverberation levels, spatially diffuse sources, and synchronization problems. Several applications, such as smart assistants, cellular phones, hearing aids, and robots, are discussed. Our perspectives on the future of the field are given as concluding remarks.

https://hal.inria.fr/hal-01943375
Contributor : Team Perception
Submitted on : Monday, December 3, 2018 - 5:02:45 PM
Last modification on : Friday, December 28, 2018 - 6:50:55 PM
Long-term archiving on : Monday, March 4, 2019 - 2:54:46 PM

File

Book_plain.pdf
Files produced by the author(s)

Citation

Laurent Girin, Sharon Gannot, Xiaofei Li. Audio source separation into the wild. Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, ⟨10.1016/B978-0-12-814601-9.00022-5⟩. ⟨hal-01943375⟩
