Audio source separation into the wild

Laurent Girin; Sharon Gannot; Xiaofei Li

doi:10.1016/B978-0-12-814601-9.00022-5

Chapitre D'ouvrage Année : 2018

Audio source separation into the wild

(1, 2) , (3) , (2)

1
2
3

Laurent Girin

Fonction : Auteur
PersonId : 3682
IdHAL : laurent-girin
ORCID : 0000-0002-9214-8760
IdRef : 088998037

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Interpretation and Modelling of Images and Videos

Sharon Gannot

Fonction : Auteur

Bar-Ilan University [Israël]

Xiaofei Li

Fonction : Auteur

Interpretation and Modelling of Images and Videos

Résumé

This review chapter is dedicated to multichannel audio source separation in real-life environment. We explore some of the major achievements in the field and discuss some of the remaining challenges. We will explore several important practical scenarios, e.g. moving sources and/or microphones, varying number of sources and sensors, high reverberation levels, spatially diffuse sources, and synchronization problems. Several applications such as smart assistants, cellular phones, hearing aids and robots, will be discussed. Our perspectives on the future of the field will be given as concluding remarks of this chapter.

Mots clés

audio source separation linear Gaussian models nonnegative matrix factorization

Domaines

Traitement du signal et de l'image [eess.SP] Apprentissage [cs.LG] Son [cs.SD]

Fichier principal

Book_plain.pdf (323.05 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Perception team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01943375

Soumis le : lundi 3 décembre 2018-17:02:45

Dernière modification le : jeudi 4 avril 2024-20:52:24

Archivage à long terme le : lundi 4 mars 2019-14:54:46

Dates et versions

hal-01943375 , version 1 (03-12-2018)

Identifiants

HAL Id : hal-01943375 , version 1
DOI : 10.1016/B978-0-12-814601-9.00022-5

Citer

Laurent Girin, Sharon Gannot, Xiaofei Li. Audio source separation into the wild. Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, ⟨10.1016/B978-0-12-814601-9.00022-5⟩. ⟨hal-01943375⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA GIPSA GIPSA-DPC LJK LJK_GI LJK_GI_PERCEPTION GIPSA-CRISSP INRIA2

183 Consultations

1513 Téléchargements

Audio source separation into the wild

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager