From blind to guided audio source separation: How models and side information can improve the separation of sound

Emmanuel Vincent 1 Nancy Bertin 2 Rémi Gribonval 2 Frédéric Bimbot 2
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Inria Rennes – Bretagne Atlantique , IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
Abstract : Audio is a domain where signal separation has long been considered as a fascinating objective, potentially offering a wide range of new possibilities and experiences in professional and personal contexts, by better taking advantage of audio material and finely analyzing complex acoustic scenes. It has thus always been a major area for research in signal separation and an exciting challenge for industrial applications. Starting with blind separation of toy mixtures in the mid 90's, research has progressed up to real-world scenarios today, with applications to speech enhancement and recognition, music editing, 3D sound rendering, and audio information retrieval, among others. This has mostly been made possible by the development of increasingly informed separation techniques incorporating knowledge about the sources and/or the mixtures at hand. For instance, speech source separation for remote conferencing can benefit from prior knowledge of the room geometry and/or the names of the speakers, while music remastering will exploit instrument characteristics and knowledge of sound engineers mixing habits. After a brief historical account, we provide an overview of recent and ongoing research in this field, illustrating a variety of models and techniques designed so as to guide the audio source separation process towards efficient and robust solutions.
Type de document :
Article dans une revue
IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2014, 31 (3), pp.107-115
Liste complète des métadonnées

Littérature citée [31 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00922378
Contributeur : Emmanuel Vincent <>
Soumis le : lundi 6 janvier 2014 - 11:25:58
Dernière modification le : mercredi 16 mai 2018 - 11:24:07
Document(s) archivé(s) le : jeudi 10 avril 2014 - 15:40:46

Fichier

vincent_SPM14.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00922378, version 1

Citation

Emmanuel Vincent, Nancy Bertin, Rémi Gribonval, Frédéric Bimbot. From blind to guided audio source separation: How models and side information can improve the separation of sound. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2014, 31 (3), pp.107-115. 〈hal-00922378〉

Partager

Métriques

Consultations de la notice

2171

Téléchargements de fichiers

2000