Journal articles

From blind to guided audio source separation: How models and side information can improve the separation of sound

Emmanuel Vincent (1), Nancy Bertin (2), Rémi Gribonval (2), Frédéric Bimbot (2)
(1) PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
(2) PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Abstract : Audio is a domain where signal separation has long been considered a fascinating objective, potentially offering a wide range of new possibilities and experiences in professional and personal contexts, by better taking advantage of audio material and finely analyzing complex acoustic scenes. It has thus always been a major area for research in signal separation and an exciting challenge for industrial applications. Starting with blind separation of toy mixtures in the mid-1990s, research has progressed up to real-world scenarios today, with applications to speech enhancement and recognition, music editing, 3D sound rendering, and audio information retrieval, among others. This has mostly been made possible by the development of increasingly informed separation techniques incorporating knowledge about the sources and/or the mixtures at hand. For instance, speech source separation for remote conferencing can benefit from prior knowledge of the room geometry and/or the names of the speakers, while music remastering will exploit instrument characteristics and knowledge of sound engineers' mixing habits. After a brief historical account, we provide an overview of recent and ongoing research in this field, illustrating a variety of models and techniques designed to guide the audio source separation process towards efficient and robust solutions.

Cited literature [31 references]
Contributor : Emmanuel Vincent
Submitted on : Monday, January 6, 2014 - 11:25:58 AM
Last modification on : Thursday, January 20, 2022 - 5:28:34 PM
Long-term archiving on : Thursday, April 10, 2014 - 3:40:46 PM





Emmanuel Vincent, Nancy Bertin, Rémi Gribonval, Frédéric Bimbot. From blind to guided audio source separation: How models and side information can improve the separation of sound. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2014, 31 (3), pp.107-115. ⟨10.1109/MSP.2013.2297440⟩. ⟨hal-00922378⟩


