From blind to guided audio source separation: How models and side information can improve the separation of sound

Emmanuel Vincent 1 Nancy Bertin 2 Rémi Gribonval 2 Frédéric Bimbot 2
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE, Inria Rennes – Bretagne Atlantique
Abstract : Audio is a domain where signal separation has long been considered as a fascinating objective, potentially offering a wide range of new possibilities and experiences in professional and personal contexts, by better taking advantage of audio material and finely analyzing complex acoustic scenes. It has thus always been a major area for research in signal separation and an exciting challenge for industrial applications. Starting with blind separation of toy mixtures in the mid 90's, research has progressed up to real-world scenarios today, with applications to speech enhancement and recognition, music editing, 3D sound rendering, and audio information retrieval, among others. This has mostly been made possible by the development of increasingly informed separation techniques incorporating knowledge about the sources and/or the mixtures at hand. For instance, speech source separation for remote conferencing can benefit from prior knowledge of the room geometry and/or the names of the speakers, while music remastering will exploit instrument characteristics and knowledge of sound engineers mixing habits. After a brief historical account, we provide an overview of recent and ongoing research in this field, illustrating a variety of models and techniques designed so as to guide the audio source separation process towards efficient and robust solutions.
Complete list of metadatas

Cited literature [31 references]  Display  Hide  Download

https://hal.inria.fr/hal-00922378
Contributor : Emmanuel Vincent <>
Submitted on : Monday, January 6, 2014 - 11:25:58 AM
Last modification on : Wednesday, April 3, 2019 - 1:23:00 AM
Long-term archiving on : Thursday, April 10, 2014 - 3:40:46 PM

File

vincent_SPM14.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00922378, version 1

Citation

Emmanuel Vincent, Nancy Bertin, Rémi Gribonval, Frédéric Bimbot. From blind to guided audio source separation: How models and side information can improve the separation of sound. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2014, 31 (3), pp.107-115. ⟨hal-00922378⟩

Share

Metrics

Record views

2258

Files downloads

2810