Musical Source Separation: An Introduction

Estefania Cano; Derry Fitzgerald; Antoine Liutkus; Mark D. Plumbley; Fabian-Robert Stöter

doi:10.1109/MSP.2018.2874719

Article Dans Une Revue IEEE Signal Processing Magazine Année : 2019

Musical Source Separation: An Introduction

(1) , (2) , (3) , (4) , (3)

1
2
3
4

Estefania Cano

Fonction : Auteur
PersonId : 1040005

Fraunhofer Institute for Digital Media Technology [Ilmenau]

Derry Fitzgerald

Fonction : Auteur

CIT Cork School of Music

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Scientific Data Management

Mark D. Plumbley

Fonction : Auteur

Centre for Digital Music

Fabian-Robert Stöter

Fonction : Auteur
PersonId : 741450
IdHAL : fabian-robert-stoter
ORCID : 0000-0002-2534-1165

Scientific Data Management

Résumé

Many people listen to recorded music as part of their everyday lives, for example from radio or TV programmes, CDs, downloads or increasingly from online streaming services. Sometimes we might want to remix the balance within the music, perhaps to make the vocals louder or to suppress an unwanted sound, or we might want to upmix a 2-channel stereo recording to a 5.1- channel surround sound system. We might also want to change the spatial location of a musical instrument within the mix. All of these applications are relatively straightforward, provided we have access to separate sound channels (stems) for each musical audio object. However, if we only have access to the final recording mix, which is usually the case, this is much more challenging. To estimate the original musical sources, which would allow us to remix, suppress or upmix the sources, we need to perform musical source separation (MSS). In the general source separation problem, we are given one or more mixture signals that contain different mixtures of some original source signals. This is illustrated in Figure 1 where four sources, namely vocals, drums, bass and guitar, are all present in the mixture. The task is to recover one or more of the source signals given the mixtures. In some cases, this is relatively straightforward, for example, if there are at least as many mixtures as there are sources, and if the mixing process is fixed, with no delays, filters or non-linear mastering [1]. However, MSS is normally more challenging. Typically, there may be many musical instruments and voices in a 2-channel recording, and the sources have often been processed with the addition of filters and reverberation (sometimes nonlinear) in the recording and mixing process. In some cases, the sources may move, or the production parameters may change, meaning that the mixture is time-varying. All of these issues make MSS a very challenging problem. Nevertheless, musical sound sources have particular properties and structures that can help us. For example, musical source signals often have a regular harmonic structure of frequencies at regular intervals, and can have frequency contours characteristic of each musical instrument. They may also repeat in particular temporal patterns based on the musical structure. In this paper we will explore the MSS problem and introduce approaches to tackle it. We will begin by introducing characteristics of music signals, we will then give an introduction to MSS, and finally consider a range of MSS models. We will also discuss how to evaluate MSS approaches, and discuss limitations and directions for future research

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

Musical%20Source%20Separation%20-%20An%20Introduction.pdf (1.43 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01945345

Soumis le : mercredi 5 décembre 2018-11:40:11

Dernière modification le : jeudi 15 février 2024-03:31:12

Archivage à long terme le : mercredi 6 mars 2019-13:28:10

Dates et versions

hal-01945345 , version 1 (05-12-2018)

Identifiants

HAL Id : hal-01945345 , version 1
DOI : 10.1109/MSP.2018.2874719

Citer

Estefania Cano, Derry Fitzgerald, Antoine Liutkus, Mark D. Plumbley, Fabian-Robert Stöter. Musical Source Separation: An Introduction. IEEE Signal Processing Magazine, 2019, 36 (1), pp.31-40. ⟨10.1109/MSP.2018.2874719⟩. ⟨hal-01945345⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA ZENITH LIRMM INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC MIPS UNIV-MONTPELLIER UNIV-RENNES ANR UR1-MATH-NUM

578 Consultations

4022 Téléchargements

Musical Source Separation: An Introduction

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager