Multichannel music separation with deep neural networks

Aditya Arie Nugraha; Antoine Liutkus; Emmanuel Vincent

Communication Dans Un Congrès Année : 2016

Multichannel music separation with deep neural networks

(1) , (1) , (1)

Aditya Arie Nugraha

Fonction : Auteur
PersonId : 967049

Speech Modeling for Facilitating Oral-Based Communication

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Speech Modeling for Facilitating Oral-Based Communication

Emmanuel Vincent

Fonction : Auteur
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Résumé

This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices to encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating singing voice and other instruments from realistic musical mixtures.

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

PID4315999.pdf (176.8 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Aditya Arie Nugraha : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01334614

Soumis le : mardi 21 juin 2016-09:51:26

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : jeudi 22 septembre 2016-12:03:10

Dates et versions

hal-01334614 , version 1 (21-06-2016)

hal-01334614 , version 2 (14-06-2017)

Identifiants

HAL Id : hal-01334614 , version 1

Citer

Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent. Multichannel music separation with deep neural networks. European Signal Processing Conference (EUSIPCO), Aug 2016, Budapest, Hungary. pp.1748-1752. ⟨hal-01334614v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

520 Consultations

1112 Téléchargements

Multichannel music separation with deep neural networks

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Partager