Skip to Main content Skip to Navigation
Conference papers

Multichannel music separation with deep neural networks

Aditya Arie Nugraha 1 Antoine Liutkus 1 Emmanuel Vincent 1
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This article addresses the problem of multichannel music separation. We propose a framework where the source spectra are estimated using deep neural networks and combined with spatial covariance matrices to encode the source spatial characteristics. The parameters are estimated in an iterative expectation-maximization fashion and used to derive a multichannel Wiener filter. We evaluate the proposed framework for the task of music separation on a large dataset. Experimental results show that the method we describe performs consistently well in separating singing voice and other instruments from realistic musical mixtures.
Complete list of metadatas

https://hal.inria.fr/hal-01334614
Contributor : Aditya Arie Nugraha <>
Submitted on : Tuesday, June 21, 2016 - 9:51:26 AM
Last modification on : Saturday, November 16, 2019 - 7:04:01 PM
Document(s) archivé(s) le : Thursday, September 22, 2016 - 12:03:10 PM

File

PID4315999.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01334614, version 1

Citation

Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent. Multichannel music separation with deep neural networks. European Signal Processing Conference (EUSIPCO), Aug 2016, Budapest, Hungary. pp.1748-1752. ⟨hal-01334614v1⟩

Share

Metrics

Record views

509

Files downloads

412