Conference papers

Improved perceptual metrics for the evaluation of audio source separation

Emmanuel Vincent 1 
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: We aim to predict the perceived quality of estimated source signals in the context of audio source separation. Recently, we proposed a set of metrics called PEASS that consist of three computation steps: decomposition of the estimation error into three components, measurement of the salience of each component via the PEMO-Q auditory-motivated measure, and combination of these saliences via a nonlinear mapping trained on subjective opinion scores. The parameters of the decomposition were shown to have little influence on the prediction performance. In this paper, we evaluate the impact of the parameters of PEMO-Q and the nonlinear mapping on the prediction performance. By selecting the optimal parameters, we improve the average correlation with mean opinion scores (MOS) from 0.738 to 0.909 in a cross-validation setting. The resulting improved metrics are used in the context of the 2011 Signal Separation Evaluation Campaign (SiSEC).
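The evaluation criterion named in the abstract, average correlation with MOS in a cross-validation setting, can be illustrated with a minimal sketch. It is not the PEASS implementation: the function names are hypothetical, a single salience value per signal stands in for the PEMO-Q component saliences, and a least-squares line stands in for the trained nonlinear mapping.

```python
import random
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def fit_line(xs, ys):
    """Least-squares line y = slope * x + intercept.

    Assumption: a linear stand-in for the nonlinear mapping in the abstract.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def cross_validated_correlation(saliences, mos, n_folds=5, seed=0):
    """Average test-fold correlation between predicted scores and MOS."""
    indices = list(range(len(mos)))
    random.Random(seed).shuffle(indices)
    # Split the shuffled indices into n_folds disjoint test sets.
    folds = [indices[k::n_folds] for k in range(n_folds)]
    correlations = []
    for k, test_idx in enumerate(folds):
        # Train on every fold except the one held out for testing.
        train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
        slope, intercept = fit_line(
            [saliences[i] for i in train_idx], [mos[i] for i in train_idx]
        )
        predicted = [slope * saliences[i] + intercept for i in test_idx]
        correlations.append(pearson(predicted, [mos[i] for i in test_idx]))
    return sum(correlations) / n_folds
```

Holding out each fold in turn means the mapping is always scored on signals it was not fitted to, which is what makes a cross-validated correlation a fairer estimate of prediction performance than a correlation computed on the training data.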
Contributor: Emmanuel Vincent
Submitted on: Monday, December 19, 2011 - 9:07:14 AM
Last modification on: Friday, May 6, 2022 - 4:26:02 PM
Long-term archiving on: Tuesday, March 20, 2012 - 2:22:00 AM


Files produced by the author(s)


  • HAL Id: hal-00653196, version 1


Emmanuel Vincent. Improved perceptual metrics for the evaluation of audio source separation. 10th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), Mar 2012, Tel Aviv, Israel. pp.430-437. ⟨hal-00653196⟩


