Improved perceptual metrics for the evaluation of audio source separation

Emmanuel Vincent 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We aim to predict the perceived quality of estimated source signals in the context of audio source separation. Recently, we proposed a set of metrics called PEASS that consist of three computation steps: decomposition of the estimation error into three components, measurement of the salience of each component via the PEMO-Q auditory-motivated measure, and combination of these saliences via a nonlinear mapping trained on subjective opinion scores. The parameters of the decomposition were shown to have little influence on the prediction performance. In this paper, we evaluate the impact of the parameters of PEMO-Q and the nonlinear mapping on the prediction performance. By selecting the optimal parameters, we improve the average correlation with mean opinion scores (MOS) from 0.738 to 0.909 in a cross-validation setting. The resulting improved metrics are used in the context of the 2011 Signal Separation Evaluation Campaign (SiSEC).
Liste complète des métadonnées

Cited literature [11 references]  Display  Hide  Download

https://hal.inria.fr/hal-00653196
Contributor : Emmanuel Vincent <>
Submitted on : Monday, December 19, 2011 - 9:07:14 AM
Last modification on : Thursday, March 21, 2019 - 2:20:12 PM
Document(s) archivé(s) le : Tuesday, March 20, 2012 - 2:22:00 AM

File

vincent_LVA12.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00653196, version 1

Citation

Emmanuel Vincent. Improved perceptual metrics for the evaluation of audio source separation. 10th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), Mar 2012, Tel Aviv, Israel. pp.430-437. ⟨hal-00653196⟩

Share

Metrics

Record views

409

Files downloads

535