Subjective and objective quality assessment of audio source separation

Valentin Emiya 1 Emmanuel Vincent 1 Niklas Harlander 2 Volker Hohmann 2
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely-available toolkit called PEASS.
Type de document :
Article dans une revue
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2011, 19 (7), pp.2046-2057. 〈http://ieeexplore.ieee.org/search/srchabstract.jsp?tp=&arnumber=5704564〉. 〈10.1109/TASL.2011.2109381〉
Liste complète des métadonnées

Littérature citée [36 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00567152
Contributeur : Valentin Emiya <>
Soumis le : vendredi 18 février 2011 - 14:56:21
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : jeudi 19 mai 2011 - 02:55:15

Fichier

emiya2011.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Valentin Emiya, Emmanuel Vincent, Niklas Harlander, Volker Hohmann. Subjective and objective quality assessment of audio source separation. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2011, 19 (7), pp.2046-2057. 〈http://ieeexplore.ieee.org/search/srchabstract.jsp?tp=&arnumber=5704564〉. 〈10.1109/TASL.2011.2109381〉. 〈inria-00567152〉

Partager

Métriques

Consultations de la notice

670

Téléchargements de fichiers

4682