Robust singer identification in polyphonic music using melody enhancement and uncertainty-based learning

Abstract : Enhancing specific parts of a polyphonic music signal is believed to be a promising way of breaking the glass ceiling that most Music Information Retrieval (MIR) systems are now facing. The use of signal enhancement as a pre-processing step has led to limited improvement though, because distortions inevitably remain in the enhanced signals that may propagate to the subsequent feature extraction and classification stages. Previous studies attempting to reduce the impact of these distortions have relied on the use of feature weighting or missing feature theory. Based on advances in the field of noise-robust speech recognition, we represent the uncertainty about the enhanced signals via a Gaussian distribution instead that is subsequently propagated to the features and to the classifier. We introduce new methods to estimate the uncertainty from the signal in a fully automatic manner and to learn the classifier directly from polyphonic data. We illustrate the results by considering the task of identifying, from a given set of singers, which one is singing at a given time in a given song. Experimental results demonstrate the relevance of our approach.
Type de document :
Communication dans un congrès
13th International Society for Music Information Retrieval Conference (ISMIR), Oct 2012, Porto, Portugal. 2012
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00709826
Contributeur : Emmanuel Vincent <>
Soumis le : mardi 19 juin 2012 - 15:06:23
Dernière modification le : mercredi 11 avril 2018 - 01:50:58
Document(s) archivé(s) le : jeudi 15 décembre 2016 - 16:12:15

Fichier

lagrange_ISMIR12.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00709826, version 1

Citation

Mathieu Lagrange, Alexey Ozerov, Emmanuel Vincent. Robust singer identification in polyphonic music using melody enhancement and uncertainty-based learning. 13th International Society for Music Information Retrieval Conference (ISMIR), Oct 2012, Porto, Portugal. 2012. 〈hal-00709826〉

Partager

Métriques

Consultations de la notice

407

Téléchargements de fichiers

203