Two nonnegative matrix factorization methods for polyphonic pitch transcription

Emmanuel Vincent 1 Nancy Bertin 2 Roland Badeau 2
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Polyphonic pitch transcription consists of estimating the onset time, duration and pitch of each note within a music signal. Adaptive signal models such as Nonnegative Matrix Factorization (NMF) appear well suited to this task, since they can provide a meaningful representation whatever instruments are playing. In this paper, we propose a simple transcription method using minimum residual loudness NMF, harmonic comb-based pitch identification and threshold-based onset/offset detection, and investigate a second method incorporating harmonicity constraints in the NMF model. Both methods are evaluated in the framework of MIREX 2007.
Type de document :
Autre publication
2007 Music Information Retrieval Evaluation eXchange (MIREX). 2007
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00544213
Contributeur : Emmanuel Vincent <>
Soumis le : mardi 7 décembre 2010 - 15:01:12
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : mardi 8 mars 2011 - 04:31:43

Fichier

vincent_MIREX07.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : inria-00544213, version 1

Citation

Emmanuel Vincent, Nancy Bertin, Roland Badeau. Two nonnegative matrix factorization methods for polyphonic pitch transcription. 2007 Music Information Retrieval Evaluation eXchange (MIREX). 2007. 〈inria-00544213〉

Partager

Métriques

Consultations de la notice

555

Téléchargements de fichiers

317