Adaptive harmonic spectral decomposition for multiple pitch estimation

Emmanuel Vincent 1 Nancy Bertin 2 Roland Badeau 2
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Multiple pitch estimation consists of inferring the fundamental frequencies and the salience of the notes forming a music signal over short time frames. This mid-level representation can be exploited as a front-end for higher-level applications, such as music-to-score transcription or chord detection. One approach is to decompose the short-term magnitude spectrum of the signal into a sum of basis spectra representing individual pitches scaled by time-varying amplitudes, using algorithms such as nonnegative matrix factorization (NMF). Prior training of the basis spectra is often infeasible due to the wide range of possible instruments. Appropriate spectra must then be estimated from the observed data, which may result in limited performance due to inaccurately estimated spectra. In this article, we model each basis spectrum as a weighted sum of narrowband spectra representing a few adjacent harmonic partials, thus enforcing harmonicity and spectral smoothness while adapting the spectral envelope to each instrument. We derive a NMF-like algorithm to estimate the model parameters and evaluate it on a database of piano recordings, considering several choices for the narrowband spectra. Performance appears superior to unconstrained adaptive NMF and competitive with supervised NMF based on pre-trained piano spectra. We also apply our approach to woodwind data.
Type de document :
Rapport
[Research Report] PI 1919, 2009, pp.15
Liste complète des métadonnées

https://hal.inria.fr/inria-00350163
Contributeur : Emmanuel Vincent <>
Soumis le : vendredi 10 décembre 2010 - 16:33:09
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : vendredi 11 mars 2011 - 04:04:26

Fichier

techreport_warning.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00350163, version 3

Citation

Emmanuel Vincent, Nancy Bertin, Roland Badeau. Adaptive harmonic spectral decomposition for multiple pitch estimation. [Research Report] PI 1919, 2009, pp.15. 〈inria-00350163v3〉

Partager

Métriques

Consultations de la notice

513

Téléchargements de fichiers

108