Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription

Benoît Fuentes; Roland Badeau; Gael Richard

Journal article, IEEE_J_ASLP, Year: 2013

Benoît Fuentes

Role: Corresponding author
PersonId : 743450
IdHAL : benoit-fuentes

Laboratoire Traitement et Communication de l'Information

Roland Badeau

Role: Corresponding author
PersonId : 1121
IdHAL : rbadeau
ORCID : 0000-0002-9630-6877
IdRef : 106938134

Laboratoire Traitement et Communication de l'Information

Gael Richard

Role: Author
PersonId : 14146
IdHAL : gael-richard
IdRef : 094977208

Laboratoire Traitement et Communication de l'Information

Abstract

Recently, new methods for smart decomposition of time-frequency representations of audio have been proposed to address the problem of blind automatic music transcription. However, those techniques are not necessarily suitable for notes whose pitch and spectral envelope both vary over time. The HALCA (Harmonic Adaptive Latent Component Analysis) model presented in this article accounts for both kinds of variation simultaneously. Each note in a constant-Q transform is locally modeled as a weighted sum of fixed narrowband harmonic spectra, spectrally convolved with an impulse that defines the pitch. All parameters are estimated by means of the expectation-maximization (EM) algorithm, in the framework of Probabilistic Latent Component Analysis. Priors over the parameters are also introduced to help the EM algorithm converge towards a meaningful solution. We applied this model to automatic music transcription: the onset time, duration and pitch of each note in an audio file are inferred from the estimated parameters. The system has been evaluated on two different databases and obtains very promising results.
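
As a rough illustration of the note model described in the abstract (a weighted sum of fixed narrowband harmonic spectra, convolved along the frequency axis with an impulse that sets the pitch), the following minimal sketch builds one note spectrum on a log-frequency axis. It is not the authors' implementation: the resolution `BINS_PER_OCTAVE`, the sizes `N_BINS` and `N_HARMONICS`, the two-partial shape of each kernel, and the function names are all assumptions chosen for readability, and the EM estimation and the priors are not shown.

```python
import math

BINS_PER_OCTAVE = 36   # assumed constant-Q resolution (hypothetical value)
N_BINS = 200           # assumed number of log-frequency bins
N_HARMONICS = 8        # assumed number of partials per note


def harmonic_offset(h):
    """Bin offset of partial h above the fundamental on a log-frequency axis."""
    return round(BINS_PER_OCTAVE * math.log2(h))


def narrowband_kernel(z):
    """Hypothetical kernel z (1 <= z <= N_HARMONICS): a template with its
    fundamental at bin 0 and energy on partials z and z + 1 only."""
    kernel = [0.0] * N_BINS
    for h in (z, z + 1):
        if h <= N_HARMONICS:
            kernel[harmonic_offset(h)] += 1.0
    total = sum(kernel)
    return [v / total for v in kernel]  # normalise so the kernel sums to 1


def note_spectrum(pitch_bin, weights):
    """Weighted sum of kernels 1..len(weights), then convolution with an
    impulse at pitch_bin. Convolving with a single impulse is a plain shift
    along the frequency axis; energy shifted past the last bin is dropped."""
    template = [0.0] * N_BINS
    for z, w in enumerate(weights, start=1):
        for f, v in enumerate(narrowband_kernel(z)):
            template[f] += w * v
    shifted = [0.0] * N_BINS
    for f, v in enumerate(template):
        if v and pitch_bin + f < N_BINS:
            shifted[pitch_bin + f] = v
    return shifted


# Example: a note whose fundamental sits at bin 20, with most of its
# energy carried by the lowest kernels.
spectrum = note_spectrum(pitch_bin=20, weights=[0.4, 0.3, 0.2, 0.1])
```

In this sketch, changing `weights` alters the spectral envelope without moving the partials, while changing `pitch_bin` moves every partial by the same number of bins. These are the two kinds of variation the abstract says the model handles simultaneously.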

Domains

Signal and image processing [eess.SP]

Main file

article-2013-13579-5.pdf (669.96 KB)

Origin: Files produced by the author(s)

Contributor: Roland Badeau

https://inria.hal.science/hal-00945197

Submitted on: Monday, 24 March 2014, 15:28:07

Last modified on: Monday, 9 October 2023, 12:49:40

Long-term archiving on: Tuesday, 24 June 2014, 10:42:06

Dates and versions

hal-00945197, version 1 (24-03-2014)

Identifiers

HAL Id: hal-00945197, version 1

Cite

Benoît Fuentes, Roland Badeau, Gael Richard. Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription. IEEE_J_ASLP, 2013, 21 (9), pp. 1854-1866. ⟨hal-00945197⟩

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI IDS S2A

103 views

430 downloads
