Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds - Inria - Institut national de recherche en sciences et technologies du numérique
Journal article, IEEE Journal of Selected Topics in Signal Processing, Year: 2011

Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds

Jun Wu
  • Role: Author
  • PersonId: 891099
Emmanuel Vincent
  • Role: Author
Stanislaw Andrzej Raczynski
  • Role: Author
  • PersonId: 891100
Takuya Nishimoto
  • Role: Author
Nobutaka Ono
  • Role: Author
  • PersonId: 901589
Shigeki Sagayama
  • Role: Author
  • PersonId: 835791

Abstract

Polyphonic pitch estimation and musical instrument identification are among the most challenging tasks in Music Information Retrieval (MIR). While existing approaches have focused on modeling harmonic partials, we design a joint Gaussian mixture model of the harmonic partials and the inharmonic attack of each note. This model encodes the power of each partial over time as well as the spectral envelope of the attack part. We derive an Expectation-Maximization (EM) algorithm to estimate the pitch and the parameters of the notes. We then extract timbre features from both the harmonic and the attack parts via Principal Component Analysis (PCA) over the estimated model parameters. Musical instrument recognition for each estimated note is finally carried out with a Support Vector Machine (SVM) classifier. Experiments conducted on mixtures of isolated notes as well as real-world polyphonic music show higher accuracy than state-of-the-art approaches based on the modeling of harmonic partials only.
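As a rough illustration of the EM step mentioned in the abstract, the following Python sketch fits a single-note harmonic Gaussian mixture to a power spectrum. It is not the authors' model: it covers only the harmonic partials (no attack component, no temporal envelope, one note at a time), works on a linear frequency axis with a fixed partial width, and all names (`gaussian`, `harmonic_spectrum`, `em_refine_f0`, `sigma`) are hypothetical choices made for this example.

```python
import math


def gaussian(x, mean, sigma):
    """Normal probability density evaluated at x."""
    z = (x - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def harmonic_spectrum(freqs, f0, weights, sigma):
    """Toy power spectrum of one note: a Gaussian bump at each partial k * f0."""
    return [
        sum(w * gaussian(f, (k + 1) * f0, sigma) for k, w in enumerate(weights))
        for f in freqs
    ]


def em_refine_f0(freqs, power, f0, n_partials, sigma, n_iter=20):
    """Refine f0 and the partial weights by EM on an observed power spectrum.

    Assumption: the component means are tied to k * f0, so the M-step for f0
    is a weighted least-squares fit over all partials.
    """
    weights = [1.0 / n_partials] * n_partials
    for _ in range(n_iter):
        num = 0.0
        den = 0.0
        new_weights = [0.0] * n_partials
        for f, p in zip(freqs, power):
            # E-step: share the energy of this bin among the partials.
            lik = [w * gaussian(f, (k + 1) * f0, sigma)
                   for k, w in enumerate(weights)]
            total = sum(lik)
            if total <= 0.0:
                continue  # bin too far from every partial to matter
            for k, value in enumerate(lik):
                r = p * value / total
                new_weights[k] += r
                num += r * (k + 1) * f
                den += r * (k + 1) ** 2
        norm = sum(new_weights)
        if den <= 0.0 or norm <= 0.0:
            break  # no energy explained; keep the previous estimate
        # M-step: closed-form updates of f0 and of the partial weights.
        f0 = num / den
        weights = [w / norm for w in new_weights]
    return f0, weights


# Usage: synthesize a note at 220 Hz and start the search from 210 Hz.
grid = [5.0 * i for i in range(1, 401)]  # 5 Hz to 2000 Hz
observed = harmonic_spectrum(grid, 220.0, [0.5, 0.3, 0.2], 10.0)
estimated_f0, estimated_weights = em_refine_f0(grid, observed, 210.0, 3, 10.0)
print(estimated_f0, estimated_weights)
```

The estimated partial weights play the role of a crude timbre descriptor here. The system described in the abstract additionally models the attack part and passes PCA-reduced parameters to an SVM; those stages are not sketched.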
Main file
wu_JSTSP11.pdf (1.08 MB)
Origin: Files produced by the author(s)

Dates and versions

inria-00594965, version 1 (23-05-2011)
inria-00594965, version 2 (01-06-2011)

Identifiers

  • HAL Id: inria-00594965, version 2

Cite

Jun Wu, Emmanuel Vincent, Stanislaw Andrzej Raczynski, Takuya Nishimoto, Nobutaka Ono, et al. Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds. IEEE Journal of Selected Topics in Signal Processing, 2011, 5 (6), pp.1124-1132. ⟨inria-00594965v2⟩
362 views
491 downloads
