Skip to Main content Skip to Navigation
Journal articles

Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds

Abstract : Polyphonic pitch estimation and musical instrument identification are some of the most challenging tasks in the field of Music Information Retrieval (MIR). While existing approaches have focused on the modeling of harmonic partials, we design a joint Gaussian mixture model of the harmonic partials and the inharmonic attack of each note. This model encodes the power of each partial over time as well as the spectral envelope of the attack part. We derive an Expectation-Maximization (EM) algorithm to estimate the pitch and the parameters of the notes. We then extract timbre features both from the harmonic and the attack part via Principal Component Analysis (PCA) over the estimated model parameters. Musical instrument recognition for each estimated note is finally carried out with a Support Vector Machine (SVM) classifier. Experiments conducted on mixtures of isolated notes as well as real-world polyphonic music show higher accuracy over state-of-the-art approaches based on the modeling of harmonic partials only.
Complete list of metadatas

Cited literature [29 references]  Display  Hide  Download

https://hal.inria.fr/inria-00594965
Contributor : Emmanuel Vincent <>
Submitted on : Wednesday, June 1, 2011 - 6:46:28 PM
Last modification on : Friday, July 10, 2020 - 4:01:04 PM
Long-term archiving on: : Friday, September 2, 2011 - 2:30:57 AM

File

wu_JSTSP11.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00594965, version 2

Citation

Jun Wu, Emmanuel Vincent, Stanislaw Raczynski, Takuya Nishimoto, Nobutaka Ono, et al.. Polyphonic pitch estimation and instrument identification by joint modeling of sustained and attack sounds. IEEE Journal of Selected Topics in Signal Processing, IEEE, 2011, 5 (6), pp.1124-1132. ⟨inria-00594965v2⟩

Share

Metrics

Record views

639

Files downloads

507