Other publications

Two nonnegative matrix factorization methods for polyphonic pitch transcription

Emmanuel Vincent 1 Nancy Bertin 2 Roland Badeau 2
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: Polyphonic pitch transcription consists of estimating the onset time, duration and pitch of each note within a music signal. Adaptive signal models such as Nonnegative Matrix Factorization (NMF) appear well suited to this task, since they can provide a meaningful representation whatever instruments are playing. In this paper, we propose a simple transcription method using minimum residual loudness NMF, harmonic comb-based pitch identification and threshold-based onset/offset detection, and investigate a second method incorporating harmonicity constraints in the NMF model. Both methods are evaluated in the framework of MIREX 2007.
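
As a rough illustration of the kind of pipeline the abstract outlines, the sketch below factorizes a nonnegative spectrogram and thresholds one activation row to obtain note onsets and offsets. It is not the authors' method: the minimum residual loudness criterion is replaced here by the standard Kullback-Leibler multiplicative updates, the harmonic comb-based pitch identification step is not shown, and the names `nmf_kl` and `detect_notes`, as well as all parameter values, are assumptions made for this example.

```python
import numpy as np


def nmf_kl(V, n_components, n_iter=200, seed=0, eps=1e-12):
    """Plain NMF, V (freq x time) ~= W (freq x k) @ H (k x time).

    Uses the standard multiplicative updates for the KL divergence.
    This is a stand-in for the paper's minimum residual loudness
    criterion, which is not reproduced here.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_components)) + eps  # basis spectra
    H = rng.random((n_components, n_frames)) + eps  # time activations
    for _ in range(n_iter):
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)
    return W, H


def detect_notes(activation, threshold):
    """Return (onset_frame, offset_frame) pairs for one activation row.

    A note is a maximal run of frames whose activation exceeds the
    threshold; the offset frame is exclusive.
    """
    active = np.asarray(activation) > threshold
    # Pad with False so runs touching either edge are still closed.
    padded = np.concatenate(([False], active, [False]))
    changes = np.diff(padded.astype(int))
    onsets = np.flatnonzero(changes == 1)
    offsets = np.flatnonzero(changes == -1)
    return list(zip(onsets.tolist(), offsets.tolist()))


if __name__ == "__main__":
    # Synthetic "spectrogram": two basis spectra with on/off activations.
    rng = np.random.default_rng(1)
    W_true = rng.random((20, 2))
    H_true = np.zeros((2, 30))
    H_true[0, 5:15] = 1.0
    H_true[1, 10:25] = 1.0
    V = W_true @ H_true
    W, H = nmf_kl(V, n_components=2)
    for k in range(2):
        print(k, detect_notes(H[k], threshold=0.5 * H[k].max()))
```

In a real transcription system each basis spectrum in `W` would still need to be assigned a pitch, and the threshold would be tuned on data; both are left open in this sketch.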
Cited literature: 11 references

https://hal.inria.fr/inria-00544213
Contributor: Emmanuel Vincent
Submitted on: Tuesday, December 7, 2010 - 3:01:12 PM
Last modification on: Friday, December 20, 2019 - 1:36:38 AM
Document(s) archived on: Tuesday, March 8, 2011 - 4:31:43 AM

File

vincent_MIREX07.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id: inria-00544213, version 1

Citation

Emmanuel Vincent, Nancy Bertin, Roland Badeau. Two nonnegative matrix factorization methods for polyphonic pitch transcription. 2007. ⟨inria-00544213⟩

Metrics

Record views: 605
File downloads: 412