Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence

Abstract : In this paper, we investigate the problem of real-time polyphonic music transcription by employing non-negative matrix factorization techniques and the beta-divergence as a cost function. We consider real-world setups where the music signal arrives incrementally to the system and is transcribed as it unfolds in time. The proposed transcription system is addressed with a modified non-negative matrix factorization scheme, called non-negative decomposition, where the incoming signal is projected onto a fixed basis of templates learned off-line prior to the decomposition. We discuss the use of non-negative matrix factorization with the beta-divergence to achieve the real-time decomposition. The proposed system is evaluated on the specific task of piano music transcription and the results show that it can outperform several state-of-the-art off-line approaches.
Type de document :
Communication dans un congrès
ISMIR - 11th International Society for Music Information Retrieval Conference, Aug 2010, Utrecht, Netherlands. pp.489-494, 2010
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00708682
Contributeur : Arnaud Dessein <>
Soumis le : vendredi 15 juin 2012 - 14:59:34
Dernière modification le : vendredi 31 août 2018 - 09:14:29
Document(s) archivé(s) le : dimanche 16 septembre 2012 - 02:55:29

Fichier

Dessein2010ISMIR.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-00708682, version 1

Collections

Citation

Arnaud Dessein, Arshia Cont, Guillaume Lemaitre. Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence. ISMIR - 11th International Society for Music Information Retrieval Conference, Aug 2010, Utrecht, Netherlands. pp.489-494, 2010. 〈hal-00708682〉

Partager

Métriques

Consultations de la notice

915

Téléchargements de fichiers

1106