Perceptually-based evaluation of the errors usually made when automatically transcribing music

Abstract: This paper investigates the perceptual importance of typical errors occurring when transcribing polyphonic music excerpts into a symbolic form. The case of the automatic transcription of piano music is taken as the target application and two subjective tests are designed. The main test aims at understanding how human subjects rank typical transcription errors such as note insertion, deletion or replacement, note doubling, incorrect note onset or duration, and so forth. The Bradley-Terry-Luce (BTL) analysis framework is used and the results show that pitch errors are more clearly perceived than incorrect loudness estimations or temporal deviations from the original recording. A second test presents a first attempt to include this information in more perceptually motivated measures for evaluating transcription systems.
Document type:
Conference paper
Proc. Int. Conf. Music Information Retrieval (ISMIR), Sep 2008, Philadelphia, PA, United States. 2008, 〈http://ismir2008.ismir.net/papers/ISMIR2008_203.pdf〉

https://hal.inria.fr/inria-00452615
Contributor: Valentin Emiya
Submitted on: Tuesday, February 2, 2010 - 16:56:18
Last modified on: Thursday, January 11, 2018 - 06:23:38
Document(s) archived on: Friday, June 18, 2010 - 18:26:46

File

ISMIR2008_203.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id: inria-00452615, version 1

Citation

Adrien Daniel, Valentin Emiya, Bertrand David. Perceptually-based evaluation of the errors usually made when automatically transcribing music. Proc. Int. Conf. Music Information Retrieval (ISMIR), Sep 2008, Philadelphia, PA, United States. 2008, 〈http://ismir2008.ismir.net/papers/ISMIR2008_203.pdf〉. 〈inria-00452615〉
