Conference papers

Perceptually-based evaluation of the errors usually made when automatically transcribing music

Abstract : This paper investigates the perceptual importance of typical errors occurring when transcribing polyphonic music excerpts into a symbolic form. The case of the automatic transcription of piano music is taken as the target application and two subjective tests are designed. The main test aims at understanding how human subjects rank typical transcription errors such as note insertion, deletion or replacement, note doubling, incorrect note onset or duration, and so forth. The Bradley-Terry-Luce (BTL) analysis framework is used and the results show that pitch errors are more clearly perceived than incorrect loudness estimations or temporal deviations from the original recording. A second test presents a first attempt to include this information in more perceptually motivated measures for evaluating transcription systems.
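As an illustration of the Bradley-Terry-Luce (BTL) framework named in the abstract, the sketch below fits BTL scale values to a matrix of pairwise comparison counts. It is not the authors' implementation. The function name `fit_btl`, the iterative minorization-maximization update, and the comparison counts are all assumptions made for this example; the counts are invented and are not data from the paper.

```python
def fit_btl(wins, iterations=200):
    """Estimate BTL scale values from pairwise comparison counts.

    wins[i][j] is the number of times item i was preferred over item j.
    Returns a list of positive scale values that sum to 1.
    Hypothetical helper: a simple MM-style iteration, not the paper's code.
    """
    n = len(wins)
    p = [1.0 / n] * n  # start from equal scale values
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total_wins = 0.0
            denom = 0.0
            for j in range(n):
                if j == i:
                    continue
                pair_count = wins[i][j] + wins[j][i]
                if pair_count == 0:
                    continue  # this pair was never compared
                total_wins += wins[i][j]
                denom += pair_count / (p[i] + p[j])
            # keep the previous value for an item that was never compared
            updated.append(total_wins / denom if denom > 0 else p[i])
        norm = sum(updated)
        p = [value / norm for value in updated]
    return p


# Invented counts for three stimuli, each pair compared 10 times.
example_wins = [
    [0, 8, 9],
    [2, 0, 7],
    [1, 3, 0],
]
print(fit_btl(example_wins))
```

Under the BTL model, the probability that item i is preferred over item j is p[i] / (p[i] + p[j]), so a larger fitted value means the item is chosen more often in paired comparisons.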

https://hal.inria.fr/inria-00452615
Contributor : Valentin Emiya
Submitted on : Tuesday, February 2, 2010 - 4:56:18 PM
Last modification on : Tuesday, December 8, 2020 - 10:21:47 AM
Long-term archiving on : Friday, June 18, 2010 - 6:26:46 PM

File

ISMIR2008_203.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : inria-00452615, version 1

Citation

Adrien Daniel, Valentin Emiya, Bertrand David. Perceptually-based evaluation of the errors usually made when automatically transcribing music. Proc. Int. Conf. Music Information Retrieval (ISMIR), Sep 2008, Philadelphia, PA, United States. ⟨inria-00452615⟩
