Skip to Main content Skip to Navigation

On temporal coherency of probabilistic models for audio-to-score alignment

Abstract : This thesis deals with automatic alignment of audio recordings with corresponding music scores. We study algorithmic solutions for this problem in the framework of probabilistic models which represent hidden evolution on the music score as stochastic process. We begin this work by investigating theoretical foundations of the design of such models. To do so, we undertake an axiomatic approach which is based on an application peculiarity: music scores provide nominal duration for each event, which is a hint for the actual and unknown duration. Thus, modeling this specific temporal structure through stochastic processes is our main problematic. We define temporal coherency as compliance with such prior information and refine this abstract notion by stating two criteria of coherency. Focusing on hidden semi-Markov models, we demonstrate that coherency is guaranteed by specific mathematical conditions on the probabilistic design and that fulfilling these prescriptions significantly improves precision of alignment algorithms. Such conditions are derived by combining two fields of mathematics, Lévy processes and total positivity of order 2. This is why the second part of this work is a theoretical investigation which extends existing results in the related literature.
Document type :
Complete list of metadata

Cited literature [139 references]  Display  Hide  Download
Contributor : Abes Star :  Contact
Submitted on : Tuesday, May 23, 2017 - 10:16:08 AM
Last modification on : Saturday, January 9, 2021 - 5:40:47 PM
Long-term archiving on: : Friday, August 25, 2017 - 12:26:25 AM


Version validated by the jury (STAR)


  • HAL Id : tel-01448687, version 2


Philippe Cuvillier. On temporal coherency of probabilistic models for audio-to-score alignment. Sound [cs.SD]. Université Pierre et Marie Curie - Paris VI, 2016. English. ⟨NNT : 2016PA066532⟩. ⟨tel-01448687v2⟩



Record views


Files downloads