Skip to Main content Skip to Navigation
Theses

On temporal coherency of probabilistic models for audio-to-score alignment

Abstract : This thesis deals with automatic alignment of audio recordings with corresponding music scores. We study algorithmic solutions for this problem in the framework of probabilistic models which represent hidden evolution on the music score as stochastic process. We begin this work by investigating theoretical foundations of the design of such models. To do so, we undertake an axiomatic approach which is based on an application peculiarity: music scores provide nominal duration for each event, which is a hint for the actual and unknown duration. Thus, modeling this specific temporal structure through stochastic processes is our main problematic. We define temporal coherency as compliance with such prior information and refine this abstract notion by stating two criteria of coherency. Focusing on hidden semi-Markov models, we demonstrate that coherency is guaranteed by specific mathematical conditions on the probabilistic design and that fulfilling these prescriptions significantly improves precision of alignment algorithms. Such conditions are derived by combining two fields of mathematics, Lévy processes and total positivity of order 2. This is why the second part of this work is a theoretical investigation which extends existing results in the related literature.
Document type :
Theses
Complete list of metadatas

Cited literature [139 references]  Display  Hide  Download

https://hal.inria.fr/tel-01448687
Contributor : Abes Star :  Contact
Submitted on : Tuesday, May 23, 2017 - 10:16:08 AM
Last modification on : Friday, May 29, 2020 - 4:00:05 PM
Long-term archiving on: : Friday, August 25, 2017 - 12:26:25 AM

File

these_archivage_3150918o.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01448687, version 2

Citation

Philippe Cuvillier. On temporal coherency of probabilistic models for audio-to-score alignment. Sound [cs.SD]. Université Pierre et Marie Curie - Paris VI, 2016. English. ⟨NNT : 2016PA066532⟩. ⟨tel-01448687v2⟩

Share

Metrics

Record views

993

Files downloads

392