HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

An online EM algorithm in hidden (semi-)Markov models for audio segmentation and clustering

Alberto Bietti 1 Francis Bach 2, 3 Arshia Cont 4, 1
1 MuTant - Synchronous Realtime Processing and Programming of Music Signals
Inria Paris-Rocquencourt, UPMC - Université Pierre et Marie Curie - Paris 6, IRCAM - Institut de Recherche et Coordination Acoustique/Musique, CNRS - Centre National de la Recherche Scientifique
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique - ENS Paris, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
4 Repmus - Représentations musicales
STMS - Sciences et Technologies de la Musique et du Son
Abstract : Audio segmentation is an essential problem in many audio signal processing tasks, which tries to segment an audio signal into homogeneous chunks. Rather than separately finding change points and computing similarities between segments, we focus on joint segmentation and clustering, using the framework of hidden Markov and semi-Markov models. We introduce a new incremental EM algorithm for hidden Markov models (HMMs) and show that it compares favorably to existing online EM algorithms for HMMs. We present results for real-time segmentation of musical notes and acoustic scenes.
Complete list of metadata

Cited literature [23 references]  Display  Hide  Download

Contributor : Alberto Bietti Connect in order to contact the contributor
Submitted on : Wednesday, February 11, 2015 - 7:18:01 PM
Last modification on : Thursday, March 17, 2022 - 10:08:44 AM
Long-term archiving on: : Thursday, May 28, 2015 - 11:31:20 AM


Publisher files allowed on an open archive


  • HAL Id : hal-01115826, version 1


Alberto Bietti, Francis Bach, Arshia Cont. An online EM algorithm in hidden (semi-)Markov models for audio segmentation and clustering. ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Australia. ⟨hal-01115826⟩



Record views


Files downloads