An online EM algorithm in hidden (semi-)Markov models for audio segmentation and clustering

Alberto Bietti 1 Francis Bach 2, 3 Arshia Cont 4, 1
1 MuTant - Synchronous Realtime Processing and Programming of Music Signals
Inria Paris-Rocquencourt, UPMC - Université Pierre et Marie Curie - Paris 6, IRCAM, CNRS - Centre National de la Recherche Scientifique
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, ENS Paris - École normale supérieure - Paris, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
4 Repmus - Représentations musicales
STMS - Sciences et Technologies de la Musique et du Son
Abstract : Audio segmentation is an essential problem in many audio signal processing tasks, which tries to segment an audio signal into homogeneous chunks. Rather than separately finding change points and computing similarities between segments, we focus on joint segmentation and clustering, using the framework of hidden Markov and semi-Markov models. We introduce a new incremental EM algorithm for hidden Markov models (HMMs) and show that it compares favorably to existing online EM algorithms for HMMs. We present results for real-time segmentation of musical notes and acoustic scenes.
Type de document :
Communication dans un congrès
ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Australia. 2015
Liste complète des métadonnées

Littérature citée [23 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01115826
Contributeur : Alberto Bietti <>
Soumis le : mercredi 11 février 2015 - 19:18:01
Dernière modification le : vendredi 25 mai 2018 - 12:02:06
Document(s) archivé(s) le : jeudi 28 mai 2015 - 11:31:20

Fichier

icassp_online_em_hmm.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-01115826, version 1

Collections

Citation

Alberto Bietti, Francis Bach, Arshia Cont. An online EM algorithm in hidden (semi-)Markov models for audio segmentation and clustering. ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Australia. 2015. 〈hal-01115826〉

Partager

Métriques

Consultations de la notice

944

Téléchargements de fichiers

2471