Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection

Oliver Brdiczka 1 Dominique Vaufreydaz 1 Jérôme Maisonnasse 1 Patrick Reignier 1
1 PRIMA - Perception, recognition and integration for observation of activity
Inria Grenoble - Rhône-Alpes, UJF - Université Joseph Fourier - Grenoble 1, INPG - Institut National Polytechnique de Grenoble , CNRS - Centre National de la Recherche Scientifique : UMR5217
Abstract : This paper addresses the problem of segmenting small group meetings in order to detect different group configurations and activities in an intelligent environment. Our approach takes speech activity detection of individuals attending a meeting as input. The goal is to separate distinct distributions of speech activity observation corresponding to distinct group configurations and activities. We propose an unsupervised method based on the calculation of the Jeffrey divergence between histograms of speech activity observations. These histograms are generated from adjacent windows of variable size slid from the beginning to the end of a meeting recording. The peaks of the resulting Jeffrey divergence curves are detected using successive robust mean estimation. After a merging and filtering process, the retained peaks are used to select the best model, i.e. the best speech activity distribution allocation for a given meeting recording. These distinct distributions can be interpreted as distinct segments of group configuration and activity. To evaluate, we recorded 6 small group meetings. We measured the correspondence between detected segments and labeled group configurations and activities. The obtained results are promising, in particular as our method is completely unsupervised.
Type de document :
Communication dans un congrès
3rd IFIP Conference on Artificial Intelligence Applications & Innovations (AIAI), Jun 2006, Athens, Greece. 2006
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00326527
Contributeur : Dominique Vaufreydaz <>
Soumis le : vendredi 3 octobre 2008 - 12:27:32
Dernière modification le : mercredi 11 avril 2018 - 01:51:41
Document(s) archivé(s) le : vendredi 4 juin 2010 - 12:10:16

Fichier

Brdiczka06b.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00326527, version 1

Collections

Citation

Oliver Brdiczka, Dominique Vaufreydaz, Jérôme Maisonnasse, Patrick Reignier. Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection. 3rd IFIP Conference on Artificial Intelligence Applications & Innovations (AIAI), Jun 2006, Athens, Greece. 2006. 〈inria-00326527〉

Partager

Métriques

Consultations de la notice

300

Téléchargements de fichiers

147