Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection

Résumé

This paper addresses the problem of segmenting small group meetings in order to detect different group configurations and activities in an intelligent environment. Our approach takes speech activity detection of individuals attending a meeting as input. The goal is to separate distinct distributions of speech activity observation corresponding to distinct group configurations and activities. We propose an unsupervised method based on the calculation of the Jeffrey divergence between histograms of speech activity observations. These histograms are generated from adjacent windows of variable size slid from the beginning to the end of a meeting recording. The peaks of the resulting Jeffrey divergence curves are detected using successive robust mean estimation. After a merging and filtering process, the retained peaks are used to select the best model, i.e. the best speech activity distribution allocation for a given meeting recording. These distinct distributions can be interpreted as distinct segments of group configuration and activity. To evaluate, we recorded 6 small group meetings. We measured the correspondence between detected segments and labeled group configurations and activities. The obtained results are promising, in particular as our method is completely unsupervised.
Fichier principal
Vignette du fichier
Brdiczka06b.pdf (271.65 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00326527 , version 1 (03-10-2008)

Identifiants

  • HAL Id : inria-00326527 , version 1

Citer

Oliver Brdiczka, Dominique Vaufreydaz, Jérôme Maisonnasse, Patrick Reignier. Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection. 3rd IFIP Conference on Artificial Intelligence Applications & Innovations (AIAI), Jun 2006, Athens, Greece. ⟨inria-00326527⟩
174 Consultations
140 Téléchargements

Partager

Gmail Facebook X LinkedIn More