Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection

Oliver Brdiczka (1), Dominique Vaufreydaz (1), Jérôme Maisonnasse (1), Patrick Reignier (1)
(1) PRIMA - Perception, recognition and integration for observation of activity
Inria Grenoble - Rhône-Alpes, UJF - Université Joseph Fourier - Grenoble 1, INPG - Institut National Polytechnique de Grenoble, CNRS - Centre National de la Recherche Scientifique : UMR5217
Abstract : This paper addresses the problem of segmenting small group meetings in order to detect different group configurations and activities in an intelligent environment. Our approach takes as input the speech activity detected for each individual attending a meeting. The goal is to separate distinct distributions of speech activity observations, which correspond to distinct group configurations and activities. We propose an unsupervised method based on the Jeffrey divergence between histograms of speech activity observations. These histograms are computed from adjacent windows of variable size that are slid from the beginning to the end of a meeting recording. The peaks of the resulting Jeffrey divergence curves are detected using successive robust mean estimation. After a merging and filtering process, the retained peaks are used to select the best model, i.e. the best allocation of speech activity distributions for a given meeting recording. These distinct distributions can be interpreted as distinct segments of group configuration and activity. To evaluate the method, we recorded six small group meetings and measured the correspondence between detected segments and labeled group configurations and activities. The results are promising, particularly because the method is completely unsupervised.
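The core step described in the abstract, comparing histograms from two adjacent windows slid along the recording, can be illustrated with a minimal Python sketch. Several things here are assumptions rather than details taken from the paper: each observation is taken to be an integer code for the current speech activity pattern, the window size is fixed rather than variable, and the divergence uses one common smoothed form, the sum over bins of p log(p/m) + q log(q/m) with m = (p + q)/2. The function names are hypothetical. Peak detection by robust mean estimation, peak merging and filtering, and model selection are not shown.

```python
import math
from collections import Counter


def histogram(observations, num_bins):
    """Normalized histogram of integer observation codes in [0, num_bins)."""
    counts = Counter(observations)
    total = len(observations)
    return [counts.get(b, 0) / total for b in range(num_bins)]


def jeffrey_divergence(p, q):
    """Assumed form: sum of p*log(p/m) + q*log(q/m), with m = (p + q) / 2.

    Empty bins contribute nothing, so the value is always finite.
    """
    total = 0.0
    for pi, qi in zip(p, q):
        m = (pi + qi) / 2.0
        if pi > 0:
            total += pi * math.log(pi / m)
        if qi > 0:
            total += qi * math.log(qi / m)
    return total


def divergence_curve(observations, window, num_bins):
    """Divergence between the two adjacent windows that meet at each index t."""
    curve = []
    for t in range(window, len(observations) - window + 1):
        left = histogram(observations[t - window:t], num_bins)
        right = histogram(observations[t:t + window], num_bins)
        curve.append((t, jeffrey_divergence(left, right)))
    return curve


# Toy recording (made up for illustration): the activity pattern
# changes from code 1 to code 2 at index 50.
observations = [1] * 50 + [2] * 50
curve = divergence_curve(observations, window=20, num_bins=4)
peak_t, peak_value = max(curve, key=lambda item: item[1])
print(peak_t, peak_value)
```

On this toy input the curve stays at zero while both windows see the same pattern and rises as the boundary between the windows approaches the change point, which is why peaks of the curve are natural candidates for segment boundaries. Repeating the computation for several window sizes would give one curve per size, in the spirit of the variable-size windows mentioned in the abstract.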
Document type :
Conference papers

https://hal.inria.fr/inria-00326527
Contributor : Dominique Vaufreydaz
Submitted on : Friday, October 3, 2008 - 12:27:32 PM
Last modification on : Tuesday, November 27, 2018 - 8:46:04 PM
Long-term archiving on : Friday, June 4, 2010 - 12:10:16 PM

File

Brdiczka06b.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00326527, version 1

Citation

Oliver Brdiczka, Dominique Vaufreydaz, Jérôme Maisonnasse, Patrick Reignier. Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection. 3rd IFIP Conference on Artificial Intelligence Applications & Innovations (AIAI), Jun 2006, Athens, Greece. ⟨inria-00326527⟩
