Active hearing, active speaking

Martin Cooke 1 Yan-Chen Lu 1 Youyi Lu 1 Radu Horaud 2
2 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : A static view of the world permeates most research in speech and hearing. In this idealised situation, sources don't move and neither do listeners; the acoustic environment doesn't change; and speakers speak without any effect of auditory input from their own voice or other speakers. Corpora for speech research and most behavioural tasks have grown to reflect the static viewpoint. Yet it is clear that speech and hearing takes place in a world where none of the static assumptions hold, or at least not for long. The dynamic view complicates traditional signal processing approaches, and renders conventional evaluation processes unrepeatable since the observer's dynamics influence the signals received at the ears. However, the dynamic viewpoint also provides many opportunities for active processes to exploit. Some of these, such as the use of head movements to resolve front-back confusions, are well-known, while others exist solely as hypotheses. This paper reviews known and potential benefits of active processes in both hearing and speech production, and goes on to describe two recent studies which demonstrate the value of such processes. The first shows how dynamic cues can be used to estimate distance in an acoustic environment. The second demonstrates that the changes in speech production which take place when other speakers are active result in increased glimpsing opportunities at the ear of the interlocutor.
Type de document :
Communication dans un congrès
T. Dau and J. M. Buchholz and J. M. Harte and T. U. Christiansen. ISAAR 2007 - International Symposium on Auditory and Audiological Research, Aug 2007, Helsingor, Denmark. pp.33-46, 2008
Liste complète des métadonnées

Littérature citée [31 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00590228
Contributeur : Team Perception <>
Soumis le : mardi 3 mai 2011 - 09:46:34
Dernière modification le : jeudi 11 janvier 2018 - 01:48:44
Document(s) archivé(s) le : jeudi 4 août 2011 - 03:07:10

Fichier

cooke_isaar2007.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00590228, version 1

Collections

Citation

Martin Cooke, Yan-Chen Lu, Youyi Lu, Radu Horaud. Active hearing, active speaking. T. Dau and J. M. Buchholz and J. M. Harte and T. U. Christiansen. ISAAR 2007 - International Symposium on Auditory and Audiological Research, Aug 2007, Helsingor, Denmark. pp.33-46, 2008. 〈inria-00590228〉

Partager

Métriques

Consultations de la notice

462

Téléchargements de fichiers

197