Active hearing, active speaking

Martin Cooke 1, Yan-Chen Lu 1, Youyi Lu 1, Radu Horaud 2
2 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract: A static view of the world permeates most research in speech and hearing. In this idealised situation, sources don't move and neither do listeners; the acoustic environment doesn't change; and speakers speak without any effect of auditory input from their own voice or other speakers. Corpora for speech research and most behavioural tasks have grown to reflect the static viewpoint. Yet it is clear that speech and hearing take place in a world where none of the static assumptions hold, or at least not for long. The dynamic view complicates traditional signal processing approaches, and renders conventional evaluation processes unrepeatable since the observer's dynamics influence the signals received at the ears. However, the dynamic viewpoint also provides many opportunities for active processes to exploit. Some of these, such as the use of head movements to resolve front-back confusions, are well-known, while others exist solely as hypotheses. This paper reviews known and potential benefits of active processes in both hearing and speech production, and goes on to describe two recent studies which demonstrate the value of such processes. The first shows how dynamic cues can be used to estimate distance in an acoustic environment. The second demonstrates that the changes in speech production which take place when other speakers are active result in increased glimpsing opportunities at the ear of the interlocutor.
Document type :
Conference papers
https://hal.inria.fr/inria-00590228
Contributor : Team Perception
Submitted on : Tuesday, May 3, 2011 - 9:46:34 AM
Last modification on : Wednesday, April 11, 2018 - 1:59:05 AM
Long-term archiving on : Thursday, August 4, 2011 - 3:07:10 AM

File

cooke_isaar2007.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00590228, version 1

Citation

Martin Cooke, Yan-Chen Lu, Youyi Lu, Radu Horaud. Active hearing, active speaking. ISAAR 2007 - International Symposium on Auditory and Audiological Research, Aug 2007, Helsingor, Denmark. pp.33-46. ⟨inria-00590228⟩
