Automatic enhancement of speech intelligibility

Vincent Colotte 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a speech signal transformation which slows down speech signals selectively and enhances some important acoustic cues. This transformation can be used not only for hearing aids but also for second language acquisition by facilitating oral comprehension. Selective slowing down relies on the use the TD-PSOLA synthesis method. An automatic pitch marking algorithm was designed to apply this method automatically. The strategy used to control slowing down exploits a spectral variation function which locates rapid spectral changes. The enhancement simply consists of amplifying stop bursts and unvoiced fricatives. These acoustic cues are detected automatically through the examination of energy criteria. This approach was evaluated in the context of second language acquisition, more precisely by evaluating improvements in oral comprehension. Transformations trigerred properly, i.e. the signal regions modified are those which were expected to be modified. Experiments show that the oral comprehension is improved.
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech, & Signal Processing - ICASSP'2000, 2000, Istanbul, Turkey, 2000
Liste complète des métadonnées

https://hal.inria.fr/inria-00099321
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 08:52:45
Dernière modification le : jeudi 11 janvier 2018 - 06:19:57

Identifiants

  • HAL Id : inria-00099321, version 1

Collections

Citation

Vincent Colotte, Yves Laprie. Automatic enhancement of speech intelligibility. IEEE International Conference on Acoustics, Speech, & Signal Processing - ICASSP'2000, 2000, Istanbul, Turkey, 2000. 〈inria-00099321〉

Partager

Métriques

Consultations de la notice

262