THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE

Abstract : This paper proposes an in-depth look at the influence of different speech and audio codecs on the performance of our continuous speech recognition engine. GSM full rate, G711, G723.1 and MPEG coders are investigated. It is shown that MPEG transcoding degrades the speech recognition performance for low bitrates whereas performance remains acceptable for specialized speech coders like GSM or G711. A new strategy is proposed to cope with degradation due to low bitrate coding. The acoustic models of the speech recognition system are trained with transcoded speech (one acoustic model for each speech / audio codec). First results show that this strategy allows to recover acceptable performance.
Type de document :
Communication dans un congrès
IEEE Multimedia Signal Processing Workshop, Oct 2001, Cannes, France. pp. 301-306, 2001
Liste complète des métadonnées

Littérature citée [7 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00326165
Contributeur : Dominique Vaufreydaz <>
Soumis le : jeudi 2 octobre 2008 - 09:14:34
Dernière modification le : jeudi 2 octobre 2008 - 20:39:13
Document(s) archivé(s) le : vendredi 4 juin 2010 - 12:05:53

Fichier

Besacier01b.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00326165, version 1

Citation

Laurent Besacier, Carole Bergamini, Dominique Vaufreydaz, Eric Castelli. THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE. IEEE Multimedia Signal Processing Workshop, Oct 2001, Cannes, France. pp. 301-306, 2001. 〈inria-00326165〉

Partager

Métriques

Consultations de la notice

120

Téléchargements de fichiers

555