Asynchrony in Multi-Band Speech Recognition

Christophe Cerisara 1 Dominique Fohr 1 Jean-Paul Haton 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper, an algorithm for continuous speech recognition systems based on the Multi-Band principle is proposed. This algorithm allows the bands to be asynchronous and has a practical complexity that is very close to the complexity of the classical Viterbi algorithm. The question of whether the bands should be constrained to be synchronous or not is discussed. We show that it is advantageous to let the bands asynchronous, as the increase of complexity, compared to the Viterbi algorithm, is low with our algorithm. Moreover, the accuracy must be at least as good as when the bands are synchronous, and, more importantly, different models than phones, can be used in the bands. In this paper, an algorithm for continuous speech recognition systems based on the Multi-Band principle is proposed. This algorithm allows the bands to be asynchronous and has a practical complexity that is very close to the complexity of the classical Viterbi algorithm. The question of whether the bands should be constrained to be synchronous or not is discussed. We show that it is advantageous to let the bands asynchronous, as the increase of complexity, compared to the Viterbi algorithm, is low with our algorithm. Moreover, the accuracy must be at least as good as when the bands are synchronous, and, more importantly, different models than phones, can be used in the bands.
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech, & Signal Processing - ICASSP'2000, 2000, Istanbul, Turkey, 4 p, 2000
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00107848
Contributeur : Publications Loria <>
Soumis le : jeudi 19 octobre 2006 - 09:11:44
Dernière modification le : vendredi 9 février 2018 - 13:20:05
Document(s) archivé(s) le : mercredi 29 mars 2017 - 13:09:20

Identifiants

  • HAL Id : inria-00107848, version 1

Collections

Citation

Christophe Cerisara, Dominique Fohr, Jean-Paul Haton. Asynchrony in Multi-Band Speech Recognition. IEEE International Conference on Acoustics, Speech, & Signal Processing - ICASSP'2000, 2000, Istanbul, Turkey, 4 p, 2000. 〈inria-00107848〉

Partager

Métriques

Consultations de la notice

212

Téléchargements de fichiers

34