Asynchrony in Multi-Band Speech Recognition

Christophe Cerisara 1 Dominique Fohr 1 Jean-Paul Haton 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper, we propose an algorithm for continuous speech recognition systems based on the Multi-Band principle. The algorithm allows the bands to be asynchronous, and its practical complexity is very close to that of the classical Viterbi algorithm. We discuss whether the bands should be constrained to be synchronous. We show that it is advantageous to let the bands be asynchronous, since our algorithm adds little complexity compared to the Viterbi algorithm. Moreover, the accuracy must be at least as good as when the bands are synchronous and, more importantly, models other than phones can be used in the bands.
Document type :
Conference papers

https://hal.inria.fr/inria-00107848
Contributor : Publications Loria
Submitted on : Thursday, October 19, 2006 - 9:11:44 AM
Last modification on : Tuesday, September 24, 2019 - 4:00:12 PM
Long-term archiving on : Wednesday, March 29, 2017 - 1:09:20 PM

Identifiers

  • HAL Id : inria-00107848, version 1

Citation

Christophe Cerisara, Dominique Fohr, Jean-Paul Haton. Asynchrony in Multi-Band Speech Recognition. IEEE International Conference on Acoustics, Speech, & Signal Processing - ICASSP'2000, 2000, Istanbul, Turkey, 4 p. ⟨inria-00107848⟩
