Multi-band automatic speech recognition

Christophe Cerisara 1 Dominique Fohr 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a new architecture for automatic speech recognition systems which is characterized by the division of the spectral domain of the speech signal into several independent frequency bands. This model is based on the psycho-acoustic work of Fletcher (1953) who proposed a similar principle for the human auditory system. Jont B. Allen published a paper in 1994 in which he summarized the work of Fletcher and also proposed to adapt the multi-band paradigm to automatic speech recognition (ASR) (Allen, 1994). Many researchers have then studied this principle and built such ASR systems. The goal of this paper is to analyse some of the most important issues in the design of a multi-band ASR system in order to determine which architecture it should have in which environment. Two other major problems are then considered: how to train multi-band systems and how to use them for continuous ASR.
Type de document :
Article dans une revue
Computer Speech and Language, Elsevier, 2001, 15 (2), pp.151-174
Liste complète des métadonnées

https://hal.inria.fr/inria-00101094
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 14:56:27
Dernière modification le : vendredi 9 février 2018 - 13:20:01

Identifiants

  • HAL Id : inria-00101094, version 1

Collections

Citation

Christophe Cerisara, Dominique Fohr. Multi-band automatic speech recognition. Computer Speech and Language, Elsevier, 2001, 15 (2), pp.151-174. 〈inria-00101094〉

Partager

Métriques

Consultations de la notice

143