Multi-band automatic speech recognition

Christophe Cerisara; Dominique Fohr

Journal article. Computer Speech and Language. Year: 2001

Christophe Cerisara

Role: Author
PersonId : 2353
IdHAL : christophe-cerisara
IdRef : 102700168

Analysis, perception and recognition of speech

Dominique Fohr

Role: Author
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Abstract

This paper presents a new architecture for automatic speech recognition (ASR) systems, characterized by the division of the spectral domain of the speech signal into several independent frequency bands. The model is based on the psycho-acoustic work of Fletcher (1953), who proposed a similar principle for the human auditory system. In 1994, Jont B. Allen published a paper in which he summarized Fletcher's work and proposed adapting the multi-band paradigm to ASR (Allen, 1994). Many researchers have since studied this principle and built such ASR systems. The goal of this paper is to analyse some of the most important issues in the design of a multi-band ASR system, in order to determine which architecture it should have in which environment. Two other major problems are then considered: how to train multi-band systems and how to use them for continuous ASR.
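The multi-band principle described in the abstract can be illustrated with a minimal sketch. This is not the architecture studied in the paper: the record gives no band count, band limits, or recombination rule, so the equal-width band split, the weighted sum of per-band log-likelihoods, and all function names below are assumptions made for illustration. The last function shows the product-of-errors rule usually associated with Fletcher's work, under which the full-band error rate is the product of the per-band error rates.

```python
from math import prod


def split_into_bands(spectrum, n_bands):
    """Split a list of spectral values into n_bands contiguous bands.

    Equal-width bands are an assumption; real systems typically pick
    band limits on a perceptual frequency scale.
    """
    if not 1 <= n_bands <= len(spectrum):
        raise ValueError("n_bands must be between 1 and len(spectrum)")
    size, extra = divmod(len(spectrum), n_bands)
    bands, start = [], 0
    for i in range(n_bands):
        # The first `extra` bands receive one additional bin.
        end = start + size + (1 if i < extra else 0)
        bands.append(spectrum[start:end])
        start = end
    return bands


def combine_band_scores(band_scores, weights=None):
    """Recombine per-band class log-likelihoods by a weighted sum.

    band_scores: one dict per band, mapping class label -> log-likelihood.
    weights: one weight per band (uniform if omitted). Hypothetical scheme,
    chosen only because it is simple to state.
    """
    if weights is None:
        weights = [1.0 / len(band_scores)] * len(band_scores)
    classes = band_scores[0].keys()
    return {
        c: sum(w * scores[c] for w, scores in zip(weights, band_scores))
        for c in classes
    }


def product_of_errors(band_error_rates):
    """Full-band error rate under the product-of-errors rule."""
    return prod(band_error_rates)
```

In this sketch each band would be scored by its own recognizer, so noise confined to one band corrupts only that band's scores, and lowering that band's weight limits its influence on the combined decision.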

Keywords

speech recognition; multi-band

Domains

Other [cs.OH]

Contributor: Publications Loria

https://inria.hal.science/inria-00101094

Submitted on: Tuesday, 26 September 2006, 14:56:27

Last modified on: Friday, 24 March 2023, 14:52:48

Dates and versions

inria-00101094 , version 1 (26-09-2006)

Identifiers

HAL Id : inria-00101094 , version 1

Cite

Christophe Cerisara, Dominique Fohr. Multi-band automatic speech recognition. Computer Speech and Language, 2001, 15 (2), pp.151-174. ⟨inria-00101094⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA
