Frequency and Wavelet Filtering for Robust Speech Recognition

Murat Deviren; Khalid Daoudi

Communication Dans Un Congrès Année : 2003

Frequency and Wavelet Filtering for Robust Speech Recognition

(1) , (1)

Murat Deviren

Fonction : Auteur
PersonId : 835411

Analysis, perception and recognition of speech

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Analysis, perception and recognition of speech

Résumé

Mel-frequency cepstral coefficients (MFCC) are the most widely used features in current speech recognition systems. However, they have a poor physical interpretation and they do not lie in the frequency domain. Frequency filtering (FF) is a technique that has been recently developed to design frequency-localized speech features that perform similar to MFCC in terms of recognition performances. Motivated by our desire to build time-frequency speech models, we wanted to use the FF technique as front-end. However, when evaluating FF on the Aurora-3 database we found some discrepancies in the highly mismatch case. This led us to put FF in another perspective: the wavelet transform. By doing so, we were able to explain the discrepancies and to achieve significant improvements in recognition in the highly mismatch case.

Mots clés

wavelets reconnaissance de la parole speech recognition noise robustness frequency filtering robustesse au bruit ondelettes

Domaines

Autre [cs.OH]

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00099753

Soumis le : mardi 26 septembre 2006-09:40:57

Dernière modification le : jeudi 1 février 2024-10:05:01

Dates et versions

inria-00099753 , version 1 (26-09-2006)

Identifiants

HAL Id : inria-00099753 , version 1

Citer

Murat Deviren, Khalid Daoudi. Frequency and Wavelet Filtering for Robust Speech Recognition. Artificial Neural Networks and Neural Information Processing - Joint International Conference ICANN/ICONIP2003, 2003, Istanbul, Turquie, pp.452-462. ⟨inria-00099753⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

56 Consultations

0 Téléchargements

Frequency and Wavelet Filtering for Robust Speech Recognition

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager