
On The Use of High Order Derivatives for High Performance Alphabet Recognition

Joseph Di Martino 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper I propose new feature vectors for automatic speech recognition. They are based on Mel-cepstrum vectors augmented with derivatives. Many systems described in the literature use just two derivatives (delta and delta-delta), but none explores higher-order derivatives. This paper presents alphabet recognition results on the Isolet database using feature vectors that contain derivatives up to the fifth order. I did not use the HTK toolkit from Cambridge University for this work; I developed my own HMM system. With vectors incorporating all derivatives up to the fifth, the system achieved a mean recognition accuracy of 97.54%, a result comparable to the best published one on this database (97.6%) once the confidence interval for recognition accuracy on this task (approximately 0.3%) is taken into account. Importantly, this result was obtained without segmenting the speech files with an endpoint detection algorithm, which is an unfavourable experimental condition compared with previously published work. As a consequence, my system is one of the most powerful systems ever implemented for alphabet recognition.
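The abstract does not say how the derivatives are computed, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes the commonly used regression formula for delta coefficients, applied repeatedly to obtain higher orders, with frame indices clamped at the utterance edges. The function names `delta` and `stack_derivatives` and the `window` half-width of 2 are illustrative assumptions.

```python
def delta(frames, window=2):
    """Regression-based derivative of a sequence of feature vectors.

    frames: list of equal-length lists of floats, one list per frame.
    window: half-width of the regression window (an assumed value).
    """
    n = len(frames)
    denom = 2 * sum(k * k for k in range(1, window + 1))
    out = []
    for t in range(n):
        row = []
        for d in range(len(frames[t])):
            acc = 0.0
            for k in range(1, window + 1):
                nxt = frames[min(t + k, n - 1)][d]  # clamp at the last frame
                prv = frames[max(t - k, 0)][d]      # clamp at the first frame
                acc += k * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_derivatives(frames, max_order=5, window=2):
    """Append derivatives of order 1..max_order to every static frame."""
    streams = [frames]
    for _ in range(max_order):
        # Each order is the derivative of the previous order's stream.
        streams.append(delta(streams[-1], window))
    return [[x for s in streams for x in s[t]] for t in range(len(frames))]
```

With `max_order=5`, each output vector holds the static coefficients followed by five derivative streams, so its dimension is six times that of the static vector.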
Document type : Conference papers

Cited literature [10 references]

https://hal.inria.fr/inria-00099412
Contributor : Joseph Di Martino
Submitted on : Tuesday, June 18, 2013 - 9:25:31 AM
Last modification on : Thursday, January 11, 2018 - 6:19:57 AM
Document(s) archived on : Thursday, September 19, 2013 - 2:25:14 AM

Identifiers

  • HAL Id : inria-00099412, version 1

Citation

Joseph Di Martino. On The Use of High Order Derivatives for High Performance Alphabet Recognition. International Conference on Acoustics Speech and Signal Processing - ICASSP 2002, 2002, Orlando, Florida, United States. 4 p. ⟨inria-00099412⟩
