Non-linear vector interpolation by neural network for phoneme identification in continuous speech

Yifan Gong; Jean-Paul Haton

Rapport (Rapport De Recherche) Année : 1991

Non-linear vector interpolation by neural network for phoneme identification in continuous speech

(1) , (1)

Yifan Gong

Fonction : Auteur

INRIA Lorraine

Jean-Paul Haton

Fonction : Auteur
PersonId : 830987

INRIA Lorraine

Résumé

The coorelation between vectors in a sequence of analysis frames are supposed to be specific to phonetic units in acoustic-phonetic decoding of speech. We propose non-linear vector interpolation techniques to represent this correlation and to recognize phonemes. The interpolation is based on the decomposition of a frame sequence into two parts and on the construction of a function that interpolates one part using information from the second part. According to quantities to be interpolated, three families of interpolator models are developed. In a recognition system, each phonetic symbol is associated with a non-linear vector interpolator which is trained to give minimum interpolation error for that specific phoneme. Multi-layer feedforward neural networks are used to implement the non-linear vector interpolators. For a continuous speech phoneme spotting test using 16 LPCC-derived cepstrum coefficients as parametric vectors, the three categories of models gave compatible results. Vector-pair interpolator models yielded best recognition rate. Compared to a VQ-coded reference technique, this model gives close global recognition rate and significatly outperforms for plosive sounds.

Domaines

Autre [cs.OH]

Fichier principal

RR-1457.pdf (721.91 Ko)

Rapport De Recherche Inria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00075104

Soumis le : mercredi 24 mai 2006-17:25:41

Dernière modification le : mardi 7 février 2023-03:40:54

Archivage à long terme le : mardi 12 avril 2011-21:09:14

Dates et versions

inria-00075104 , version 1 (24-05-2006)

Identifiants

HAL Id : inria-00075104 , version 1

Citer

Yifan Gong, Jean-Paul Haton. Non-linear vector interpolation by neural network for phoneme identification in continuous speech. [Research Report] RR-1457, INRIA. 1991. ⟨inria-00075104⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA-RRRT INRIA2 LARA

81 Consultations

95 Téléchargements

Non-linear vector interpolation by neural network for phoneme identification in continuous speech

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager