Automatic Speech Recognition: An Improved Paradigm

Tudor-Sabin Topoleanu; Gheorghe Leonte Mogan

doi:10.1007/978-3-642-19170-1_29

Conference paper, Year: 2011

Tudor-Sabin Topoleanu

Role: Author
PersonId : 1013339

Transilvania University of Brasov

Gheorghe Leonte Mogan

Role: Author
PersonId : 988830

Transilvania University of Brasov

Abstract

In this paper we present a short survey of automatic speech recognition systems, underlining the achievements and capabilities of current solutions as well as their inherent limitations and shortcomings. In response, we propose an improved paradigm and algorithm for building an automatic speech recognition system that actively adapts its recognition model in an unsupervised fashion by listening to continuous human speech. The paradigm relies on a semi-autonomous system that samples continuous human speech in order to record phonetic units. The system then processes those phoneme-sized samples to measure the degree of similarity between them, which allows the same phoneme to be detected across many samples. After a sufficiently large database of samples has been gathered, the system clusters the samples by their degree of similarity, creating a separate cluster for each phoneme. The system then trains one neural network per cluster, using the samples in that cluster. After a few iterations of sampling, processing, clustering and training, the system should contain a neural network detector for each phoneme of the spoken language to which it has been exposed, and should be able to use these detectors to recognize phonemes in live speech. Finally, we provide the structure and algorithms for this novel automatic speech recognition paradigm.
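The loop described in the abstract (measure similarity, cluster, train one detector per cluster, recognize) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the two-dimensional synthetic "phoneme" vectors, Euclidean distance as the similarity measure, greedy leader clustering, and a single logistic neuron standing in for each neural network. All function names are hypothetical, and the sketch runs one iteration rather than the repeated cycle the abstract mentions.

```python
import math
import random

# Hypothetical stand-ins for phoneme-sized samples: noisy copies of 3 prototypes.
PROTOTYPES = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]

def make_synthetic_samples(per_class=20, seed=0):
    rng = random.Random(seed)
    samples = []
    for _ in range(per_class):
        for p in PROTOTYPES:
            samples.append([c + rng.uniform(-0.1, 0.1) for c in p])
    return samples

def distance(a, b):
    """Euclidean distance; a small distance is treated as high similarity (assumption)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cluster_samples(samples, threshold):
    """Greedy leader clustering: join the first cluster whose centroid is
    closer than `threshold`, otherwise start a new cluster."""
    clusters, centroids = [], []
    for i, s in enumerate(samples):
        for c, centroid in enumerate(centroids):
            if distance(s, centroid) < threshold:
                clusters[c].append(i)
                n = len(clusters[c])
                # Incremental update of the running mean.
                centroids[c] = [m + (x - m) / n for m, x in zip(centroid, s)]
                break
        else:
            clusters.append([i])
            centroids.append(list(s))
    return clusters

def sigmoid(z):
    z = max(-60.0, min(60.0, z))  # clip to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def train_detector(positives, negatives, epochs=100, lr=0.5):
    """Train one logistic neuron (a stand-in for a neural network) to fire
    on samples of one cluster and stay silent on all others."""
    w = [0.0] * len(positives[0])
    b = 0.0
    data = [(x, 1.0) for x in positives] + [(x, 0.0) for x in negatives]
    for _ in range(epochs):
        for x, y in data:
            g = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return lambda x: sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def build_detectors(samples, threshold):
    """Cluster the samples, then train one detector per cluster."""
    clusters = cluster_samples(samples, threshold)
    detectors = []
    for members in clusters:
        inside = set(members)
        pos = [samples[i] for i in members]
        neg = [s for i, s in enumerate(samples) if i not in inside]
        detectors.append(train_detector(pos, neg))
    return clusters, detectors

def recognize(detectors, x):
    """Return the index of the detector that responds most strongly to x."""
    scores = [d(x) for d in detectors]
    return max(range(len(scores)), key=scores.__getitem__)

if __name__ == "__main__":
    samples = make_synthetic_samples()
    clusters, detectors = build_detectors(samples, threshold=0.5)
    print("clusters found:", len(clusters))
    print("label for a new sample:", recognize(detectors, [0.95, 0.05]))
```

In a real system the input vectors would come from acoustic features of recorded speech, and the distance threshold, clustering method, and network architecture would all need to be chosen from the paper's own algorithm description rather than from this sketch.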

Keywords

automatic speech recognition; natural language processing; probabilistic language acquisition; unsupervised learning of speech

Domains

Computer Science [cs]

Main file

978-3-642-19170-1_29_Chapter.pdf (89.61 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01566588

Submitted on: Friday, 21 July 2017, 11:25:43

Last modified on: Friday, 21 July 2017, 11:30:44

Dates and versions

hal-01566588 , version 1 (21-07-2017)

License

Attribution

Identifiers

HAL Id : hal-01566588 , version 1
DOI : 10.1007/978-3-642-19170-1_29

Cite

Tudor-Sabin Topoleanu, Gheorghe Leonte Mogan. Automatic Speech Recognition: An Improved Paradigm. 2nd Doctoral Conference on Computing, Electrical and Industrial Systems (DoCEIS), Feb 2011, Costa de Caparica, Portugal. pp.269-276, ⟨10.1007/978-3-642-19170-1_29⟩. ⟨hal-01566588⟩

Collections

IFIP IFIP-AICT IFIP-TC IFIP-TC5 IFIP-WG IFIP-WG5-5 IFIP-DOCEIS IFIP-AICT-349
