Syntactic Reanalysis in Language Models for Speech Recognition

Johannes Twiefel; Xavier Hinaut; Stefan Wermter

Conference paper. Year: 2017

Johannes Twiefel

Role: Author

Knowledge Technology group [Hamburg]

Xavier Hinaut

Role: Author
PersonId : 8171
IdHAL : xavier-hinaut
ORCID : 0000-0002-1924-1184
IdRef : 22823218X

Knowledge Technology group [Hamburg]

Mnemonic Synergy

Stefan Wermter

Role: Author

Knowledge Technology group [Hamburg]

Abstract

State-of-the-art speech recognition systems steadily increase their performance using different variants of deep neural networks and postprocess the results by employing N-gram statistical models trained on a large amount of general-purpose data. While achieving excellent performance in terms of Word Error Rate (17.343% on our Human-Robot Interaction data set), state-of-the-art systems generate hypotheses that are grammatically incorrect in 57.316% of the cases. Moreover, if employed in a restricted domain (e.g. Human-Robot Interaction), around 50% of the hypotheses contain out-of-domain words. The latter are confused with similarly pronounced in-domain words and cannot be interpreted by a domain-specific inference system. State-of-the-art speech recognition systems lack a mechanism that addresses the syntactic correctness of hypotheses. We propose a system that can detect and repair grammatically incorrect or infrequent sentence forms. It is inspired by a computational neuroscience model that we developed previously. The current system is still a proof-of-concept version of a future, neurobiologically more plausible neural network model. The resulting system postprocesses sentence hypotheses of state-of-the-art speech recognition systems, producing in-domain words in 100% of the cases and syntactically and grammatically correct hypotheses in 90.319% of the cases. Moreover, it reduces the Word Error Rate to 11.038%.
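
The abstract reports its main results as Word Error Rate (WER). As an illustration of how such a figure is conventionally computed, here is a minimal Python sketch. It is not the authors' evaluation code: it assumes whitespace tokenization and the standard definition of WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the example sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one similar-sounding word is substituted.
wer = word_error_rate("bring me the red cup", "bring me the bread cup")
print(f"WER = {wer:.1%}")
```

Because insertions count as errors while the denominator is the reference length, WER computed this way can exceed 100% for hypotheses much longer than the reference.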

Keywords

natural language processing; syntax; N-gram; speech recognition; phoneme; grapheme; syntactic reanalysis; domain-specific; Human-Robot Interaction

Domains

Linguistics; Neural networks [cs.NE]; Robotics [cs.RO]; Neuroscience [q-bio.NC]; Computation and language [cs.CL]

Main file

twiefel_ICDL_EpiRob_2017__generated_by_xav.pdf (259.42 KB)

Origin: Files produced by the author(s)

Contributor: Xavier Hinaut

https://inria.hal.science/hal-01558462

Submitted on: Friday, 7 July 2017, 17:47:20

Last modified on: Thursday, 15 February 2024, 03:31:23

Dates and versions

hal-01558462 , version 1 (07-07-2017)

hal-01558462 , version 2 (07-07-2017)

Identifiers

HAL Id : hal-01558462 , version 2

Cite

Johannes Twiefel, Xavier Hinaut, Stefan Wermter. Syntactic Reanalysis in Language Models for Speech Recognition. 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), Sep 2017, Lisbon, Portugal. ⟨hal-01558462v2⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

276 views

337 downloads
