inria-00100828, version 1
Dynamic Topic Identification : Introduction of Trigger pairs in the Cache Model
International Workshop Speech and Computer 2002 - SPECOM'2002 (2002) 4 p
Résumé : This paper focuses on dynamic topic identification for adaptive statistical language modeling in automatic speech recognition. It proposes a more correct solution of the cache model presented in from a mathematical point of view, which constitutes at the same time a simplification and an improvement of the model. Moreover, an original solution is put forward to overcome some limitations of this model. This new solution proposes to take into account the underlying semantic concepts of the text by introducing triggers of words into the cache memory. The relative identification accuracy is assessed on a newspaper corpus, highlighting a small increase of identification rate. In practice, the original cache model achieves an identification rate of 79.5%. Making use of the triggers, the feasibility of the task is investigated by an improvement of 1.2% in identification rate. This study is thus devoted to the specification of a possible direction to perform well dynamic topic identification for automatic speech recognition.
- a – UNIVERSITE NANCY 2
- b – UNIVERSITE HENRI POINCARE
- 1 :
- INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- Domaine : Informatique/Autre
- Mots-clés : reconnaissance de la parole – modèles de langage statistiques – identification du thème – modèle cache – triggers || speech recognition – statistical language models – topic identification – cache model – triggers
- Référence interne : A02-R-182 || bigi02a
- Commentaire : Colloque avec actes et comité de lecture. internationale.
- inria-00100828, version 1
- http://hal.inria.fr/inria-00100828
- oai:hal.inria.fr:inria-00100828
- Contributeur :
- Soumis le : Mardi 26 Septembre 2006, 14:52:09
- Dernière modification le : Jeudi 18 Janvier 2007, 12:10:54


Exporter