Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Computer Science and Communication Engineering Année : 2014

Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation

Résumé

—Word Confidence Estimation (WCE) is the task of predicting the correct and incorrect words in the MT output. Dealing with this problem, this paper proposes some ideas to build a binary estimator and then enhance its prediction capability. We integrate a number of features of various types (system-based, lexical, syntactic and semantic) into the conventional feature set, to build our classifier. After the experiment with all features, we deploy a " Feature Selection " strategy to filter the best performing ones. Next, we propose a method that combines multiple " weak " classifiers to build a strong " composite " classifier by taking advantage of their complementarity. Experimental results show that our propositions helped to achieve a better performance in term of F-score. Finally, we test whether WCE output can play any role in improving the sentence level confidence estimation system.
Fichier principal
Vignette du fichier
bare_jrnl.pdf (324.33 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01002931 , version 1 (23-02-2018)

Identifiants

  • HAL Id : hal-01002931 , version 1

Citer

Ngoc-Quang Luong, Laurent Besacier, Benjamin Lecouteux. Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation. Journal of Computer Science and Communication Engineering, 2014. ⟨hal-01002931⟩
180 Consultations
32 Téléchargements

Partager

Gmail Facebook X LinkedIn More