Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation - Archive ouverte HAL Access content directly
Journal Articles Journal of Computer Science and Communication Engineering Year : 2014

Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation

(1) , (1) , (1)
1

Abstract

—Word Confidence Estimation (WCE) is the task of predicting the correct and incorrect words in the MT output. Dealing with this problem, this paper proposes some ideas to build a binary estimator and then enhance its prediction capability. We integrate a number of features of various types (system-based, lexical, syntactic and semantic) into the conventional feature set, to build our classifier. After the experiment with all features, we deploy a " Feature Selection " strategy to filter the best performing ones. Next, we propose a method that combines multiple " weak " classifiers to build a strong " composite " classifier by taking advantage of their complementarity. Experimental results show that our propositions helped to achieve a better performance in term of F-score. Finally, we test whether WCE output can play any role in improving the sentence level confidence estimation system.
Fichier principal
Vignette du fichier
bare_jrnl.pdf (324.33 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01002931 , version 1 (23-02-2018)

Identifiers

  • HAL Id : hal-01002931 , version 1

Cite

Ngoc-Quang Luong, Laurent Besacier, Benjamin Lecouteux. Some Propositions to Improve the Prediction Capability of Word Confidence Estimation for Machine Translation. Journal of Computer Science and Communication Engineering, 2014. ⟨hal-01002931⟩
172 View
27 Download

Share

Gmail Facebook Twitter LinkedIn More