Word Confidence Estimation for SMT N-best List Re-ranking - Archive ouverte HAL Access content directly
Conference Papers Year : 2014

Word Confidence Estimation for SMT N-best List Re-ranking

(1) , (1) , (1)
1

Abstract

This paper proposes to use Word Confidence Estimation (WCE) information to improve MT outputs via N-best list re-ranking. From the confidence label assigned for each word in the MT hypothesis , we add six scores to the baseline log-linear model in order to re-rank the N-best list. Firstly, the correlation between the WCE-based sentence-level scores and the conventional evaluation scores (BLEU, TER, TERp-A) is investigated. Then, the N-best list re-ranking is evaluated over different WCE system performance levels: from our real and efficient WCE system (ranked 1st during last WMT 2013 Quality Estimation Task) to an oracle WCE (which simulates an interactive scenario where a user simply validates words of a MT hypothesis and the new output will be automatically regenerated). The results suggest that our real WCE system slightly (but significantly) improves the baseline while the oracle one extremely boosts it; and better WCE leads to better MT quality.
no abstract
Fichier principal
Vignette du fichier
eacl2014_cameraready.pdf (332.82 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00953719 , version 1 (23-02-2018)

Identifiers

  • HAL Id : hal-00953719 , version 1

Cite

Ngoc-Quang Luong, Laurent Besacier, Benjamin Lecouteux. Word Confidence Estimation for SMT N-best List Re-ranking. Proceedings of the Workshop on Humans and Computer-assisted Translation (HaCaT) during EACL, 2014, Gothenburg, Sweden. ⟨hal-00953719⟩
281 View
98 Download

Share

Gmail Facebook Twitter LinkedIn More