An Efficient Two-Pass Decoder for SMT Using Word Confidence Estimation - Archive ouverte HAL Access content directly
Conference Papers Year : 2014

An Efficient Two-Pass Decoder for SMT Using Word Confidence Estimation

(1) , (1) , (1)
1

Abstract

During decoding, the Statistical Machine Translation (SMT) decoder travels over all complete paths on the Search Graph (SG), seeks those with cheapest costs and back-tracks to read off the best translations. Although these winners beat the rest in model scores, there is no certain guarantee that they have the highest quality with respect to the human references. This paper exploits Word Confidence Estimation (WCE) scores in the second pass of decoding to enhance the Machine Translation (MT) quality. By using the confidence score of each word in the N-best list to update the cost of SG hypotheses containing it, we hope to " reinforce " or " weaken " them relied on word quality. After the update, new best translations are re-determined using updated costs. In the experiments on our real WCE scores and ideal (oracle) ones, the latter significantly boosts one-pass de-coder by 7.87 BLEU points, meanwhile the former yields an improvement of 1.49 points for the same metric.
Fichier principal
Vignette du fichier
eamt14.pdf (386.43 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01002922 , version 1 (23-02-2018)

Identifiers

  • HAL Id : hal-01002922 , version 1

Cite

Ngoc Quang Luong, Laurent Besacier, Benjamin Lecouteux. An Efficient Two-Pass Decoder for SMT Using Word Confidence Estimation. European Association for Machine Translation (EAMT), Jun 2014, Dubrovnik, Croatia. ⟨hal-01002922⟩
143 View
66 Download

Share

Gmail Facebook Twitter LinkedIn More