Conference papers

Word- and sentence-level confidence measures for machine translation

Sylvain Raybaud 1,*, Caroline Lavecchia 1, David Langlois 1, Kamel Smaïli 1
* Corresponding author
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: A machine-translated sentence is seldom completely correct. Confidence measures are designed to detect incorrect words, phrases, or sentences, or to estimate the probability of correctness. In this article we describe several word- and sentence-level confidence measures relying on different features: mutual information between words, n-gram and backward n-gram language models, and linguistic features. We also try different combinations of these measures. Their accuracy is evaluated on a classification task. We achieve a 17% error rate (0.84 F-measure) at the word level and a 31% error rate (0.71 F-measure) at the sentence level.
Document type: Conference papers

Contributor: Sylvain Raybaud
Submitted on: Wednesday, September 16, 2009 - 10:43:29 AM
Last modification on: Friday, February 26, 2021 - 3:28:05 PM
Long-term archiving on: Tuesday, October 16, 2012 - 10:56:02 AM


Files produced by the author(s)


  • HAL Id : inria-00417541, version 1



Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli. Word- and sentence-level confidence measures for machine translation. 13th Annual Meeting of the European Association for Machine Translation - EAMT 09, May 2009, Barcelona, Spain. ⟨inria-00417541⟩


