Word- and sentence-level confidence measures for machine translation - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2009

Word- and sentence-level confidence measures for machine translation

Résumé

A machine translated sentence is seldom completely correct. Confidence measures are designed to detect incorrect words, phrases or sentences, or to provide an estimation of the probability of correctness. In this article we describe several word- and sentence-level confidence measures relying on different features: mutual information between words, n-gram and backward n-gram language models, and linguistic features. We also try different combination of these measures. Their accuracy is evaluated on a classification task. We achieve 17% error-rate (0.84 f-measure) on word-level and 31% error-rate (0.71 f-measure) on sentence-level.
Fichier principal
Vignette du fichier
CM-EAMT.pdf (103 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00417541 , version 1 (16-09-2009)

Identifiants

  • HAL Id : inria-00417541 , version 1

Citer

Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli. Word- and sentence-level confidence measures for machine translation. 13th Annual Meeting of the European Association for Machine Translation - EAMT 09, May 2009, Barcelona, Spain. ⟨inria-00417541⟩
212 Consultations
251 Téléchargements

Partager

Gmail Facebook X LinkedIn More