Word- and sentence-level confidence measures for machine translation

Sylvain Raybaud; Caroline Lavecchia; David Langlois; Kamel Smaïli

Communication Dans Un Congrès Année : 2009

Word- and sentence-level confidence measures for machine translation

(1) , (1) , (1) , (1)

Sylvain Raybaud

Fonction : Auteur correspondant
PersonId : 855011

Connectez-vous pour contacter l'auteur

Analysis, perception and recognition of speech

Caroline Lavecchia

Fonction : Auteur
PersonId : 863263

Analysis, perception and recognition of speech

David Langlois

Fonction : Auteur
PersonId : 298
IdHAL : david-langlois
IdRef : 070239509

Analysis, perception and recognition of speech

Kamel Smaïli

Fonction : Auteur
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Analysis, perception and recognition of speech

Résumé

A machine translated sentence is seldom completely correct. Confidence measures are designed to detect incorrect words, phrases or sentences, or to provide an estimation of the probability of correctness. In this article we describe several word- and sentence-level confidence measures relying on different features: mutual information between words, n-gram and backward n-gram language models, and linguistic features. We also try different combination of these measures. Their accuracy is evaluated on a classification task. We achieve 17% error-rate (0.84 f-measure) on word-level and 31% error-rate (0.71 f-measure) on sentence-level.

Mots clés

confidence measure machine translation mutual information

Domaines

Traitement du texte et du document

Fichier principal

CM-EAMT.pdf (103 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sylvain Raybaud : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00417541

Soumis le : mercredi 16 septembre 2009-10:43:29

Dernière modification le : vendredi 24 mars 2023-14:52:52

Archivage à long terme le : mardi 16 octobre 2012-10:56:02

Dates et versions

inria-00417541 , version 1 (16-09-2009)

Identifiants

HAL Id : inria-00417541 , version 1

Citer

Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli. Word- and sentence-level confidence measures for machine translation. 13th Annual Meeting of the European Association for Machine Translation - EAMT 09, May 2009, Barcelona, Spain. ⟨inria-00417541⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

212 Consultations

252 Téléchargements

Word- and sentence-level confidence measures for machine translation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager