New Confidence Measures for Statistical Machine Translation

Sylvain Raybaud; Caroline Lavecchia; David Langlois; Kamel Smaïli

Conference paper, Year: 2009

Sylvain Raybaud

Role: Author
PersonId : 855011

Analysis, perception and recognition of speech

Caroline Lavecchia

Role: Author
PersonId : 835619

Analysis, perception and recognition of speech

David Langlois

Role: Author
PersonId : 298
IdHAL : david-langlois
IdRef : 070239509

Analysis, perception and recognition of speech

Kamel Smaïli

Role: Author
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Analysis, perception and recognition of speech

Abstract

A confidence measure estimates the reliability of a hypothesis produced by a machine translation system. Confidence estimation can be framed as a testing problem: we want to decide whether the most probable sequence of words produced by the machine translation system is correct or not. In the following we describe several original word-level confidence measures for machine translation, based on mutual information, n-gram language models, and language models built on lexical features. We evaluate how well they perform individually and in combination, and show that a combination of confidence measures based on mutual information yields a classification error rate as low as 25.1% with an F-measure of 0.708.
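The decision problem described in the abstract, and the two figures it reports, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the toy scores, the threshold of 0.5, and the choice to compute the F-measure on the "correct" class are hypothetical and are not taken from the paper, whose exact definitions are not given on this page.

```python
from typing import List, Tuple


def classify(scores: List[float], threshold: float) -> List[bool]:
    """Label a word as correct when its confidence score reaches the threshold."""
    return [s >= threshold for s in scores]


def error_rate_and_f_measure(
    predicted: List[bool], reference: List[bool]
) -> Tuple[float, float]:
    """Return (classification error rate, F-measure on the 'correct' class)."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    pairs = list(zip(predicted, reference))
    tp = sum(p and r for p, r in pairs)        # predicted correct, truly correct
    fp = sum(p and not r for p, r in pairs)    # predicted correct, truly wrong
    fn = sum(not p and r for p, r in pairs)    # predicted wrong, truly correct
    cer = (fp + fn) / len(reference)           # share of misclassified words
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return cer, f_measure


# Hypothetical toy data: one confidence score and one reference label per word.
scores = [0.9, 0.2, 0.7, 0.4, 0.8]
reference = [True, False, False, True, True]

predicted = classify(scores, threshold=0.5)
cer, f_measure = error_rate_and_f_measure(predicted, reference)
print(f"classification error rate = {cer:.3f}, F-measure = {f_measure:.3f}")
```

In a setup like this, the threshold would normally be tuned on held-out data, since raising it trades recall for precision on the "correct" class.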

Keywords

machine translation; confidence measure; mutual information

Domains

Computation and Language [cs.CL]; Other [cs.OH]

Main file

confidence_measures-article.pdf (288.37 KB)

slides.pdf (529.09 KB)

Origin: Files produced by the author(s)

Format: Other

https://inria.hal.science/inria-00333843

Submitted on: Friday, 30 January 2009, 15:59:45

Last modified on: Friday, 24 March 2023, 14:52:51

Long-term archiving on: Monday, 7 June 2010, 18:58:17

Dates and versions

inria-00333843, version 1 (30-01-2009)

Identifiers

HAL Id: inria-00333843, version 1
arXiv: 0902.1033

Cite

Sylvain Raybaud, Caroline Lavecchia, David Langlois, Kamel Smaïli. New Confidence Measures for Statistical Machine Translation. International Conference On Agents and Artificial Intelligence - ICAART 09, Jan 2009, Porto, Portugal. ⟨inria-00333843⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

89 views

227 downloads
