Skip to Main content Skip to Navigation
Conference papers

LORIA System for the WMT13 Quality Estimation Shared Task

David Langlois 1, * Kamel Smaïli 1
* Corresponding author
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper we present the system we submitted to the WMT13 shared task on Quality Estimation. We participated to the Task 1.1. Each translated sentence is given a score between 0 and 1. The score is obtained by using several numerical or boolean features calculated according to the source and target sentences. We perform a linear regression of the feature space against scores in the range [0..1], to this end, we use a Support Vector Machine with 66 features. In this paper, we propose to increase the size of the training corpus. For that, we decide to use the post-edited and reference corpora in the training step after assigning a score to each sentence of these corpora. Then, we tune these scores on a development corpus. This leads to an improvement of 10.5% on the development corpus, in terms of Mean Average Error, but achieves only a sligth improvement on the test corpus.
Complete list of metadata

Cited literature [8 references]  Display  Hide  Download
Contributor : David Langlois Connect in order to contact the contributor
Submitted on : Wednesday, November 15, 2017 - 10:46:28 AM
Last modification on : Saturday, October 16, 2021 - 11:26:08 AM
Long-term archiving on: : Friday, February 16, 2018 - 2:02:34 PM


Files produced by the author(s)


  • HAL Id : hal-00923623, version 1



David Langlois, Kamel Smaïli. LORIA System for the WMT13 Quality Estimation Shared Task. ACL 2013 - Eighth Workshop on Statistical Machine Translation, Aug 2013, Sofia, Bulgaria. pp.380 - 385. ⟨hal-00923623⟩



Les métriques sont temporairement indisponibles