Conference papers

LORIA System for the WMT13 Quality Estimation Shared Task

David Langlois 1, * Kamel Smaïli 1
* Corresponding author
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper we present the system we submitted to the WMT13 shared task on Quality Estimation. We participated in Task 1.1, in which each translated sentence is given a score between 0 and 1. The score is obtained from several numerical or boolean features computed from the source and target sentences. We perform a linear regression of the feature space against scores in the range [0, 1]; to this end, we use a Support Vector Machine with 66 features. We propose to increase the size of the training corpus: we add the post-edited and reference corpora to the training step after assigning a score to each sentence of these corpora, and we then tune these scores on a development corpus. This leads to an improvement of 10.5% on the development corpus in terms of Mean Absolute Error, but achieves only a slight improvement on the test corpus.
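The error measure and the relative improvement quoted in the abstract can be illustrated with a minimal sketch. It assumes the measure is the usual mean absolute error between predicted and gold sentence scores, and that the 10.5% figure is a relative reduction of that error. The function names and all numbers below are hypothetical and are not taken from the paper.

```python
# Minimal sketch: mean absolute error (MAE) and relative improvement.
# Assumption: scores are sentence-level values in the range [0, 1].

def mae(predicted, gold):
    """Return the mean absolute error between two equal-length score lists."""
    if len(predicted) != len(gold) or not gold:
        raise ValueError("score lists must be non-empty and of equal length")
    return sum(abs(p - g) for p, g in zip(predicted, gold)) / len(gold)


def relative_improvement(baseline_error, new_error):
    """Return the fraction by which new_error is lower than baseline_error."""
    if baseline_error <= 0:
        raise ValueError("baseline error must be positive")
    return (baseline_error - new_error) / baseline_error


if __name__ == "__main__":
    # Hypothetical gold scores and two hypothetical sets of predictions.
    gold = [0.10, 0.40, 0.75, 0.30]
    baseline_pred = [0.30, 0.20, 0.55, 0.50]
    extended_pred = [0.25, 0.25, 0.60, 0.45]

    base = mae(baseline_pred, gold)
    ext = mae(extended_pred, gold)
    print(f"baseline MAE: {base:.3f}")
    print(f"extended MAE: {ext:.3f}")
    print(f"relative improvement: {relative_improvement(base, ext):.1%}")
```

A lower MAE is better, so a positive relative improvement means the second system's scores are closer to the gold scores on average.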

https://hal.inria.fr/hal-00923623
Contributor : David Langlois
Submitted on : Wednesday, November 15, 2017 - 10:46:28 AM
Last modification on : Tuesday, December 18, 2018 - 4:38:02 PM
Document(s) archived on : Friday, February 16, 2018 - 2:02:34 PM

File

wmt2013_langlois_smaili_prepri...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00923623, version 1

Citation

David Langlois, Kamel Smaïli. LORIA System for the WMT13 Quality Estimation Shared Task. ACL 2013 - Eighth Workshop on Statistical Machine Translation, Aug 2013, Sofia, Bulgaria. pp.380 - 385. ⟨hal-00923623⟩
