Conference Papers, Year: 2013

LORIA System for the WMT13 Quality Estimation Shared Task

David Langlois, Kamel Smaïli

Abstract

In this paper we present the system we submitted to the WMT13 shared task on Quality Estimation. We participated in Task 1.1, in which each translated sentence is given a score between 0 and 1. The score is obtained from several numerical or boolean features computed from the source and target sentences. We perform a linear regression of the feature space against scores in the range [0, 1]; to this end, we use a Support Vector Machine with 66 features. We propose to increase the size of the training corpus by adding the post-edited and reference corpora to the training step, after assigning a score to each sentence of these corpora. We then tune these scores on a development corpus. This leads to an improvement of 10.5% on the development corpus in terms of Mean Absolute Error (MAE), but achieves only a slight improvement on the test corpus.
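To make the evaluation described in the abstract concrete, here is a minimal sketch of how an MAE comparison between two systems could be computed. It assumes the reported 10.5% is a relative reduction in MAE, which the abstract does not state explicitly. The function names and the toy scores are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: toy scores, not data or results from the paper.

def clip_score(score, low=0.0, high=1.0):
    """Clamp a predicted quality score to the task range [0, 1]."""
    return max(low, min(high, score))


def mean_absolute_error(predicted, gold):
    """Average absolute difference between predicted and gold scores."""
    if len(predicted) != len(gold) or not gold:
        raise ValueError("score lists must be non-empty and of equal length")
    return sum(abs(p - g) for p, g in zip(predicted, gold)) / len(gold)


def relative_improvement(baseline_mae, system_mae):
    """Relative MAE reduction in percent (assumed reading of 'improvement')."""
    return 100.0 * (baseline_mae - system_mae) / baseline_mae


# Hypothetical gold scores and predictions from two hypothetical systems.
gold = [0.20, 0.50, 0.80, 0.40]
baseline = [clip_score(s) for s in [0.40, 0.50, 0.50, 0.60]]
augmented = [clip_score(s) for s in [0.30, 0.45, 0.70, 0.45]]

baseline_mae = mean_absolute_error(baseline, gold)
augmented_mae = mean_absolute_error(augmented, gold)
gain = relative_improvement(baseline_mae, augmented_mae)
print(f"baseline MAE={baseline_mae:.3f}, augmented MAE={augmented_mae:.3f}, "
      f"relative improvement={gain:.1f}%")
```

Lower MAE is better, so a positive relative improvement means the second system's predictions are closer to the gold scores on average.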
Main file
wmt2013_langlois_smaili_preprint.pdf (154.81 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00923623, version 1 (15-11-2017)

Identifiers

  • HAL Id: hal-00923623, version 1

Cite

David Langlois, Kamel Smaïli. LORIA System for the WMT13 Quality Estimation Shared Task. ACL 2013 - Eighth Workshop on Statistical Machine Translation, Aug 2013, Sofia, Bulgaria. pp. 380-385. ⟨hal-00923623⟩