Phrase-Based Language Model in Statistical Machine Translation

Achraf Ben Romdhane; Salma Jamoussi; Abdelmajid Ben Hamadou; Kamel Smaïli

Article Dans Une Revue International Journal of Computational Linguistics and Applications Année : 2016

Phrase-Based Language Model in Statistical Machine Translation

(1) , (1) , (1) , (2)

1
2

Achraf Ben Romdhane

Fonction : Auteur
PersonId : 963709

Multimedia, InfoRmation systems and Advanced Computing Laboratory

Salma Jamoussi

Fonction : Auteur
PersonId : 757079
IdRef : 083230785

Multimedia, InfoRmation systems and Advanced Computing Laboratory

Abdelmajid Ben Hamadou

Fonction : Auteur
PersonId : 963711

Multimedia, InfoRmation systems and Advanced Computing Laboratory

Kamel Smaïli

Fonction : Auteur
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Statistical Machine Translation and Speech Modelization and Text

Résumé

As one of the most important modules in statistical machine translation (SMT), language model measures whether one translation hypothesis is more grammatically correct than other hypotheses. Currently the state-of-the-art SMT systems use standard word n-gram models, whereas the translation model is phrase-based. In this paper, the idea is to use a phrase-based language model. For that, target portion of the translation table are retrieved and used to rewrite the training corpus and to calculate a phrase n-gram language model. In this work, we perform experiments with two language models word-based (WBLM) and phrase-based (PBLM). The different SMT are trained with three optimization algorithms MERT, MIRA and PRO. Thus, the PBLM systems are compared to the baseline system in terms of BLUE and TER. The experimental results show that the use of a phrase-based language model in SMT can improve results and is especially able to reduce the error rate.

Mots clés

Machine Translation Phrases Phrase based language model Decoding optimization

Domaines

Informatique et langage [cs.CL]

Fichier principal

CICLING2016.pdf (206.82 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Kamel Smaïli : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01336485

Soumis le : jeudi 23 juin 2016-11:08:40

Dernière modification le : lundi 11 septembre 2023-17:41:19

Dates et versions

hal-01336485 , version 1 (23-06-2016)

Identifiants

HAL Id : hal-01336485 , version 1

Citer

Achraf Ben Romdhane, Salma Jamoussi, Abdelmajid Ben Hamadou, Kamel Smaïli. Phrase-Based Language Model in Statistical Machine Translation. International Journal of Computational Linguistics and Applications, 2016. ⟨hal-01336485⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE LORIA LORIA-NLPKD

220 Consultations

109 Téléchargements

Phrase-Based Language Model in Statistical Machine Translation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager