Phrase-based Machine Translation based on Text Mining and Statistical Language Modeling Techniques

Chiraz Latiri 1 Kamel Smaili 2 Caroline Lavecchia 3 Cyrine Nasri 2 David Langlois 2
2 SMarT - Statistical Machine Translation and Speech Modelization and Text
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
3 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we introduce two new methods dedicated to phrase based machine translation. Both are based on mining a parallel corpus in order to nd out the couples of linguistic units which are translation of each other. The presented methods do not rely on any alignment in contrast to what is done usually by the statistical machine translation community. Each of them proposes a complete translation table containing translations of single words and phrases. The rst method is inspired from the well-known trigger language model while the second one is inspired from the association rules mining technique. All experiments ar e conducted on a large part of EUROPARL corpus and highlight the utility of both proposed approaches.
Document type :
Journal articles
Liste complète des métadonnées

Cited literature [13 references]  Display  Hide  Download

https://hal.inria.fr/hal-01270957
Contributor : David Langlois <>
Submitted on : Monday, November 20, 2017 - 10:25:41 AM
Last modification on : Tuesday, February 12, 2019 - 11:38:13 AM
Document(s) archivé(s) le : Wednesday, February 21, 2018 - 12:41:28 PM

File

IJCLA-2011-1-2-pp-193-208.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01270957, version 1

Citation

Chiraz Latiri, Kamel Smaili, Caroline Lavecchia, Cyrine Nasri, David Langlois. Phrase-based Machine Translation based on Text Mining and Statistical Language Modeling Techniques . International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2011, 2 (1-2), pp.16. ⟨hal-01270957⟩

Share

Metrics

Record views

316

Files downloads

93