Phrase-based Machine Translation based on Text Mining and Statistical Language Modeling Techniques

Chiraz Latiri 1 Kamel Smaili 2 Caroline Lavecchia 3 Cyrine Nasri 2 David Langlois 2
2 SMarT - Statistical Machine Translation and Speech Modelization and Text
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
3 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we introduce two new methods dedicated to phrase based machine translation. Both are based on mining a parallel corpus in order to nd out the couples of linguistic units which are translation of each other. The presented methods do not rely on any alignment in contrast to what is done usually by the statistical machine translation community. Each of them proposes a complete translation table containing translations of single words and phrases. The rst method is inspired from the well-known trigger language model while the second one is inspired from the association rules mining technique. All experiments ar e conducted on a large part of EUROPARL corpus and highlight the utility of both proposed approaches.
Type de document :
Article dans une revue
International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2011, 2 (1-2), pp.16
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01270957
Contributeur : David Langlois <>
Soumis le : lundi 20 novembre 2017 - 10:25:41
Dernière modification le : jeudi 11 janvier 2018 - 06:27:18
Document(s) archivé(s) le : mercredi 21 février 2018 - 12:41:28

Fichier

IJCLA-2011-1-2-pp-193-208.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01270957, version 1

Citation

Chiraz Latiri, Kamel Smaili, Caroline Lavecchia, Cyrine Nasri, David Langlois. Phrase-based Machine Translation based on Text Mining and Statistical Language Modeling Techniques . International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2011, 2 (1-2), pp.16. 〈hal-01270957〉

Partager

Métriques

Consultations de la notice

281

Téléchargements de fichiers

30