Skip to Main content Skip to Navigation
Conference papers

Training phrase-based SMT without explicit word aligment

Cyrine Nasri 1 Kamel Smaïli 1 Chiraz Latiri 2 
1 SMarT - Statistical Machine Translation and Speech Modelization and Text
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 URPAH Tunis
URPAH - Unité de Recherche en Programmation Algorithmique et Heuristique
Abstract : The machine translation systems usually build an initial word-to-word alignment, before training the phrase translation pairs. This approach requires a lot of matching between different single words of both considered languages. In this paper, we propose a new approach for phrase-based machine translation which does not require any word alignment. This method is based on inter-lingual triggers retrieved by Multivariate Mutual Information. This algorithm segments sentences into phrases and fnds their alignments simultaneously. The main objective of this work is to build directly valid alignments between source and target phrases. The achieved results, in terms of performance are satisfactory and the obtained translation table is smaller than the reference one; this approach could be considered as an alternative to the classical methods.
Document type :
Conference papers
Complete list of metadata

Cited literature [15 references]  Display  Hide  Download
Contributor : Kamel Smaïli Connect in order to contact the contributor
Submitted on : Monday, September 22, 2014 - 6:06:40 PM
Last modification on : Saturday, October 16, 2021 - 11:26:09 AM


Files produced by the author(s)


  • HAL Id : hal-01067051, version 1



Cyrine Nasri, Kamel Smaïli, Chiraz Latiri. Training phrase-based SMT without explicit word aligment. 15th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2014, Kathmandu, Nepal. pp.233-241. ⟨hal-01067051⟩



Record views


Files downloads