Skip to Main content Skip to Navigation
Conference papers

Training phrase-based SMT without explicit word aligment

Cyrine Nasri 1 Kamel Smaïli 1 Chiraz Latiri 2
1 SMarT - Statistical Machine Translation and Speech Modelization and Text
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 URPAH Tunis
URPAH - Unité de Recherche en Programmation Algorithmique et Heuristique
Abstract : The machine translation systems usually build an initial word-to-word alignment, before training the phrase translation pairs. This approach requires a lot of matching between different single words of both considered languages. In this paper, we propose a new approach for phrase-based machine translation which does not require any word alignment. This method is based on inter-lingual triggers retrieved by Multivariate Mutual Information. This algorithm segments sentences into phrases and fnds their alignments simultaneously. The main objective of this work is to build directly valid alignments between source and target phrases. The achieved results, in terms of performance are satisfactory and the obtained translation table is smaller than the reference one; this approach could be considered as an alternative to the classical methods.
Document type :
Conference papers
Complete list of metadata

Cited literature [15 references]  Display  Hide  Download

https://hal.inria.fr/hal-01067051
Contributor : Kamel Smaïli <>
Submitted on : Monday, September 22, 2014 - 6:06:40 PM
Last modification on : Wednesday, October 28, 2020 - 2:20:03 PM

File

CICLING2014Nasri.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01067051, version 1

Collections

Citation

Cyrine Nasri, Kamel Smaïli, Chiraz Latiri. Training phrase-based SMT without explicit word aligment. 15th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2014, Kathmandu, Nepal. pp.233-241. ⟨hal-01067051⟩

Share

Metrics

Record views

408

Files downloads

33