W. J. Hutchins, The Georgetown-IBM Experiment Demonstrated in January 1954, Conference of the Association for Machine Translation in the Americas, pp.102-114, 2004.
DOI : 10.1007/978-3-540-30194-3_12

H. Somers, Example-based machine translation, Machine Translation, vol.14, issue.2, pp.113-157, 1999.
DOI : 10.1023/A:1008109312730

L. Dugast, J. Senellart, and P. Koehn, Statistical post-editing on systran's rule-based translation system, " in Proceedings of the Second Workshop on Statistical Machine Translation, ser. StatMT '07, pp.220-223, 2007.

P. F. Brown, V. J. Pietra, S. A. Pietra, and R. L. Mercer, The mathematics of statistical machine translation: Parameter estimation, Comput. Linguist, vol.19, issue.2, pp.263-311, 1993.

R. Zens, F. J. Och, and H. Ney, Phrase-Based Statistical Machine Translation, pp.18-32, 2002.
DOI : 10.1007/3-540-45751-8_2

N. Kalchbrenner and P. Blunsom, Recurrent continuous translation models, 2013.

I. Sutskever, O. Vinyals, and Q. V. Le, Sequence to sequence learning with neural networks, 1409.

K. Cho, B. Van-merrienboer, D. Bahdanau, and Y. Bengio, On the Properties of Neural Machine Translation: Encoder???Decoder Approaches, Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, 1259.
DOI : 10.3115/v1/W14-4012

Y. Wu, M. Schuster, Z. Chen, Q. V. Le, M. Norouzi et al., Google's neural machine translation system: Bridging the gap between human and machine translation

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

P. Isabelle, C. Cherry, and G. F. Foster, A challenge set approach to evaluating machine translation

P. Koehn and R. Knowles, Six Challenges for Neural Machine Translation, Proceedings of the First Workshop on Neural Machine Translation
DOI : 10.18653/v1/W17-3204

L. Bentivogli, A. Bisazza, M. Cettolo, and M. Federico, Neural versus Phrase-Based Machine Translation Quality: a Case Study, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, p.4631, 1608.
DOI : 10.18653/v1/D16-1025

P. Koehn, F. J. Och, and D. Marcu, Statistical phrase-based translation Available: https, Proceedings of the 2003 Conference of the North American Chapter ser. NAACL '03, pp.48-54, 2003.

J. L. Elman, Finding Structure in Time, Cognitive Science, vol.49, issue.2, pp.179-211, 1990.
DOI : 10.1007/BF00308682

S. Hochreiter and J. Schmidhuber, Long Short-Term Memory, Neural Computation, vol.4, issue.8, pp.1735-1780, 1997.
DOI : 10.1016/0893-6080(88)90007-X

K. Cho, B. Van-merrienboer, C. ¸. Gülçehre, F. Bougares, H. Schwenk et al., Learning phrase representations using RNN encoder-decoder for statistical machine translation Available: http://arxiv.org/abs/1406 Multiun: A multilingual corpus from united nation documents, Proceedings of the Seventh conference on International Language Resources and Evaluation European Language Resources Association (ELRA), pp.1078-2010, 1078.

M. A. Menacer, O. Mella, D. Fohr, D. Jouvet, D. Langlois et al., An enhanced automatic speech recognition system for Arabic, Proceedings of the Third Arabic Natural Language Processing Workshop, 2017.
DOI : 10.18653/v1/W17-1319

URL : https://hal.archives-ouvertes.fr/hal-01531588

N. Qian, On the momentum term in gradient descent learning algorithms, Neural Networks, vol.12, issue.1, pp.145-151, 1999.
DOI : 10.1016/S0893-6080(98)00116-6

S. Ruder, An overview of gradient descent optimization algorithms, 1609.

J. Duchi, E. Hazan, and Y. Singer, Adaptive subgradient methods for online learning and stochastic optimization, Journal of Machine Learning Research, vol.12, pp.2121-2159, 2011.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 1412.

G. Neubig, Neural machine translation and sequence-tosequence models: A tutorial

K. Papineni, S. Roukos, T. Ward, and W. Zhu, BLEU, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.311-318, 2002.
DOI : 10.3115/1073083.1073135

T. Luong, I. Sutskever, Q. V. Le, O. Vinyals, and W. Zaremba, Addressing the Rare Word Problem in Neural Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p.8206, 1410.
DOI : 10.3115/v1/P15-1002

P. Arthur, G. Neubig, and S. Nakamura, Incorporating Discrete Translation Lexicons into Neural Machine Translation, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
DOI : 10.18653/v1/D16-1162