, -decoder for statistical machine translation, EMNLP
A unified architecture for natural language processing, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008. ,
DOI : 10.1145/1390156.1390177
Language modeling with gated convolutional networks, ICML, 2017. ,
Latent alignment and variational attention. arXiv preprint, 2018. ,
Classical Structured Prediction Losses for Sequence to Sequence Learning, Proceedings of the 2018 Conference of the North American Chapter of
the Association for Computational Linguistics: Human Language
Technologies, Volume 1 (Long Papers), 2018. ,
DOI : 10.18653/v1/N18-1033
URL : https://doi.org/10.18653/v1/n18-1033
A Convolutional Encoder Model for Neural Machine Translation, Proceedings of the 55th Annual Meeting of the Association for
Computational Linguistics (Volume 1: Long Papers), 2017. ,
DOI : 10.18653/v1/P17-1012
Convolutional sequence to sequence learning, ICML, 2017. ,
Sequence transduction with recurrent neural networks, 2012. ,
Long Short-Term Memory, Neural Computation, vol.4, issue.8, pp.1735-1780, 1997. ,
DOI : 10.1016/0893-6080(88)90007-X
Densely Connected Convolutional Networks, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. ,
DOI : 10.1109/CVPR.2017.243
URL : http://arxiv.org/pdf/1608.06993
Towards neural phrase-based machine translation, ICLR, 2018. ,
Batch normalization: Accelerating deep network training by reducing internal covariate shift, ICML, 2015. ,
On Using Very Large Target Vocabulary for Neural Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015. ,
DOI : 10.3115/v1/P15-1001
Grid long short-term memory, ICLR, 2016. ,
, 2016b. Neural machine translation in linear time. arXiv
A Convolutional Neural Network for Modelling Sentences, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2014. ,
DOI : 10.3115/v1/P14-1062
Convolutional Neural Networks for Sentence Classification, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014. ,
DOI : 10.3115/v1/D14-1181
Adam: A method for stochastic optimization, ICLR, 2015. ,
Moses, Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ACL '07, 2007. ,
DOI : 10.3115/1557769.1557821
Deep learning, Nature, vol.9, issue.7553, pp.436-444, 2015. ,
DOI : 10.1007/s10994-013-5335-x
A structured selfattentive sentence embedding, 2017. ,
Effective Approaches to Attention-based Neural Machine Translation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015. ,
DOI : 10.18653/v1/D15-1166
URL : https://doi.org/10.18653/v1/d15-1166
Encoding Source Language with Convolutional Neural Network for Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015. ,
DOI : 10.3115/v1/P15-1003
URL : https://doi.org/10.3115/v1/p15-1003
Rectified linear units improve restricted Boltzmann machines, ICML, 2010. ,
Wavenet: a generative model for raw audio, ISCA Speech Syntesis Workshop, 2016. ,
, 2016b. Pixel recurrent neural networks. In ICML
Conditional image generation with PixelCNN decoders, NIPS, 2016. ,
BLEU, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, 2002. ,
DOI : 10.3115/1073083.1073135
A Decomposable Attention Model for Natural Language Inference, Proceedings of the 2016 Conference on Empirical Methods in Natural
Language Processing, 2016. ,
DOI : 10.18653/v1/D16-1244
Automatic differentiation in pytorch, NIPS-W, 2017. ,
Sequence level training with recurrent neural networks, 2016. ,
, , 2017.
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications, 2017. ,
Bidirectional recurrent neural networks, IEEE Transactions on Signal Processing, vol.45, issue.11, pp.2673-2681, 1997. ,
DOI : 10.1109/78.650093
Neural Machine Translation of Rare Words with Subword Units, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016. ,
DOI : 10.18653/v1/P16-1162
Dropout: A simple way to prevent neural networks from overfitting, 2014. ,
Sequence to sequence learning with neural networks, NIPS, 2014. ,
Attention is all you need, NIPS, 2017. ,
Sequence modeling via segmentations, ICML, 2017. ,
,
Adversarial neural machine translation. arXiv, 2017. ,
2015. Show, attend and tell: Neural image caption generation with visual attention, ICML ,
URL : https://hal.archives-ouvertes.fr/hal-01466414