Neural machine translation by jointly learning to align and translate, ICLR, 2015. ,
An actor-critic algorithm for sequence prediction, ICLR, 2017. ,
Training with Exploration Improves a Greedy Stack LSTM Parser, Proceedings of the 2016 Conference on Empirical Methods in Natural
Language Processing, 2016. ,
DOI : 10.18653/v1/D16-1211
Learning Reductions That Really Work, Proceedings of the IEEE, pp.136-147, 2016. ,
DOI : 10.1109/JPROC.2015.2494118
Learning to search better than your teacher, ICML, 2015. ,
Learning Phrase Representations using RNN Encoder???Decoder for Statistical Machine Translation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014. ,
DOI : 10.3115/v1/D14-1179
URL : https://hal.archives-ouvertes.fr/hal-01433235
Natural language processing (almost) from scratch, Journal of Machine Learning Research, vol.12, pp.2493-2537, 2011. ,
Learning as search optimization, Proceedings of the 22nd international conference on Machine learning , ICML '05, 2005. ,
DOI : 10.1145/1102351.1102373
Search-based structured prediction, Machine Learning, 2009. ,
DOI : 10.1007/s10994-009-5106-x
A dynamic oracle for arc-eager dependency parsing, 2012. ,
Deep Learning, 2016. ,
Long Short-Term Memory, Neural Computation, vol.4, issue.8, 1997. ,
DOI : 10.1016/0893-6080(88)90007-X
On Using Very Large Target Vocabulary for Neural Machine Translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015. ,
DOI : 10.3115/v1/P15-1001
Effective Approaches to Attention-based Neural Machine Translation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015. ,
DOI : 10.18653/v1/D15-1166
URL : http://aclweb.org/anthology/D/D15/D15-1166.pdf
Entropy and Margin Maximization for Structured Output Learning, ECML PKDD, 2010. ,
DOI : 10.1007/978-3-642-15939-8_6
URL : http://www.pletscher.org/papers/pletscher2010maxentmarg.pdf
Sequence level training with recurrent neural networks, 2016. ,
Self-Critical Sequence Training for Image Captioning, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
DOI : 10.1109/CVPR.2017.131
Reinforcement and imitation learning via interactive no-regret learning, 2014. ,
Dropout: a simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, pp.1929-1958, 2014. ,
Deeply aggrevated: Differentiable imitation learning for sequential prediction, 2017. ,
Sequence to sequence learning with neural networks, 2014. ,
Max-margin Markov networks, NIPS, 2003. ,
Introduction to the CoNLL-2000 shared task: Chunking, CoNLL, 2000. ,
Large margin methods for structured and interdependent output variables, JMLR, 2005. ,
Show and tell: A neural image caption generator, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7298935
Sequence-to-Sequence Learning as Beam-Search Optimization, Proceedings of the 2016 Conference on Empirical Methods in Natural
Language Processing, 2016. ,
DOI : 10.18653/v1/D16-1137
URL : http://arxiv.org/abs/1606.02960