An actor-critic algorithm for sequence prediction, 2017. ,
Training with exploration improves a greedy stack-LSTM parser, EMNLP, 2016. ,
Scheduled sampling for sequence prediction with recurrent neural networks, NIPS, 2015. ,
Learning reductions that really work, Proceedings of the IEEE, 2016. ,
DOI : 10.1109/jproc.2015.2494118
URL : http://arxiv.org/pdf/1502.02704
Report on the 11th IWSLT evaluation campaign, Proceedings of IWSLT, 2014. ,
Learning to search better than your teacher, ICML, 2015. ,
Learning phrase representations using RNN encoder-decoder for statistical machine translation, EMNLP, 2014. ,
DOI : 10.3115/v1/d14-1179
URL : https://hal.archives-ouvertes.fr/hal-01433235
Learning as search optimization: approximate large margin methods for structured prediction, ICML, 2005. ,
Search-based structured prediction, Machine Learning, 2009. ,
Softmax-margin CRFs: Training loglinear models with cost functions, NAACL, 2010. ,
A dynamic oracle for arc-eager dependency parsing, 2012. ,
Deep Learning, 2016. ,
Noise reduction and targeted exploration in imitation learning for abstract meaning representation parsing, ACL, 2016. ,
A primal-dual message-passing algorithm for approximated large scale structured prediction, NIPS, 2010. ,
Long short-term memory, Neural Computation, 1997. ,
On using very large target vocabulary for neural machine translation, ACL, 2015. ,
DOI : 10.3115/v1/p15-1001
URL : https://doi.org/10.3115/v1/p15-1001
A method for stochastic optimization, ICLR, 2015. ,
Lower bounds for reductions, Talk at the Atomic Learning Workshop (TTI-C), 2006. ,
Multicategory support vector machines: Theory and application to the classification of microarray data and satellite radiance data, Journal of the American Statistical Association, 2004. ,
Reward augmented maximum likelihood for neural structured prediction, NIPS, 2016. ,
Bleu: a method for automatic evaluation of machine translation, ACL, 2002. ,
Entropy and margin maximization for structured output learning, ECML PKDD, 2010. ,
DOI : 10.1007/978-3-642-15939-8_6
URL : https://link.springer.com/content/pdf/10.1007%2F978-3-642-15939-8_6.pdf
Sequence level training with recurrent neural networks, In ICLR, 2016. ,
Self-critical sequence training for image captioning, 2016. ,
DOI : 10.1109/cvpr.2017.131
URL : http://arxiv.org/pdf/1612.00563
Reinforcement and imitation learning via interactive no-regret learning, 2014. ,
Minimum risk training for neural machine translation, 2016. ,
DOI : 10.18653/v1/p16-1159
URL : https://doi.org/10.18653/v1/p16-1159
Deeply AggreVaTeD: Differentiable imitation learning for sequential prediction, 2017. ,
Sequence to sequence learning with neural networks, NIPS, 2014. ,
Rethinking the inception architecture for computer vision, CVPR, 2016. ,
Max-margin Markov networks, NIPS, 2003. Ioannis Tsochantaridis, Thorsten Joachims, Thomas Hofmann, and Yasemin Altun. Large margin methods for structured and interdependent output variables. JMLR, 2005. ,
Show and tell: A neural image caption generator, CVPR, 2015. ,
Sequence-to-sequence learning as beam-search optimization, EMNLP, 2016. ,