CIDEr: Consensus-based image description evaluation, pp.4566-4575, 2015.
Show and tell: A neural image caption generator, pp.3156-3164, 2015.
Automatic image semantic interpretation using social action and tagging data, Multimedia Tools & Applications, vol.51, pp.213-246, 2011.
Bridging the Semantic Gap Between Image Contents and Tags, IEEE Transactions on Multimedia, vol.12, pp.462-473, 2010.
Visualizing and Understanding Convolutional Networks, vol.8689, pp.818-833, 2013.
Evaluating the application of semantic inferencing rules to image annotation, International Conference on Knowledge Capture ACM, pp.91-98, 2005.
Deep visual-semantic alignments for generating image descriptions, IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.39, pp.664-676, 2017.
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, Computer Science, pp.2048-2057, 2015.
ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, vol.115, pp.211-252, 2015.
Very Deep Convolutional Networks for Large-Scale Image Recognition, Computer Science, 2014.
Guiding the Long-Short Term Memory Model for Image Caption Generation, IEEE International Conference on Computer Vision, pp.2407-2415, 2016.
BLEU: A Method for Automatic Evaluation of Machine Translation, Proc. ACL, 2002.