S. Andrews, I. Tsochantaridis, and T. Hofmann, Support vector machines for multiple-instance learning, p.NIPS, 2003.

R. Arandjelovi´carandjelovi´c and A. Zisserman, All about VLAD, p.CVPR, 2013.

C. Baecchi, F. Turchini, L. Seidenari, A. D. Bagdanov, and A. D. Bimbo, Fisher Vectors over Random Density Forests for Object Recognition, 2014 22nd International Conference on Pattern Recognition, p.ICPR, 2014.
DOI : 10.1109/ICPR.2014.712

K. Brki´cbrki´c, A. Pinz, S. Segvi´csegvi´c, and Z. Kalafati´ckalafati´c, Histogram-based description of local space-time appearance, pp.206-217, 2011.

R. Cinbis, J. Verbeek, and C. Schmid, Segmentation Driven Object Detection with Fisher Vectors, 2013 IEEE International Conference on Computer Vision, p.ICCV, 2013.
DOI : 10.1109/ICCV.2013.369

URL : https://hal.archives-ouvertes.fr/hal-00873134

R. Cinbis, J. Verbeek, and C. Schmid, Multi-fold MIL Training for Weakly Supervised Object Localization, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2014.
DOI : 10.1109/CVPR.2014.309

URL : https://hal.archives-ouvertes.fr/hal-00975746

E. J. Crowley and A. Zisserman, Of Gods and Goats: Weakly Supervised Learning of Figurative Art, Procedings of the British Machine Vision Conference 2013, p.BMVC, 2013.
DOI : 10.5244/C.27.39

G. Csurka, C. Bray, C. Dance, and L. Fan, Visual categorization with bags of keypoints, Workshop on Statistical Learning in Computer Vision, ECCV, 2004.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), p.CVPR, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), p.CVPR, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

T. Deselaers, B. Alexe, and V. Ferrari, Weakly Supervised Localization and Learning with Generic Knowledge, International Journal of Computer Vision, vol.73, issue.2, pp.275-293, 2012.
DOI : 10.1007/s11263-012-0538-3

P. Dollár, S. Belongie, and P. Perona, The Fastest Pedestrian Detector in the West, Procedings of the British Machine Vision Conference 2010, p.BMVC, 2010.
DOI : 10.5244/C.24.68

M. Douze and H. Jégou, The Yael Library, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654892

URL : https://hal.archives-ouvertes.fr/hal-01020695

M. Everingham, L. Gool, C. K. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, pp.303-338, 2010.
DOI : 10.1007/s11263-009-0275-4

P. Felzenszwalb, R. Girshick, D. Mcallester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1627-1645, 2010.
DOI : 10.1109/TPAMI.2009.167

B. Fernando, E. Fromont, and T. Tuytelaars, Mining Mid-level Features for Image Classification, International Journal of Computer Vision, vol.7724, issue.3, pp.186-203, 2014.
DOI : 10.1007/s11263-014-0700-1

URL : https://hal.archives-ouvertes.fr/hal-00968299

C. Galleguillos, B. Babenko, A. Rabinovich, and S. J. Belongie, Weakly Supervised Object Localization with Stable Segmentations, pp.193-207, 2008.
DOI : 10.1007/978-3-540-88682-2_16

H. Jégou, M. Douze, and C. Schmid, On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2009.
DOI : 10.1109/CVPR.2009.5206609

R. Jenatton, J. Mairal, G. Obozinski, and F. R. Bach, Proximal methods for hierarchical sparse coding, Journal of Machine Learning Research, vol.12, pp.2297-2334, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00516723

J. Krapac, S. J. Segvi´csegvi´c, and S. Segvi´csegvi´c, Fast Approximate GMM Soft-Assign for Fine-Grained Image Classification with Large Fisher Vectors Weakly supervised object localization with large Fisher vectors, 2015.

J. Krapac, J. Verbeek, and F. Jurie, Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, p.ICCV, 2011.
DOI : 10.1109/ICCV.2011.6126406

URL : https://hal.archives-ouvertes.fr/inria-00612277

C. H. Lampert, M. B. Blaschko, and T. Hofmann, Efficient Subwindow Search: A Branch and Bound Framework for Object Localization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.12, 2009.
DOI : 10.1109/TPAMI.2009.144

D. Liu, G. Hua, P. A. Viola, and T. Chen, Integrated feature selection and higherorder spatial feature extraction for object categorization, p.CVPR, 2008.

J. Mairal, F. Bach, J. Ponce, and G. Sapiro, Online learning for matrix factorization and sparse coding, J. Mach. Learn. Res, vol.11, pp.19-60, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00408716

O. Maron and T. Lozano-pérez, A framework for multiple-instance learning, pp.570-576, 1997.

M. Mathias, R. Timofte, R. Benenson, and L. J. Gool, Traffic sign recognition how far are we from the solution? In: IJCNN, pp.1-8, 2013.

. Mobileye, Traffic Sign Detection, [Online] Available: http://www.mobileye, pp.2015-2022

K. Murphy, Machine learning a probabilistic perspective, 2012.

M. H. Nguyen, L. Torresani, F. De-la-torre, and C. Rother, Learning discriminative localization from weakly labeled data, Pattern Recognition, vol.47, issue.3, pp.1523-1534, 2014.

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, pp.143-156, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek, Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013.
DOI : 10.1007/s11263-013-0636-x

K. Simonyan, A. Vedaldi, and A. Zisserman, Deep fisher networks for large-scale image classification, pp.163-171, 2013.

S. Singh, A. Gupta, and A. A. Efros, Unsupervised Discovery of Mid-Level Discriminative Patches, pp.73-86, 2012.
DOI : 10.1007/978-3-642-33709-3_6

P. Siva, T. Xiang, P. A. Viola, M. J. Jones, B. Cremilleux et al., Weakly supervised object detector learning with model drift detection Robust real-time face detection Histograms of pattern sets for image classification and object recognition Exploiting temporal and spatial constraints in traffic sign detection from a moving vehicle, ternational Journal of Computer Vision CVPR (2014) 38. ? Segvi´cSegvi´c, pp.137-154, 2004.

C. Weng and J. Yuan, Efficient Mining of Optimal AND/OR Patterns for Visual Recognition, IEEE Transactions on Multimedia, vol.17, issue.5, pp.626-635, 2015.
DOI : 10.1109/TMM.2015.2414720

Y. Yang and S. Newsam, Spatial pyramid co-occurrence for image classification, 2011 International Conference on Computer Vision, p.ICCV, 2011.
DOI : 10.1109/ICCV.2011.6126403

J. Yuan, Y. Wu, and M. Yang, Discovery of Collocation Patterns: from Visual Words to Visual Phrases, 2007 IEEE Conference on Computer Vision and Pattern Recognition, p.CVPR, 2007.
DOI : 10.1109/CVPR.2007.383222

M. Yuan and Y. Lin, Model selection and estimation in regression with grouped variables, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.58, issue.1, pp.49-67, 2006.
DOI : 10.1198/016214502753479356