B. Alexe, T. Deselaers, and V. Ferrari, ClassCut for Unsupervised Class Segmentation, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_28

S. Andrews, I. Tsochantaridis, and T. Hofmann, Support vector machines for multiple-instance learning, NIPS, 2003.

F. Bach, R. Jenatton, J. Mairal, and G. Obozinski, Optimization with Sparsity-Inducing Penalties, Machine Learning, pp.1-106, 2012.
DOI : 10.1561/2200000015
URL : https://hal.archives-ouvertes.fr/hal-00613125

O. Barinova, V. Lempitsky, and P. Kohli, On detection of multiple object instances using hough transforms, IEEE TPAMI, 2012.

S. P. Boyd and L. Vandenberghe, Convex Optimization, 2004.

X. Chen, A. Shrivastava, and A. Gupta, NEIL: Extracting Visual Knowledge from Web Data, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.178

Y. Chen, H. Shioi, C. Montesinos, . Fuentes, L. P. Koh et al., Active detection via adaptive submodularity, ICML, 2014.

O. Chum and A. Zisserman, An Exemplar Model for Learning Object Classes, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383050

D. Crandall and D. Huttenlocher, Weakly Supervised Learning of Part-Based Spatial Models for Visual Object Recognition, ECCV, 2006.
DOI : 10.1007/11744023_2

T. Darrell, S. Sclaroff, and A. Pentland, Segmentation by minimal description, [1990] Proceedings Third International Conference on Computer Vision, 1990.
DOI : 10.1109/ICCV.1990.139506

T. Deselaers, B. Alex, and V. Ferrari, Localizing Objects While Learning Their Appearance, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_33

T. Deselaers, B. Alex, and V. Ferrari, Weakly Supervised Localization and Learning with Generic Knowledge, International Journal of Computer Vision, vol.73, issue.2, p.2012
DOI : 10.1007/s11263-012-0538-3

C. Doersch, S. Singh, A. Gupta, J. Sivic, and A. Efros, What makes paris look like paris, SIGGRAPH, 2012.
DOI : 10.1145/2185520.2185597
URL : https://hal.archives-ouvertes.fr/hal-01053876

C. Doersch, A. Gupta, and A. Efros, Mid-level visual element discovery as discriminative mode seeking, NIPS, 2013.

J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang et al., DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition, ICML, 2014.

I. Endres, K. Shih, and D. Hoeim, Learning Collections of Part Models for Object Recognition, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.126

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, 2007.
DOI : 10.1007/s11263-009-0275-4

P. F. Felzenszwalb, R. B. Girshick, D. Mcallester, and D. And-ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2010.
DOI : 10.1109/TPAMI.2009.167

R. Fergus, P. Perona, and A. Zisserman, Weakly supervised scale-invariant learning of models for visual recognition. IJCV, 2007.

K. Fukunaga and L. Hostetler, The estimation of the gradient of a density function, with applications in pattern recognition. Information Theory, 1975.

C. Galleguillos, B. Babenko, A. Rabinovich, and S. Belongie, Weakly Supervised Object Localization with Stable Segmentations, ECCV, 2008.
DOI : 10.1007/978-3-540-88682-2_16

R. Girshick, J. Donahue, T. Darrell, M. , and J. , Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.81

A. Joulin and F. Bach, A convex relaxation for weakly supervised classifiers, ICML, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00717450

A. Joulin, F. Bach, and J. Ponce, Discriminative clustering for image co-segmentation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539868

M. Juneja, A. Vedaldi, V. Jawahar, and A. Zisserman, Blocks That Shout: Distinctive Parts for Scene Classification, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.124

G. Kim, E. P. Xing, L. Fei-fei, and T. Kanade, Distributed cosegmentation via submodular optimization on anisotropic diffusion, ICCV, 2011.

P. Kumar, B. Packer, and D. Koller, Modeling latent variable uncertainty for loss-based learning, ICML, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00773605

B. Leibe, A. Leonardis, and B. Schiele, Combined object categorization and segmentation with an implicit chape model, ECCVW, 2004.

Y. Li, I. Tsang, J. Kwok, and Z. , Convex and scalable weakly labeled svms, ICML, 2013.

P. M. Long and L. Tan, PAC learning axis-aligned rectangles with respect to product distributions from multiple-instance examples, Proceedings of the ninth annual conference on Computational learning theory , COLT '96, 1996.
DOI : 10.1145/238061.238105

K. Micolajczyk, G. Leibe, and B. Schiele, Multiple Object Class Detection with a Generative Model, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.202

Y. Nesterov, Smooth minimization of non-smooth functions, Mathematical Programming, vol.269, issue.1, 2005.
DOI : 10.1007/s10107-004-0552-5

J. Nocedal and S. Wright, Numerical Optimization, 1999.
DOI : 10.1007/b98874

M. Pandey and S. Lazebnik, Scene recognition and weakly supervised object localization with deformable part-based models, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126383

M. Raptis, I. Kokkinos, and S. Soatto, Discovering discriminative action parts from mid-level video representations, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247807
URL : https://hal.archives-ouvertes.fr/hal-00918807

C. Rother, T. Minka, A. Blake, and V. Kolmogorov, Cosegmentation of Image Pairs by Histogram Matching - Incorporating a Global Constraint into MRFs, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.91

O. Russakovsky, Y. Lin, K. Yu, F. Fei, and L. , Object-Centric Spatial Pooling for Image Classification, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_1

S. Singh, A. Gupta, and A. Efros, Unsupervised Discovery of Mid-Level Discriminative Patches, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_6

P. Siva and T. Xiang, Weakly supervised object detector learning with model drift detection, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126261

P. Siva, C. Russell, and T. Xiang, In Defence of Negative Mining for Annotating Weakly Labelled Data, ECCV, 2012.
DOI : 10.1007/978-3-642-33712-3_43

J. Uijlings, K. Van-de-sande, T. Gevers, and A. Smeulders, Selective Search for Object Recognition, IJCV, 2013.
DOI : 10.1007/s11263-013-0620-5

M. Weber, M. Welling, and P. Perona, Towards automatic discovery of object categories, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), 2000.
DOI : 10.1109/CVPR.2000.854754

M. Weber, M. Welling, and P. Perona, Unsupervised Learning of Models for Recognition, ECCV, 2000.
DOI : 10.1007/3-540-45054-8_2

L. Wolsey, An analysis of the greedy algorithm for the submodular set covering problem, Combinatorica, vol.7, issue.3, pp.385-393, 1982.
DOI : 10.1007/BF02579435

C. N. Yu and T. Joachims, Learning structural SVMs with latent variables, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, 2009.
DOI : 10.1145/1553374.1553523

A. L. Yuille and A. Rangarajan, The Concave-Convex Procedure, Neural Computation, vol.39, issue.4, pp.915-936, 2003.
DOI : 10.1162/08997660260028674