Joint 3d estimation of objects and scene layout, NIPS, 2011. ,
Learning the structure of deep sparse graphical models, AISTATS, 2010. ,
What is an object?, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540226
Measuring the Objectness of Image Windows, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.11, pp.2189-2202, 2012. ,
DOI : 10.1109/TPAMI.2012.28
Three things everyone should know to improve object retrieval, CVPR, 2012. ,
NetVLAD: CNN architecture for weakly supervised place recognition Arxiv preprint, 2015. ,
Higher order potentials in end-to-end trainable conditional random fields, 2015. ,
Detecting and sketching the common, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540233
Learning to rank with (a lot of) word features, Information Retrieval, vol.22, issue.1, pp.291-314, 2010. ,
DOI : 10.1007/s10791-009-9117-9
Matching words and pictures, JMLR, vol.3, pp.1107-1135, 2003. ,
Multi-modal Clustering for Multimedia Collections, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007. ,
DOI : 10.1109/CVPR.2007.383223
A Survey on Metric Learning for Feature Vectors and Structured Data. ArXiv e-prints, 1306. ,
Label embedding trees for large multi-class tasks, NIPS, 2011. ,
Animals on the Web, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006. ,
DOI : 10.1109/CVPR.2006.57
Names and faces in the news, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2004. ,
DOI : 10.1109/CVPR.2004.1315253
Weakly Supervised Deep Detection Networks, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
DOI : 10.1109/CVPR.2016.311
Weakly supervised object detection with posterior regularization, BMVC, 2014. ,
Pattern recognition and machine learning, 2006. ,
Large-scale machine learning with stochastic gradient descent, COMPSTAT, 2010. ,
Learning tree conditional random fields, ICML, 2010. ,
Visual Recognition with Humans in the Loop, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15561-1_32
Object Segmentation by Long Term Analysis of Point Trajectories, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15555-0_21
Supervised Learning of Semantic Classes for Image Annotation and Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.3, pp.394-410, 2007. ,
DOI : 10.1109/TPAMI.2007.61
The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, 2011. ,
DOI : 10.5244/C.25.76
On-the-fly learning for visual search of large-scale image and video datasets, International Journal of Multimedia Information Retrieval, vol.38, issue.2, 2015. ,
DOI : 10.1007/s13735-015-0077-0
Efficient Maximum Appearance Search for Large-Scale Object Detection, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013. ,
DOI : 10.1109/CVPR.2013.410
NEIL: Extracting Visual Knowledge from Web Data, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.178
Unsupervised object discovery and localization in the wild, CVPR, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01110036
Exploiting hierarchical context on a large database of object categories, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540221
Learning a Similarity Metric Discriminatively, with Application to Face Verification, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005. ,
DOI : 10.1109/CVPR.2005.202
Approximating discrete probability distributions with dependence trees, IEEE Transactions on Information Theory, vol.14, issue.3, pp.462-467, 1968. ,
DOI : 10.1109/TIT.1968.1054142
An Exemplar Model for Learning Object Classes, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007. ,
DOI : 10.1109/CVPR.2007.383050
Deep filter banks for texture recognition and segmentation, CVPR, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01263622
Unsupervised metric learning for face identification in TV video, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126415
URL : https://hal.archives-ouvertes.fr/inria-00611682
Image categorization using Fisher kernels of non-iid image models, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012. ,
DOI : 10.1109/CVPR.2012.6247926
URL : https://hal.archives-ouvertes.fr/hal-00685943
Segmentation Driven Object Detection with Fisher Vectors, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.369
URL : https://hal.archives-ouvertes.fr/hal-00873134
Multi-fold MIL Training for Weakly Supervised Object Localization, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.309
URL : https://hal.archives-ouvertes.fr/hal-00975746
Approximate Fisher Kernels of Non-iid Image Models for Image Categorization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.6, 2016. ,
DOI : 10.1109/TPAMI.2015.2484342
URL : https://hal.archives-ouvertes.fr/hal-01211201
Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.1, 2016. ,
DOI : 10.1109/TPAMI.2016.2535231
URL : https://hal.archives-ouvertes.fr/hal-01123482
Support-vector networks, Machine Learning, pp.273-297, 1995. ,
DOI : 10.1007/BF00994018
Weakly Supervised Learning of Part-Based Spatial Models for Visual Object Recognition, ECCV, 2006. ,
DOI : 10.1007/11744023_2
An Efficient Approach to Semantic Segmentation, International Journal of Computer Vision, vol.60, issue.2, pp.198-212, 2011. ,
DOI : 10.1007/s11263-010-0344-8
Visual categorization with bags of keypoints, ECCV Int. Workshop on Stat. Learning in Computer Vision, 2004. ,
Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005. ,
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512
Information-theoretic metric learning, Proceedings of the 24th international conference on Machine learning, ICML '07, 2007. ,
DOI : 10.1145/1273496.1273523
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.114.4476
Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, issue.1, pp.1-38, 1977. ,
Imagenet: A large-scale hierarchical image database, CVPR, 2009. ,
What Does Classifying More Than 10,000 Image Categories Tell Us?, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15555-0_6
Localizing Objects While Learning Their Appearance, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15561-1_33
Weakly Supervised Localization and Learning with Generic Knowledge, International Journal of Computer Vision, vol.73, issue.2, pp.257-293, 2012. ,
DOI : 10.1007/s11263-012-0538-3
Solving the multiple instance problem with axis-parallel rectangles, Artificial Intelligence, vol.89, issue.1-2, pp.31-71, 1997. ,
DOI : 10.1016/S0004-3702(96)00034-3
Unsupervised Visual Representation Learning by Context Prediction, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.167
Automatic annotation of human actions in video, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459279
Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video, Procedings of the British Machine Vision Conference 2006, 2006. ,
DOI : 10.5244/C.20.92
Taking the bite out of automated naming of characters in TV video, Image and Vision Computing, vol.27, issue.5, pp.545-559, 2009. ,
DOI : 10.1016/j.imavis.2008.04.018
The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, pp.303-338, 2010. ,
DOI : 10.1007/s11263-009-0275-4
Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2010. ,
DOI : 10.1109/TPAMI.2009.167
Multiple Bernoulli relevance models for image and video annotation, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2004. ,
DOI : 10.1109/CVPR.2004.1315274
A Visual Category Filter for Google Images, ECCV, 2004. ,
DOI : 10.1007/978-3-540-24670-1_19
Learning object categories from Google's image search, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005. ,
DOI : 10.1109/ICCV.2005.142
Modeling video evolution for action recognition, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7299176
DeViSE: A deep visual-semantic embedding model, NIPS, 2013. ,
Discriminative learning of relaxed hierarchy for largescale visual recognition, ICCV, 2011. ,
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.81
Metric learning by collapsing classes, NIPS, 2006. ,
Generative adversarial nets, NIPS, 2014. ,
A Discriminative Kernel-Based Approach to Rank Images from Text Queries, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.8, pp.1371-1384, 2008. ,
DOI : 10.1109/TPAMI.2007.70791
Draw: A recurrent neural network for image generation view publication, icml, 2015. ,
Multi-component Models for Object Detection, ECCV, 2012. ,
DOI : 10.1007/978-3-642-33765-9_32
Automatic face naming with caption-based supervision, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008. ,
DOI : 10.1109/CVPR.2008.4587603
URL : https://hal.archives-ouvertes.fr/inria-00321048
TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459266
URL : https://hal.archives-ouvertes.fr/inria-00439276
Is that you? Metric learning approaches for face identification, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459197
URL : https://hal.archives-ouvertes.fr/inria-00439290
Multimodal semi-supervised learning for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540120
URL : https://hal.archives-ouvertes.fr/inria-00548640
Multiple Instance Metric Learning from Automatically Labeled Bags of Faces, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15549-9_46
URL : https://hal.archives-ouvertes.fr/inria-00548639
Face Recognition from Caption-Based Supervision, International Journal of Computer Vision, vol.57, issue.2, pp.64-82, 2012. ,
DOI : 10.1007/s11263-011-0447-x
URL : https://hal.archives-ouvertes.fr/inria-00522185
IM2GPS: estimating geographic information from a single image, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008. ,
DOI : 10.1109/CVPR.2008.4587784
Spatial pyramid pooling in deep convolutional networks for visual recognition, ECCV, 2014. ,
The "wake-sleep" algorithm for unsupervised neural networks, Science, vol.268, issue.5214, pp.1158-1161, 1995. ,
DOI : 10.1126/science.7761831
Putting Objects in Perspective, International Journal of Computer Vision, vol.57, issue.2, pp.3-15, 2008. ,
DOI : 10.1007/s11263-008-0137-5
Learning visual groups from co-occurrences in space and time, ICLR, 2016. ,
Exploiting generative models in discriminative classifiers, NIPS, 1999. ,
On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009. ,
DOI : 10.1109/CVPR.2009.5206609
Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, pp.1704-1716, 2012. ,
DOI : 10.1109/TPAMI.2011.235
Automatic image annotation and retrieval using cross-media relevance models, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , SIGIR '03, 2003. ,
DOI : 10.1145/860435.860459
THUMOS challenge: Action recognition with a large number of classes, 2014. ,
An Introduction to Variational Methods for Graphical Models, Machine Learning, pp.183-233, 1999. ,
DOI : 10.1007/978-94-011-5014-9_5
Discriminative clustering for image cosegmentation, CVPR, 2010. ,
Efficient Image and Video Co-localization with Frank-Wolfe Algorithm, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10599-4_17
Color attributes for object detection, CVPR, 2012. ,
Unifying visual-semantic embeddings with multimodal neural language models, 2015. ,
Dirichlet-Based Histogram Feature Transform for Image Classification, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.413
Robust Higher Order Potentials for Enforcing Label Consistency, International Journal of Computer Vision, vol.24, issue.3, pp.302-324, 2009. ,
DOI : 10.1007/s11263-008-0202-0
Large scale metric learning from equivalence constraints, CVPR, 2012. ,
Efficient inference in fully connected crfs with gaussian edge potentials, NIPS, 2011. ,
Improving web image search results using query-relative classifiers, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540092
URL : https://hal.archives-ouvertes.fr/inria-00548636
Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126406
URL : https://hal.archives-ouvertes.fr/inria-00612277
Imagenet classification with deep convolutional neural networks, NIPS, 2012. URL http ,
Structured learning with approximate inference, NIPS, 2008. ,
Metric Learning: A Survey, Machine Learning, pp.287-364, 2012. ,
DOI : 10.1561/2200000019
Learning the Structure of Deep Architectures Using L1 Regularization, Procedings of the British Machine Vision Conference 2015, 2015. ,
DOI : 10.5244/C.29.23
URL : https://hal.archives-ouvertes.fr/hal-01266462
What, where & how many? combining object detectors and crfs, ECCV, 2010. ,
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.12, pp.31-2129, 2009. ,
DOI : 10.1109/TPAMI.2009.144
Learning to detect unseen object classes by between-class attribute transfer, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009. ,
DOI : 10.1109/CVPR.2009.5206594
Retrieving actions in movies, 2007 IEEE 11th International Conference on Computer Vision, 2007. ,
DOI : 10.1109/ICCV.2007.4409105
A model for learning the semantics of pictures, NIPS, 2003. ,
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006. ,
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585
Handwritten digit recognition with a back-propagation network, NIPS, 1989. ,
Representing and recognizing the visual appearance of materials using three-dimensional textons, International Journal of Computer Vision, vol.43, issue.1, pp.29-44, 2001. ,
DOI : 10.1023/A:1011126920638
Real-time computerized annotation of pictures, Proceedings of the 14th annual ACM international conference on Multimedia , MULTIMEDIA '06, pp.985-1002, 2008. ,
DOI : 10.1145/1180639.1180841
OPTIMOL: Automatic object picture collection via incremental model learning, CVPR, 2007. ,
Codemaps - Segment, Classify and Search Objects Locally, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.454
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
DOI : 10.1109/CVPR.2016.348
Largescale image classification: Fast feature extraction and SVM training, CVPR, 2011. ,
Image annotation via graph learning, Pattern Recognition, vol.42, issue.2, pp.218-228, 2009. ,
DOI : 10.1016/j.patcog.2008.04.012
Fully convolutional networks for semantic segmentation, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7298965
Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999. ,
DOI : 10.1109/ICCV.1999.790410
On the number of linear regions of deep neural networks, 2014. ,
Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams, Procedings of the British Machine Vision Conference 2015, 2015. ,
DOI : 10.5244/C.29.178
Learning Deconvolution Network for Semantic Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.178
New strategies for image annotation: Overview of the photo annotation task at ImageCLEF 2010, Working Notes of CLEF, 2010. ,
An analysis system for scenes containing objects with substructures, ICPR, 1978. ,
Modeling the shape of the scene: a holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001. ,
DOI : 10.1023/A:1011139631724
Sparse coding with an overcomplete basis set: A strategy employed by v1?, Vision Research, vol.37, issue.23, pp.3311-3325, 1997. ,
Robust and efficient models for action recognition and localization, 2015. ,
URL : https://hal.archives-ouvertes.fr/tel-01217362
Action and Event Recognition with Fisher Vectors on a Compact Feature Set, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.228
URL : https://hal.archives-ouvertes.fr/hal-00873662
Spatio-temporal Object Detection Proposals, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10578-9_48
URL : https://hal.archives-ouvertes.fr/hal-01021902
Efficient Action Localization with Approximately Normalized Fisher Vectors, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.326
URL : https://hal.archives-ouvertes.fr/hal-00979594
TRECVID 2012 ? an overview of the goals, tasks, data, evaluation mechanisms and metrics, Proceedings of TRECVID, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00953826
Automatic multimedia crossmodal correlation discovery, ACM SIGKDD, 2004. ,
Scene recognition and weakly supervised object localization with deformable part-based models, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126383
Weakly-and semisupervised learning of a deep convolutional network for semantic image segmentation, ICCV, 2015. ,
Constrained Convolutional Neural Networks for Weakly Supervised Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.209
Reverend Bayes on inference engines: A distributed hierarchical approach, Proceedings of the Second National Conference on Artificial Intelligence, 1982. ,
Action Recognition with Stacked Fisher Vectors, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10602-1_38
A hybrid generative/discriminative classification framework based on free-energy terms, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459453
Free energy score space, NIPS, 2009. ,
Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007. ,
DOI : 10.1109/CVPR.2007.383266
Fisher vectors meet Neural Networks: A hybrid classification architecture, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7298998
Large-scale image categorization with explicit data embedding, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5539914
Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630
Towards good practice in large-scale learning for image classification, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012. ,
DOI : 10.1109/CVPR.2012.6248090
URL : https://hal.archives-ouvertes.fr/hal-00690014
Recurrent convolutional neural networks for scene labeling, ICML, 2014. ,
Learning object class detectors from weakly annotated video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012. ,
DOI : 10.1109/CVPR.2012.6248065
URL : https://hal.archives-ouvertes.fr/hal-00695940
Objects in Context, 2007 IEEE 11th International Conference on Computer Vision, 2007. ,
DOI : 10.1109/ICCV.2007.4408986
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence ,
DOI : 10.1109/TPAMI.2016.2577031
U-Net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer-Assisted Intervention, 2015. ,
DOI : 10.1007/978-3-319-24574-4_28
Object-Centric Spatial Pooling for Image Classification, ECCV, 2012. ,
DOI : 10.1007/978-3-642-33709-3_1
One-shot learning with a hierarchical nonparametric bayesian model, ICML Unsupervised and Transfer Learning workshop, 2012. ,
High-dimensional signature compression for large-scale image classification, CVPR 2011, 2011. ,
DOI : 10.1109/CVPR.2011.5995504
Exponential family Fisher vector for image classification, Pattern Recognition Letters, vol.59, pp.26-32, 2015. ,
DOI : 10.1016/j.patrec.2015.03.010
Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, pp.2216-2223, 2012. ,
DOI : 10.1016/j.patrec.2012.07.019
Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013. ,
DOI : 10.1007/s11263-013-0636-x
Name-It: naming and detecting faces in news videos, IEEE Multimedia, vol.6, issue.1, pp.22-35, 1999. ,
DOI : 10.1109/93.752960
Coordinated Local Metric Learning, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), 2015. ,
DOI : 10.1109/ICCVW.2015.56
URL : https://hal.archives-ouvertes.fr/hal-01215272
Local grayvalue invariants for image retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.5, pp.530-534, 1997. ,
DOI : 10.1109/34.589215
URL : https://hal.archives-ouvertes.fr/inria-00548358
FaceNet: A unified embedding for face recognition and clustering, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7298682
Fully connected deep structured networks, Arxiv preprint, 2015. ,
Active learning literature survey, 2009. ,
Bayesian Joint Topic Modelling for Weakly Supervised Object Localisation, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.371
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation, ECCV, pp.1-15, 2006. ,
DOI : 10.1007/11744023_1
Very deep convolutional networks for large-scale image recognition, 2015. ,
Fisher Vector Faces in the Wild, Procedings of the British Machine Vision Conference 2013, 2013. ,
DOI : 10.5244/C.27.8
In Defence of Negative Mining for Annotating Weakly Labelled Data, ECCV, 2012. ,
DOI : 10.1007/978-3-642-33712-3_43
Weakly supervised object detector learning with model drift detection, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126261
Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003. ,
DOI : 10.1109/ICCV.2003.1238663
Who are you? " : Learning person specific classifiers from video, CVPR, 2009. ,
On learning to localize objects with minimal supervision, ICML, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00996849
Weakly-supervised discovery of visual pattern configurations, NIPS, 2014. ,
ACTIVE: Activity Concept Transitions in Video Event Classification, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.453
SuperParsing: Scalable Nonparametric Image Parsing with Superpixels, IJCV, vol.101, issue.2, pp.329-349, 2013. ,
DOI : 10.1007/978-3-642-15555-0_26
Selective Search for Object Recognition, International Journal of Computer Vision, vol.57, issue.1, pp.154-171, 2013. ,
DOI : 10.1007/s11263-013-0620-5
Fisher and VLAD with FLAIR, CVPR, 2014. ,
Coloring Local Feature Extraction, ECCV, 2006. ,
DOI : 10.1002/col.10049
URL : https://hal.archives-ouvertes.fr/inria-00548576
Efficient additive kernels via explicit feature maps, CVPR, 2010. ,
Region Classification with Markov Field Aspect Models, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007. ,
DOI : 10.1109/CVPR.2007.383098
URL : https://hal.archives-ouvertes.fr/inria-00321129
Scene segmentation with CRFs learned from partially labeled images, NIPS, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00321051
Large-scale live active learning: Training object detectors with crawled data and crowds, CVPR 2011, 2011. ,
DOI : 10.1109/CVPR.2011.5995430
Filter-based mean-field inference for random fields with higher-order terms and product label-spaces, 2014. ,
Robust Real-Time Face Detection, International Journal of Computer Vision, vol.57, issue.2, pp.137-154, 2004. ,
DOI : 10.1023/B:VISI.0000013087.49260.fb
Weakly Supervised Object Localization with Latent Category Learning, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10599-4_28
Action Recognition with Improved Trajectories, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.441
URL : https://hal.archives-ouvertes.fr/hal-00873267
Dense Trajectories and Motion Boundary Descriptors for Action Recognition, International Journal of Computer Vision, vol.73, issue.2, pp.60-79, 2013. ,
DOI : 10.1007/s11263-012-0594-8
URL : https://hal.archives-ouvertes.fr/hal-00725627
A Robust and Efficient Video Representation for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, p.2015 ,
DOI : 10.1007/s11263-015-0846-5
URL : https://hal.archives-ouvertes.fr/hal-01145834
Two-stage metric learning, ICML, 2014. ,
Unsupervised Learning of Visual Representations Using Videos, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.320
Statistical pattern recognition, 2002. ,
Distance metric learning for large margin nearest neighbor classification, JMLR, vol.10, pp.207-244, 2009. ,
Distance metric learning for large margin nearest neighbor classification, NIPS, 2006. ,
WSABIE: Scaling up to large vocabulary image annotation, IJCAI, 2011. ,
Object categorization by learned universal visual dictionary, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005. ,
DOI : 10.1109/ICCV.2005.171
Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation, CIVR, 2005. URL www.edschofield.com/publications ,
DOI : 10.1007/11526346_54
Understanding belief propagation and its generalizations, 2002. ,
Discriminative subvolume search for efficient action detection, CVPR, 2009. ,
SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2126-2136, 2006. ,
DOI : 10.1109/CVPR.2006.301
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.213-238, 2007. ,
DOI : 10.1007/s11263-006-9794-4
URL : https://hal.archives-ouvertes.fr/inria-00548574
Edge Boxes: Locating Object Proposals from Edges, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10602-1_26
Deep learning of invariant features via simulated fixations in video, NIPS, 2012. Appendix A Curriculum vitae 81 ,
38330 Montbonnot, France Email: Jakob.Verbeek@inria.fr Webpage: http://thoth.inrialpes.fr/?verbeek Citizenship: Dutch, Date of birth, 1975. ,
Thesis: Mixture models for clustering and dimension reduction Thesis: An information theoretic approach to finding word groups for text classification Thesis: Overfitting using the minimum description length principle. Awards 2011 ? Outstanding Reviewer Award Professional Activities Participation in Research Projects 2016-2018 ? Structured prediction for weakly supervised semantic segmentation, funded by Facebook Artificial Intelligence Research (FAIR) Paris and French national research and technology agency (ANRT). 2015-2016 ? Incremental learning for object category localization, Informatics Institute Dutch National Research Institute for Mathematics and Computer Science & University of Amsterdam. Advisors: Prof. Dr. P. Vitányi, Dr. P. GrünwaldGr¨Grünwald, and Dr. R. de Wolf ? Outstanding Reviewer Award, IEEE Conference on Computer Vision and Pattern Recognition ? Researcher (CR1), INRIA RhôneRh?Rhône-Alpes-2005 ? Postdoc, Intelligent Autonomous Systems group, Informatics Institute MBDA Systems. 2013-2016 ? Physionomie: Physiognomic Recognition for Forensic Investigation , funded by French national research agency (ANR). 2011-2015 ? AXES: Access to Audiovisual Archives, European integrated project, 7th Framework Programme. 2010-2013 ? Quaero Consortium for Multimodal Person Recognition, funded by French national research agency (ANR). 2009-2012 ? Modeling multi-media documents for cross-media access, funded by Xerox Research Centre Europe (XRCE) and French national research and technology agency (ANRT). 2008-2010 ? Interactive Image Search, funded by French national research agency (ANR). 2006-2009 ? Cognitive-Level Annotation using Latent Statistical Structure (CLASS), funded by European Union Sixth Framework Programme. 2000-2005 ? Tools for Non-linear Data Analysis, funded by Dutch Technology Foundation (STW), 1998. ,
Netherlands Organisation for Scientific Research (NWO) Miscellaneous Research Visits 2011 ? Visiting researcher Statistical Machine Learning group, Miscellaneous (continued) Summer Schools & Workshops 2015 ? DGA workshop on Big Data in Multimedia Information Processing, 2003. ,
2014 ? 3rd Croatian Computer Vision Workshop, Center of Excellence for Computer Vision, ? 2nd IST Workshop on Computer Vision and Machine Learning, 2011. ,
Image categorization using Fisher kernels of non-iid image models Modelling spatial layout for image classification, ? Statistical Machine Learning group, 2011. ,
? Content Analysis group, Xerox Research Centre Europe, Manifold learning: unsupervised, correspondences, and semi-supervised. 2005 ? Learning and Recognition in Vision group, INRIA RhôneRh?Rhône-Alpes, Manifold learning & image segmentation Manifold learning with local linear models and Gaussian fields. 2004 ? Algorithms and Complexity group, Dutch Center for Mathematics and Computer Science, Semi-supervised dimension reduction through smoothing on graphs Spectral methods for dimension reduction and nonlinear CCA A generative model for the Self-Organizing Map. Publications In peer reviewed international journals Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning, ? Information and Language Processing Systems group ? G. Cinbis, J. Verbeek, C. Schmid. Approximate Fisher kernels of non-iid image models for image categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002. ,
A Robust and Efficient Video Representation for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, 2015. ,
DOI : 10.1007/s11263-015-0846-5
URL : https://hal.archives-ouvertes.fr/hal-01145834
Circulant Temporal Encoding for Video Retrieval and Temporal Alignment, International Journal of Computer Vision, vol.33, issue.4, 2013. ,
DOI : 10.1007/s11263-015-0875-0
URL : https://hal.archives-ouvertes.fr/hal-01162603
Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013. ,
DOI : 10.1007/s11263-013-0636-x
Distance-Based Image Classification: Generalizing to New Classes at Near-Zero Cost, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.11, pp.2624-2637, 2013. ,
DOI : 10.1109/TPAMI.2013.83
URL : https://hal.archives-ouvertes.fr/hal-00817211
Tree-Structured CRF Models for Interactive Image Labeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.2, pp.476-489, 2010. ,
DOI : 10.1109/TPAMI.2012.100
URL : https://hal.archives-ouvertes.fr/hal-00688143
Category Level Object Segmentation by Combining Bag-of-Words Models with Dirichlet Processes and Random Fields, International Journal of Computer Vision, vol.77, issue.1???3, pp.238-253, 2009. ,
DOI : 10.1007/s11263-009-0245-x
URL : https://hal.archives-ouvertes.fr/inria-00439303
Learning Color Names for Real-World Applications, IEEE Transactions on Image Processing, vol.18, issue.7, pp.1512-1523, 2006. ,
DOI : 10.1109/TIP.2009.2019809
URL : https://hal.archives-ouvertes.fr/inria-00439284
Gaussian fields for semi-supervised regression and correspondence learning, Pattern Recognition, vol.39, issue.10, pp.1864-1875, 2006. ,
DOI : 10.1016/j.patcog.2006.04.011
URL : https://hal.archives-ouvertes.fr/inria-00321133
Learning nonlinear image manifolds by global alignment of local linear models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.8, pp.1236-1250, 2005. ,
DOI : 10.1109/TPAMI.2006.166
URL : https://hal.archives-ouvertes.fr/inria-00321131
Active Appearance-Based Robot Localization Using Stereo Vision, Autonomous Robots, vol.18, issue.1, pp.59-80, 2005. ,
DOI : 10.1023/B:AURO.0000047287.00119.b6
URL : https://hal.archives-ouvertes.fr/inria-00321476
Self-organizing mixture models, Neurocomputing, vol.63, pp.99-123, 2003. ,
DOI : 10.1016/j.neucom.2004.04.008
URL : https://hal.archives-ouvertes.fr/inria-00321479
Efficient Greedy Learning of Gaussian Mixture Models, Neural Computation, vol.35, issue.1, pp.469-485, 2003. ,
DOI : 10.1214/aos/1176344374
URL : https://hal.archives-ouvertes.fr/inria-00321487
The global k-means clustering algorithm, Pattern Recognition, vol.36, issue.2, pp.451-461, 2002. ,
DOI : 10.1016/S0031-3203(02)00060-2
URL : https://hal.archives-ouvertes.fr/inria-00321493
A k-segments algorithm for finding principal curves, Pattern Recognition Letters, vol.23, issue.8, pp.1009-1017, 2002. ,
DOI : 10.1016/S0167-8655(02)00032-6
URL : https://hal.archives-ouvertes.fr/inria-00321497
Efficient Action Localization with Approximately Normalized Fisher Vectors Segmentation Driven Object Detection with Fisher Vectors, Proceedings IEEE Conference on Computer Vision and Pattern Recognition Proceedings IEEE International Conference on Computer Vision, 2013. ,
Action and Event Recognition with Fisher Vectors on a Compact Feature Set Metric learning for large scale image classification: generalizing to new classes at near-zero cost, Proceedings IEEE International Conference on Computer Vision Proceedings European Conference on Computer Vision, 2012. ,
Image categorization using Fisher kernels of non-iid image models, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2011. ,
DOI : 10.1109/CVPR.2012.6247926
URL : https://hal.archives-ouvertes.fr/hal-00685943
Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126406
URL : https://hal.archives-ouvertes.fr/inria-00612277
Unsupervised metric learning for face identification in TV video, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126415
URL : https://hal.archives-ouvertes.fr/inria-00611682
Learning tree-structured descriptor quantizers for image categorization, Proceedings British Machine Vision Conference, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00613118
Learning structured prediction models for interactive image labeling Multiple instance metric learning from automatically labeled bags of faces, Proceedings IEEE Conference on Computer Vision and Pattern Recognition Proceedings European Conference on Computer Vision, 2010. ,
Multimodal semi-supervised learning for image classication, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, 2010. ,
Improving web image search results using query-relative classifiers, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. ,
DOI : 10.1109/CVPR.2010.5540092
URL : https://hal.archives-ouvertes.fr/inria-00548636
Trans Media Relevance Feedback for Image Autoannotation, Procedings of the British Machine Vision Conference 2010, 2010. ,
DOI : 10.5244/C.24.20
URL : https://hal.archives-ouvertes.fr/inria-00548632
EP for efficient stochastic control with obstacles, Proceedings European Conference on Artificial Intelligence, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00548631
Image annotation with tagprop on the MIRFLICKR set, Proceedings of the international conference on Multimedia information retrieval, MIR '10, 2009. ,
DOI : 10.1145/1743384.1743476
URL : https://hal.archives-ouvertes.fr/inria-00548628
TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459266
URL : https://hal.archives-ouvertes.fr/inria-00439276
Is that you? Metric learning approaches for face identification, 2009 IEEE 12th International Conference on Computer Vision, 2009. ,
DOI : 10.1109/ICCV.2009.5459197
URL : https://hal.archives-ouvertes.fr/inria-00439290
Verbeek Ranking user-annotated images for multiple query terms Automatic face naming with caption-based supervision, Proceedings British Machine Vision Conference Proceedings IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2008. ,
Improving People Search Using Query Expansions, Proceedings European Conference on Computer Vision, pp.86-99, 2008. ,
DOI : 10.1007/978-3-540-88688-4_7
URL : https://hal.archives-ouvertes.fr/inria-00321045
Scene segmentation with CRFs learned from partially labeled images, Advances in Neural Information Processing Systems, pp.1553-1560, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00321051
Semi-supervised dimensionality reduction using pairwise equivalence constraints Learning color names from real-world images, Proceedings International Conference on Computer Vision Theory and Applications Proceedings IEEE Conference on Computer Vision and Pattern Recognition, pp.489-496, 2007. ,
Region Classification with Markov Field Aspect Models, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007. ,
DOI : 10.1109/CVPR.2007.383098
URL : https://hal.archives-ouvertes.fr/inria-00321129
Using high-level visual information for color constancy Transformation invariant component analysis for binary images Non-linear CCA and PCA by alignment of local models, Proceedings IEEE International Conference on Computer Vision Proceedings IEEE Conference on Computer Vision and Pattern Recognition Advances in Neural Information Processing Systems 16, pp.1-8, 2003. ,
Enhancing appearance-based robot localization using non-dense disparity maps, Proceedings International Conference on Intelligent Robots and Systems, pp.980-985, 2003. ,
Self-organization by optimizing free-energy, Proceedings 11th European Symposium on Artificial Neural Networks, pp.125-130, 2002. ,
URL : https://hal.archives-ouvertes.fr/inria-00321491
Coordinating Principal Component Analyzers, Proceedings International Conference on Artificial Neural Networks, pp.914-919, 2002. ,
DOI : 10.1007/3-540-46084-5_148
URL : https://hal.archives-ouvertes.fr/inria-00321498
Fast nonlinear dimensionality reduction with topology preserving networks, Proceedings 10th European Symposium on Artificial Neural Networks, pp.193-198, 2001. ,
URL : https://hal.archives-ouvertes.fr/inria-00321500
A Soft k-Segments Algorithm for Principal Curves, Proceedings International Conference on Artificial Neural Networks, pp.450-456, 2001. ,
DOI : 10.1007/3-540-44668-0_63
URL : https://hal.archives-ouvertes.fr/inria-00321506
Large scale metric learning for distance-based image classification on open ended data sets Advances in Computer Vision and Pattern Recognition, Color in Computer Vision, Wiley, 2012. Workshops and regional conferences 2015 ? S. Saxena, and J. Verbeek. Coordinated Local Metric Learning. ICCV ChaLearn Looking at People workshop, 2012. ,
Patch-level Spatial Layout for Classification and Weakly Supervised Localization German Conference on Pattern Recognition The INRIA-LIM-VocR and AXES submissions to Trecvid 2014 Multimedia Event Detection, 2013. ,
QCompere @ REPERE 2013 Workshop on Speech, Language and Audio for Multimedia, Parkhi, and R. Arandjelovic, A. Zisserman, F. Basura, and T. Tuytelaars. AXES at TRECVid 2012: KIS, INS, and MED. TRECVID Workshop, 2012. ,
Fusion of Speech, Faces and Text for Person Identification in TV Broadcast, Learning to Rank and Quadratic Assignment. NIPS Workshop on Discrete Optimization in Machine Learning ? T. Mensink, G. Csurka, F. Perronnin, J. Sánchez, and J. Verbeek. LEAR and XRCEs participation to Visual Concept Detection Task -ImageCLEF 2010. Working Notes for the CLEF 2010 Workshop, 2010. ,
DOI : 10.1007/978-3-642-33885-4_39
URL : https://hal.archives-ouvertes.fr/hal-00722884
Apprentissage de distance pour l'annotation d'images par plus proches voisins. Reconnaissance des Formes et Intelligence Artificielle INRIA-LEARs participation to ImageCLEF Working Notes for the CLEF, ? J. Nunnink, J. Verbeek, and N. Vlassis. Accelerated greedy mixture learning. Proceedings Annual Machine Learning Conference of Belgium and the Netherlands, pp.80-86, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00439309
A variational EM algorithm for large-scale mixture modeling, Proceedings Conference of the Advanced School for Computing and Imaging, pp.136-143, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00321486
Non-linear feature extraction by the coordination of mixture models, Proceedings Conference of the Advanced School for Computing and Imaging, pp.287-293, 2002. ,
URL : https://hal.archives-ouvertes.fr/inria-00321490
Locally linear generative topographic mapping, Proceedings Annual Machine Learning Conference of Belgium and the Netherlands, pp.79-86, 2001. ,
URL : https://hal.archives-ouvertes.fr/inria-00321501
Efficient Greedy Learning of Gaussian Mixture Models, Proceedings 13th Belgian- Dutch Conference on Artificial Intelligence, pp.251-258, 2001. ,
DOI : 10.1214/aos/1176344374
URL : https://hal.archives-ouvertes.fr/inria-00321487
Greedy Gaussian mixture learning for texture segmentation. (oral) ICANN Workshop on Kernel and Subspace Methods for Computer Vision, Publications, pp.37-46, 2000. ,
URL : https://hal.archives-ouvertes.fr/inria-00321513
Supervised feature extraction for text categorization Using a sample-dependent coding scheme for two-part MDL, Proceedings Annual Machine Learning Conference of Belgium and the Netherlands Proceedings Machine Learning & Applications (ACAI '99), 1999. ,
Metric Learning for Nearest Class Mean Classifiers United States Patent Application 20140029839, Publication date: 01/30/2014, filing date: 07/30/2012, XEROX Corporation Learning Structured prediction models for interactive image labeling. United States Patent Application 20120269436, Publication date: 25, XEROX Corporation Retrieval systems and methods employing probabilistic cross-media relevance feedback, p.31, 2010. ,
Image classification with the Fisher vector: theory and practice Large scale metric learning for distance-based image classification Region-based image classification with a latent SVM model, 2011. ,
Spatial Fisher vectors for image categorization, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00613572
Weighted transmedia relevance feedback for image retrieval and autoannotation Face recognition from caption-based supervision Category level object segmentation by combining bag-of-words models and Markov random fields, ? J. Verbeek, and N. Vlassis. Semi-supervised learning with Gaussian fields, 2005. ,
Rodent behavior annotation from video, ? J. Verbeek, and N. Vlassis. Gaussian mixture learning from noisy data, 2002. ,
URL : https://hal.archives-ouvertes.fr/inria-00548500
The generative self-organizing map: a probabilistic generalization of Kohonen's SOM, 2002. ,
Procrustes analysis to coordinate mixtures of probabilistic principal component analyzers The global k-means clustering algorithm, 2001. ,
Efficient Greedy Learning of Gaussian Mixture Models, Neural Computation, vol.35, issue.1, 2000. ,
DOI : 10.1214/aos/1176344374
URL : https://hal.archives-ouvertes.fr/inria-00321487
A k-segments algorithm for finding principal curves, Pattern Recognition Letters, vol.23, issue.8, 2000. ,
DOI : 10.1016/S0167-8655(02)00032-6
URL : https://hal.archives-ouvertes.fr/inria-00321497