Y. Liu, F. Zhou, W. Liu, F. De-la-torre, and Y. Liu, Unsupervised summarization of rushes videos, Proceedings of the international conference on Multimedia, MM '10, 2010.
DOI : 10.1145/1873951.1874069

S. De-avila and A. Lopes, VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method, Pattern Recognition Letters, vol.32, issue.1, pp.56-68, 2011.
DOI : 10.1016/j.patrec.2010.08.004

Y. J. Lee, J. Ghosh, and K. Grauman, Discovering important people and objects for egocentric video summarization, In: CVPR, 2012.

M. Wang, R. Hong, G. Li, Z. J. Zha, S. Yan et al., Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification, IEEE Transactions on Multimedia, vol.14, issue.4, pp.975-985, 2012.
DOI : 10.1109/TMM.2012.2185041

A. Khosla, R. Hamid, C. J. Lin, and N. Sundaresan, Large-Scale Video Summarization Using Web-Image Priors, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.348

Z. Lu and K. Grauman, Story-Driven Summarization for Egocentric Video, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.350

B. T. Truong and S. Venkatesh, Video abstraction, ACM Transactions on Multimedia Computing, Communications, and Applications, vol.3, issue.1, 2007.
DOI : 10.1145/1198302.1198305

P. Over, A. F. Smeaton, and G. Awad, The TRECVID 2008 BBC rushes summarization evaluation, Proceeding of the 2nd ACM workshop on Video summarization, TVS '08, 2008.
DOI : 10.1145/1463563.1463564

Y. F. Ma, X. S. Hua, L. Lu, and H. J. Zhang, A generic framework of user attention model and its application in video summarization, IEEE Transactions on Multimedia, vol.7, issue.5, 2005.

K. Li, S. Oh, A. G. Perera, and Y. Fu, A Videography Analysis Framework for Video Retrieval and Summarization, Proceedings of the British Machine Vision Conference 2012, 2012.
DOI : 10.5244/C.26.126

C. W. Ngo, Y. F. Ma, and H. J. Zhang, Video summarization and scene detection by graph modeling, IEEE Transactions on Circuits and Systems for Video Technology, 2005.

A. Divakaran, K. Peker, R. Radhakrishnan, Z. Xiong, and R. Cabasson, Video Summarization Using MPEG-7 Motion Activity and Audio Descriptors, In: Video Mining, 2003.
DOI : 10.1007/978-1-4757-6928-9_4

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.5.1536

L. Xie, P. Xu, S. F. Chang, A. Divakaran, and H. Sun, Structure analysis of soccer video with domain knowledge and hidden Markov models, Pattern Recognition Letters, vol.25, issue.7, 2004.
DOI : 10.1016/j.patrec.2004.01.005

Y. Rui, A. Gupta, and A. Acero, Automatically extracting highlights for TV Baseball programs, Proceedings of the eighth ACM international conference on Multimedia, MULTIMEDIA '00, 2000.
DOI : 10.1145/354384.354443

H. Sundaram, L. Xie, and S. F. Chang, A utility framework for the automatic generation of audio-visual skims, Proceedings of the tenth ACM international conference on Multimedia, MULTIMEDIA '02, 2002.
DOI : 10.1145/641007.641042

B. Zhao and E. P. Xing, Quasi Real-Time Summarization for Consumer Videos, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.322

Y. Cong, J. Yuan, and J. Luo, Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection, IEEE Transactions on Multimedia, vol.14, issue.1, 2012.
DOI : 10.1109/TMM.2011.2166951

G. Kim, L. Sigal, and E. P. Xing, Joint Summarization of Large-Scale Collections of Web Images and Videos for Storyline Reconstruction, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.538

C. Y. Lin, ROUGE: A package for automatic evaluation of summaries, In: Text Summarization Branches Out, ACL Workshop, pp.74-81, 2004.

D. Hoiem, A. A. Efros, and M. Hebert, Automatic photo pop-up, ACM Transactions on Graphics, vol.24, issue.3, pp.577-584, 2005.
DOI : 10.1145/1073204.1073232

J. Tighe and S. Lazebnik, Superparsing, International Journal of Computer Vision, vol.101, issue.2, 2013.
DOI : 10.1007/s11263-012-0574-z

J. Lezama, K. Alahari, J. Sivic, and I. Laptev, Track to the future: Spatio-temporal video segmentation with long-range motion cues, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.6044588

URL : https://hal.archives-ouvertes.fr/hal-00817961

M. Grundmann, V. Kwatra, M. Han, and I. Essa, Efficient hierarchical graph-based video segmentation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539893

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.294.4979

A. Massoudi, F. Lefebvre, C. H. Demarty, L. Oisel, and B. Chupeau, A Video Fingerprint Based on Visual Digest and Local Fingerprints, 2006 International Conference on Image Processing, 2006.
DOI : 10.1109/ICIP.2006.312834

V. Chasanis, A. Kalogeratos, and A. Likas, Movie segmentation into scenes and chapters using locally weighted bag of visual words, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, 2009.
DOI : 10.1145/1646396.1646439

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

D. Oneata, J. Verbeek, and C. Schmid, Action and Event Recognition with Fisher Vectors on a Compact Feature Set, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.228

URL : https://hal.archives-ouvertes.fr/hal-00873662

L. Cao, Y. Mu, A. Natsev, S. F. Chang, G. Hua et al., Scene Aligned Pooling for Complex Video Recognition, In: ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_49

S. M. Kay, Fundamentals of Statistical Signal Processing: Detection Theory, 1998.

Z. Harchaoui, F. Bach, and E. Moulines, Kernel change-point analysis, In: NIPS, 2008.

Z. Harchaoui and O. Cappé, Retrospective Multiple Change-Point Estimation with Kernels, 2007 IEEE/SP 14th Workshop on Statistical Signal Processing, pp.768-772, 2007.
DOI : 10.1109/SSP.2007.4301363

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.78.4776

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn., 2009.

S. Arlot, A. Celisse, and Z. Harchaoui, Kernel change-point detection, 2012.

F. C. Crow, Summed-area tables for texture mapping, ACM SIGGRAPH Computer Graphics, vol.18, issue.3, pp.207-212, 1984.
DOI : 10.1145/964965.808600

A. Gaidon, Z. Harchaoui, and C. Schmid, Temporal localization of actions with actoms, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00687312

H. Wang, A. Kläser, C. Schmid, and C. L. Liu, Dense Trajectories and Motion Boundary Descriptors for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, pp.60-79, 2013.
DOI : 10.1007/s11263-012-0594-8

URL : https://hal.archives-ouvertes.fr/hal-00725627

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008.
DOI : 10.1017/CBO9780511809071