G. Ahanger and T. D. Little, A survey of technologies for parsing and indexing digital video, Journal of Visual Communication and Image Representation, p.28-43, 1996.

P. Aigrain, H. Zhang, and D. Petrovic, Content-based representation and retrieval of visual media, Multimedia Tools and Applications, vol.3, p.179-202, 1996.

P. Bouthemy and F. Ganansia, Video partitioning and camera motion characterization for content-based video indexing, Proceedings of 3rd IEEE International Conference on Image Processing, 1996.
DOI : 10.1109/ICIP.1996.559646

P. Bouthemy, M. Gelgon, and F. Ganansia, A unified approach to shot change detection and camera motion characterization, IEEE Trans. on Circuits and Systems for Video Technology, 1999.

M. G. Christel, M. A. Smith, C. R. Taylor, and D. B. Winkler, Evolving video skims into useful multimedia abstractions, Proceedings of the CHI'98 Conference on Human Factors in Computing Systems, 1998.
DOI : 10.1145/274644.274670

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.124.1473

M. Gelgon and P. Bouthemy, Determining a structured spatio-temporal representation of video content for efficient visualization and indexing, Proc. 5th Eur. Conf. on Computer Vision, ECCV'98, Freiburg, 1998.

B. Gunsel, A. M. Tekalp, and P. J. van Beek, Content-based access to video objects: temporal segmentation, visual summarization, and feature extraction, Signal Processing, vol.66, issue.2, p.261-280, 1998.
DOI : 10.1016/S0165-1684(98)00010-3

R. Hammoud, L. Chen, and F. Fontaine, An extensible spatial-temporal model for semantic video segmentation, First Int. Forum on Multimedia and Image Processing, 1998.

J. Huang, S. R. Kumar, M. Mitra, W. J. Zhu, and R. Zabih, Image indexing using color correlograms, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p.762-768, 1997.
DOI : 10.1109/CVPR.1997.609412

M. Irani and P. Anandan, Video indexing based on mosaic representations, Proceedings of the IEEE, vol.86, issue.5, p.905-921, 1998.
DOI : 10.1109/5.664279

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.51.9487

R. Kumar, P. Anandan, M. Irani, J. Bergen, and K. Hanna, Representation of scenes from collections of images, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95), p.10-17, 1995.
DOI : 10.1109/WVRS.1995.476847

C. A. Lindley and A. M. Vercoustre, A specification language for dynamic virtual video sequence generation, Int. Symposium Audio, Video, Image Processing and Intelligent Applications, p.17-21, 1998.

S. A. Nene and S. K. Nayar, A simple algorithm for nearest neighbor search in high dimensions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.9, p.989-1003, 1997.
DOI : 10.1109/34.615448

J. M. Odobez and P. Bouthemy, Robust multiresolution estimation of parametric motion models, Journal of Visual Communication and Image Representation, p.348-365, 1995.

J. M. Odobez and P. Bouthemy, Separation of moving regions from background in an image sequence acquired with a mobile camera, Video Data Compression for Multimedia Computing, chapter 8, p.295-311, 1997.

Opera, DTD for video, Inria Rhône-Alpes, http://www.inrialpes.fr/opera/dtdvideo.txt, 1999.

S. Peleg and J. Herman, Panoramic mosaics by manifold projection, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p.338-343, 1997.
DOI : 10.1109/CVPR.1997.609346

R. Kazman, R. Al-Halimi, W. Hunt, and M. Mantei, Four paradigms for indexing video conferences, IEEE MultiMedia, vol.3, issue.1, p.63-73, 1996.

B. Rousso, S. Peleg, and I. Finci, Mosaicing with generalized strips, DARPA Image Understanding Workshop, p.255-260, 1997.

B. Rousso, S. Peleg, I. Finci, and A. Rav-Acha, Universal mosaicing using pipe projection, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), p.945-952, 1998.
DOI : 10.1109/ICCV.1998.710830

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.7213

W. J. Rucklidge, Locating objects using the Hausdorff distance, Proceedings of the 5th International Conference on Computer Vision, p.457-464, 1995.

B. Schiele and J. L. Crowley, Object recognition using multidimensional receptive field histograms, Proceedings of the 4th European Conference on Computer Vision, p.610-619, 1996.
DOI : 10.1007/bfb0015571

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.19

C. Schmid and R. Mohr, Local grayvalue invariants for image retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.5, p.530-534, 1997.
DOI : 10.1109/34.589215

URL : https://hal.archives-ouvertes.fr/inria-00548358

M. Seck, F. Bimbot, D. Zugaj, and B. Delyon, Two-class audio signal segmentation for speech-music-noise detection, Proc. 6th Eur. Conf. on Speech Communication and Technology, EUROSPEECH '99, 1999.

M. Smith and T. Kanade, Video skimming for quick browsing based on audio and image characterization, Proceedings of the Conference on Computer Vision and Pattern Recognition, 1997.

K. K. Sung and T. Poggio, Example-based learning for view-based human face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.1, p.39-51, 1998.
DOI : 10.1109/34.655648

M. J. Swain and D. H. Ballard, Color indexing, International Journal of Computer Vision, vol.7, issue.1, p.11-32, 1991.
DOI : 10.1007/BF00130487

R. Szeliski, Video mosaics for virtual environments, IEEE Computer Graphics and Applications, vol.16, issue.2, p.22-30, 1996.
DOI : 10.1109/38.486677

R. Weber and P. Zezula, A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces, Proceedings of the 24th VLDB Conf., 1998.

H. J. Zhang, SWIM, Extended abstracts of the 2004 conference on Human factors and computing systems, CHI '04, p.531-540, 1996.
DOI : 10.1145/985921.986144