A. Aner and J. R. Kender, Video Summaries through Mosaic-Based Shot and Scene Clustering, Proc. ECCV, 2002.
DOI : 10.1007/3-540-47979-1_26

M. Ardebilian, X. W. Tu, L. Chen, and P. Faudemay, Video segmentation using 3D hints contained in 2D images, 1996.

A. Baumberg, Reliable feature matching across widely separated views, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), 2000.
DOI : 10.1109/CVPR.2000.855899

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.1666

S. Belongie, J. Malik, and J. Puzicha, Matching shapes, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, 2001.
DOI : 10.1109/ICCV.2001.937552

S. Benayoun, H. Bernard, P. Bertolino, M. Gelgon, C. Schmid et al., Structuration de vidéos pour des interfaces de consultation avancées, Proc. CORESA, 1998.

S. Birchfield, KLT: An implementation of the Kanade- Lucas-Tomasi feature tracker, 1998.

C. Bregler and J. Malik, Tracking people with twists and exponential maps, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), 1998.
DOI : 10.1109/CVPR.1998.698581

J. B. Burns, R. S. Weiss, and E. M. Riseman, View variation of point-set and line-segment features, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.1, 1993.
DOI : 10.1109/34.184774

D. E. Difranco, T. Cham, and J. M. Rehg, Recovery of 3D articulated motion from 2D correspondences, Compaq Cambridge Res. Lab, vol.7, 1999.

O. Faugeras, Q. Luong, and T. Papadopoulo, The Geometry of Multiple Images, 2001.

O. D. Faugeras and M. Hebert, The Representation, Recognition, and Locating of 3-D Objects, The International Journal of Robotics Research, vol.5, issue.3, 1986.
DOI : 10.1177/027836498600500302

M. Fischler and R. Bolles, Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography, CACM, vol.24, issue.6, 1981.

A. Fitzgibbon and A. Zisserman, Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects, Proc. European Conf. Comp. Vision, pp.891-906, 2000.
DOI : 10.1007/3-540-45054-8_58

U. Gargi, R. Kasturi, and S. H. Strayer, Performance characterization of video-shot-change detection methods, IEEE Transactions on Circuits and Systems for Video Technology, vol.10, issue.1, 2000.
DOI : 10.1109/76.825852

U. I. Gupta, D. T. Lee, and Y. Y. Leung, Efficient algorithms for interval graphs and circular-arc graphs, Networks, vol.9, issue.4, p.12, 1982.
DOI : 10.1002/net.3230120410

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, 1988.
DOI : 10.5244/C.2.23

R. Hartley and A. Zisserman, Multiple view geometry in computer vision Surface matching for object recognition in complex three-dimensional scenes, IVC, vol.16, 1998.

R. Lienhart, RELIABLE TRANSITION DETECTION IN VIDEOS: A SURVEY AND PRACTITIONER'S GUIDE, International Journal of Image and Graphics, vol.01, issue.03, 2002.
DOI : 10.1142/S021946780100027X

T. Lindeberg and J. Gårding, Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure, Image and Vision Computing, vol.15, issue.6, p.15, 1997.
DOI : 10.1016/S0262-8856(97)01144-X

D. Lowe, Distinctive image features from scale-invariant keypoints. IJCV, 2003.

S. Mahamud, M. Hebert, Y. Omori, and J. Ponce, Provably-convergent iterative methods for projective structure from motion, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pp.1018-1025, 2001.
DOI : 10.1109/CVPR.2001.990642

K. Mikolajczyk and C. Schmid, An Affine Invariant Interest Point Detector, Proc. ECCV, 2002.
DOI : 10.1007/3-540-47969-4_9

URL : https://hal.archives-ouvertes.fr/inria-00548252

J. L. Mundy and A. Zisserman, Geometric Invariance in Computer Vision, 1992.

C. J. Poelman and T. Kanade, A paraperspective factorization method for shape and motion recovery, PAMI, vol.19, issue.3, 1997.

M. Pollefeys, R. Koch, and L. Van-gool, Self-calibration and metric reconstruction in spite of varying and unknown internal camera parameters, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), 1998.
DOI : 10.1109/ICCV.1998.710705

P. Pritchett and A. Zisserman, Wide baseline stereo matching, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), 1998.
DOI : 10.1109/ICCV.1998.710802

F. Rothganger, S. Lazebnik, C. Schmid, and J. Ponce, 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., 2003.
DOI : 10.1109/CVPR.2003.1211480

URL : https://hal.archives-ouvertes.fr/inria-00548224

F. Schaffalitzky and A. Zisserman, Automated Scene Matching in Movies, Challenges of Image and Video Retrieval, 2002.
DOI : 10.1007/3-540-45479-9_20

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

C. Tomasi and T. Kanade, Shape and motion from image streams under orthography: a factorization method, International Journal of Computer Vision, vol.4, issue.1, 1992.
DOI : 10.1007/BF00129684

T. Tuytelaars and L. Van-gool, Matching widely separated views based on affinely invariant neighborhoods, IJCV, 2003.

M. M. Yeung and B. Liu, Efficient matching and clustering of video shots, Proceedings., International Conference on Image Processing, 1995.
DOI : 10.1109/ICIP.1995.529715

Z. Zhang, Token tracking in a cluttered scene, Image and Vision Computing, vol.12, issue.2, 1994.
DOI : 10.1016/0262-8856(94)90020-5

URL : https://hal.archives-ouvertes.fr/inria-00074599