N. Apostoloff and A. W. Fitzgibbon, Learning spatiotemporal Tjunctions for occlusion detection, CVPR, 2005.

X. Bai, J. Wang, D. Simons, and G. Sapiro, Video snapcut: robust video object cutout using localized classifiers, 2009.

M. Black and D. Fleet, Probabilistic detection and tracking of motion discontinuities, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.231-245, 2000.
DOI : 10.1109/ICCV.1999.791271

A. Blake, C. Rother, M. Brown, P. Perez, and P. H. Torr, Interactive Image Segmentation Using an Adaptive GMMRF Model, ECCV, pp.428-441, 2004.
DOI : 10.1007/978-3-540-24670-1_33

R. C. Bolles, H. H. Baker, and D. H. Marimont, Epipolar-plane image analysis: An approach to determining structure from motion, International Journal of Computer Vision, vol.21, issue.1, pp.7-56, 1987.
DOI : 10.1007/BF00128525

E. Boros and P. L. Hammer, Pseudo-Boolean optimization, Discrete Applied Mathematics, vol.123, issue.1-3, pp.155-225, 2002.
DOI : 10.1016/S0166-218X(01)00341-9
URL : https://hal.archives-ouvertes.fr/hal-01150533

Y. Y. Boykov and M. P. Jolly, Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, 2001.
DOI : 10.1109/ICCV.2001.937505

W. Brendel and S. Todorovic, Video object segmentation by tracking regions, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459242
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.163.1234

M. Brox and J. Malik, Object Segmentation by Long Term Analysis of Point Trajectories, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_21

D. Comaniciu and P. Meer, Mean shift: a robust approach toward feature space analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.5, 2002.
DOI : 10.1109/34.1000236

J. P. Costeira and T. Kanade, A multibody factorization method for independently moving objects, International Journal of Computer Vision, vol.29, issue.3, pp.159-179, 1998.
DOI : 10.1023/A:1008000628999

A. Delong, A. Osokin, H. N. Isack, and Y. Boykov, Fast approximate energy minimization with label costs [13] D. Dementhon. Spatio-temporal segmentation of video by hierarchical mean shift analysis, CVPR Statistical Methods in Video Processing Workshop (SMVP), 2002.
DOI : 10.1109/cvpr.2010.5539897
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.184.7526

P. F. Felzenszwalb and D. P. Huttenlocher, Efficient Graph-Based Image Segmentation, International Journal of Computer Vision, vol.59, issue.2, pp.167-181, 2004.
DOI : 10.1023/B:VISI.0000022288.19776.77

D. Goldman, C. Gonterman, B. Curless, D. Salesin, and S. Seitz, Video object annotation, navigation, and composition, Proceedings of the 21st annual ACM symposium on User interface software and technology, UIST '08, 2008.
DOI : 10.1145/1449715.1449719

M. Grundmann, V. Kwatra, M. Han, and I. Essa, Efficient hierarchical graph-based video segmentation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539893
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.294.4979

V. Kolmogorov, Convergent Tree-Reweighted Message Passing for Energy Minimization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.10, pp.1568-1583, 2006.
DOI : 10.1109/TPAMI.2006.200
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.100.2409

V. Kolmogorov and R. Zabih, What energy functions can be minimized via graph cuts?, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.2, pp.147-159, 2004.
DOI : 10.1109/TPAMI.2004.1262177
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.113.1823

N. Komodakis, N. Tziritas, and . Paragios, Fast, Approximately Optimal Solutions for Single and Dynamic MRFs, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383095

M. P. Kumar, P. H. Torr, and A. Zisserman, Learning layered motion segmentations of video, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.138

L. Ladicky, P. Sturgess, K. Alahari, C. Russell, and P. H. Torr, What, Where and How Many? Combining Object Detectors and CRFs, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_31
URL : https://hal.archives-ouvertes.fr/hal-01216730

M. Marsza?ek, I. Laptev, and C. Schmid, Actions in context, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206557

B. Russell, A. Efros, J. Sivic, W. T. Freeman, and A. Zisserman, Using Multiple Segmentations to Discover Objects and their Extent in Image Collections, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.326

B. Russell, A. Torralba, K. Murphy, and W. T. Freeman, LabelMe: A Database and Web-Based Tool for Image Annotation, International Journal of Computer Vision, vol.3, issue.1, 2008.
DOI : 10.1007/s11263-007-0090-8

P. Sand and S. Teller, Particle video: Long-range motion estimation using point trajectories, IJCV, vol.80, issue.1, 2008.
DOI : 10.1007/s11263-008-0136-6

J. Shi and J. Malik, Motion segmentation and tracking using normalized cuts, ICCV, 1998.

J. Shi and J. Malik, Normalized cuts and image segmentation, 2000.

J. Shotton, J. Winn, C. Rother, and A. Criminisi, TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context, International Journal of Computer Vision, vol.62, issue.1???2, 2009.
DOI : 10.1007/s11263-007-0109-1

J. Sivic, F. Schaffalitzky, and A. Zisserman, Object Level Grouping for Video Shots, International Journal of Computer Vision, vol.2, issue.3, pp.189-210, 2006.
DOI : 10.1007/s11263-005-4264-y

P. Smith, R. Drummond, and . Cipolla, Layered motion segmentation and depth ordering by tracking edges, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.4, 2004.
DOI : 10.1109/TPAMI.2004.1265863

A. Stein, D. Hoiem, and M. Hebert, Learning to extract object boundaries using motion cues, ICCV, 2007.
DOI : 10.1109/iccv.2007.4408841
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.206.3239

C. Tomasi and T. Kanade, Detection and tracking of point features, 1991.

P. H. Torr, R. Szeliski, and P. Anandan, An integrated bayesian approach to layer extraction from image sequences, 2001.

A. Vazquez-reina, S. Avidan, H. Pfister, and E. Miller, Multiple Hypothesis Video Segmentation from Superpixel Flows, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_20

J. Wang, B. Thiesson, Y. Xu, and M. Cohen, Image and Video Segmentation by Anisotropic Kernel Mean Shift, ECCV, 2004.
DOI : 10.1007/978-3-540-24671-8_19

J. Y. Wang and E. H. Adelson, Representing moving images with layers, IEEE Transactions on Image Processing, vol.3, issue.5, pp.625-638, 1994.
DOI : 10.1109/83.334981

Y. Weiss, Smoothness in layers: Motion segmentation using nonparametric mixture estimation, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1997.
DOI : 10.1109/CVPR.1997.609375

J. Xiao and M. Shah, Motion layer extraction in the presence of occlusion using graph cuts, 2005.

J. Yan and M. Pollefeys, A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate, ECCV, 2006.
DOI : 10.1007/11744085_8

C. L. Zitnick, N. Jojic, and S. B. Kang, Consistent segmentation for optical flow estimation, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.61