R. D. Green and L. Guan, Quantifying and Recognizing Human Movement Patterns From Monocular Video Images - Part I: A New Framework for Modeling Human Motion, IEEE Transactions on Circuits and Systems for Video Technology, vol.14, issue.2, pp.179-190, 2004.
DOI : 10.1109/TCSVT.2003.821976

G. Guerra-Filho and Y. Aloimonos, A Language for Human Action, Computer, vol.40, issue.5, pp.42-51, 2007.
DOI : 10.1109/MC.2007.154

C. Bregler, Learning and recognizing human dynamics in video sequences, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1997.
DOI : 10.1109/CVPR.1997.609382

C. H. Lee, F. Soong, and B. H. Juang, A segment model based approach to speech recognition, ICASSP, 1988.

L. Rabiner and B. Juang, Fundamentals of speech recognition, 1993.

R. Poppe, A survey on vision-based human action recognition, Image and Vision Computing, vol.28, issue.6, pp.976-990, 2010.
DOI : 10.1016/j.imavis.2009.11.014

J. Yamato, J. Ohya, and K. Ishii, Recognizing human action in time-sequential images using hidden Markov model, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.379-385, 1992.
DOI : 10.1109/CVPR.1992.223161

A. F. Bobick and J. W. Davis, The recognition of human movement using temporal templates, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.3, 2001.
DOI : 10.1109/34.910878

D. Weinland, R. Ronfard, and E. Boyer, Free viewpoint action recognition using motion history volumes, Computer Vision and Image Understanding, vol.104, issue.2-3, pp.249-257, 2006.
DOI : 10.1016/j.cviu.2006.07.013
URL : https://hal.archives-ouvertes.fr/inria-00544629

D. Weinland, E. Boyer, and R. Ronfard, Action Recognition from Arbitrary Views using 3D Exemplars, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408849
URL : https://hal.archives-ouvertes.fr/inria-00544741

A. Veeraraghavan, R. Chellappa, and A. K. Roy-Chowdhury, The Function Space of an Activity, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.304

P. K. Turaga, A. Veeraraghavan, and R. Chellappa, From videos to verbs: Mining videos for events using a cascade of dynamical systems, CVPR, 2007.

P. K. Turaga, A. Veeraraghavan, and R. Chellappa, Statistical analysis on Stiefel and Grassmann manifolds with applications in computer vision, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587733

P. K. Turaga and R. Chellappa, Locally time-invariant models of human activities using trajectories on the Grassmannian, CVPR, 2009.

K. Kulkarni, S. Cherla, A. Kale, and V. Ramasubramanian, A framework for indexing human actions in video, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00326719

S. Carlsson and J. Sullivan, Action recognition by shape matching to key frames, 2001.

K. Schindler and L. Van Gool, Action snippets: How many frames does human action recognition require?, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587730
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.329.7430

D. Weinland and E. Boyer, Action recognition using exemplar-based embedding, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587731
URL : https://hal.archives-ouvertes.fr/inria-00590256

A. S. Ogale, A. Karapurkar, and Y. Aloimonos, View-Invariant Modeling and Recognition of Human Actions Using Grammars, ICCV Workshops, 2005.
DOI : 10.1007/978-3-540-70932-9_9

H. Ney, The use of a one-stage dynamic programming algorithm for connected word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.32, issue.2, pp.263-270, 1984.
DOI : 10.1109/TASSP.1984.1164320

V. Ramasubramanian, K. Kulkarni, and B. Kaemmerer, Acoustic modeling by phoneme templates and modified one-pass DP decoding for continuous speech recognition, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008.
DOI : 10.1109/ICASSP.2008.4518557

D. Weinland, R. Ronfard, and E. Boyer, Automatic Discovery of Action Taxonomies from Multiple Views, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.65
URL : https://hal.archives-ouvertes.fr/inria-00590216

T. Svendsen and F. Soong, On the automatic segmentation of speech signals, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1987.
DOI : 10.1109/ICASSP.1987.1169628

V. Ramasubramanian and T. Sreenivas, Automatically derived units for segment vocoders, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.473-479, 2004.
DOI : 10.1109/ICASSP.2004.1326025

R. Zelinski and F. Class, A learning procedure for speaker-dependent word recognition systems based on sequential processing of input tokens, ICASSP '83. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1983.
DOI : 10.1109/ICASSP.1983.1171906