A. Allauzen and J. Gauvain, Adaptation automatique du modèle de langage d'un système de transcription de journaux parlés, 2003.

J. F. Allen, Maintaining knowledge about temporal intervals, Communications of the ACM, vol.26, issue.11, pp.832-843, 1983.
DOI : 10.1145/182.358434

J. F. Allen, A general model of action and time, Artificial Intelligence, vol.23, issue.2, 1984.

O. Aubert, P. Champin, and Y. Prié, Integration of semantic web technology in an annotation-based hypervideo system, Workshop on Semantic Web Annotations for Multimedia (SWAMM'06), 2006.

O. Aubert and Y. Prié, Advene, Proceedings of the sixteenth ACM conference on Hypertext and hypermedia , HYPERTEXT '05, pp.235-244, 2005.
DOI : 10.1145/1083356.1083405

URL : https://hal.archives-ouvertes.fr/hal-01503413

G. Auffret, J. Carrive, O. Chevet, and T. Dechilly, Audiovisual-based hypermedia authoring, Proceedings of the tenth ACM Conference on Hypertext and hypermedia : returning to our diverse roots returning to our diverse roots, HYPERTEXT '99, 1999.
DOI : 10.1145/294469.294620

URL : https://hal.archives-ouvertes.fr/inria-00545189

F. Baader, D. Calvanese, D. Mcguiness, D. Nardi, and P. Patel-schneider, The Description Logic Handbook, 2002.
DOI : 10.1017/CBO9780511711787

W. Bailer and P. Schallauer, The Detailed Audiovisual Profile: Enabling Interoperability between MPEG-7 Based Systems, 2006 12th International Multi-Media Modelling Conference, pp.217-224, 2006.
DOI : 10.1109/MMMC.2006.1651323

V. Beaudoin, Mètres et Rythmes du Vers Classique, 2002.

V. Beaudoin and F. Yvon, The Metrometer: a Tool for Analysing French Verse, Literary and Linguistic Computing, vol.11, issue.1, 1996.
DOI : 10.1093/llc/11.1.23

H. Bowman, H. Cameron, P. King, and S. Thompson, Mexitl: Multimedia in executable interval temporal logic, 1997.

V. Brunie, J. Carrive, and L. Vinet, Ing??nierie des documents audiovisuels : le projet FERIA. Une approche centr??e sur la description des contenus, Techniques et sciences informatiques, vol.25, issue.4, pp.469-496, 2006.
DOI : 10.3166/tsi.25.469-496

M. Caillet, Un système expert d'aide à la classification taxonomique de classes de descripteurs, Ingénierie des Connaissances, 2007.

J. Carrive, F. Pachet, and R. Ronfard, Clavis -a temporal reasoning system for classification of audiovisual sequences, Proceedings of RIAO, 2000.
URL : https://hal.archives-ouvertes.fr/inria-00590130

L. Daigle, D. Van-gulik, R. Iannella, and P. Faltstrom, Uniform Resource Names (URN) Namespace Definition Mechanisms, 2002.
DOI : 10.17487/rfc3406

R. Deltour and C. Roisin, The limsee3 multimedia authoring model, Proceedings of the 2006 ACM symposium on Document engineering , DocEng '06, 2006.
DOI : 10.1145/1166160.1166203

URL : https://hal.archives-ouvertes.fr/inria-00189360

J. Glass, T. Hazen, S. Cyphers, I. Malioutov, and R. Barzilay, Progress in spoken lecture processing, Int. Conf. on Spoken Language Processing, 2006.

R. Goularte, E. , S. Moreira, M. Da-graça, and C. Pimentel, Structuring interactive TV documents, Proceedings of the 2003 ACM symposium on Document engineering , DocEng '03, 2003.
DOI : 10.1145/958220.958229

G. Gravier, F. Yvon, B. Jacob, and F. Bimbot, Sirocco, un système ouvert de reconnaissance de la parole, XXIVe Journées d'Études sur la Parole (JEP'02), 2002.

L. Hardman, D. C. Bulterman, and G. Van-rossum, The Amsterdam hypermedia model: adding time and context to the Dexter model, Communications of the ACM, vol.37, issue.2, pp.50-62, 1994.
DOI : 10.1145/175235.175239

I. Horrocks, P. F. Patel-schneider, and F. Van-harmelen, From SHIQ and RDF to OWL: the making of a Web Ontology Language, Web Semantics: Science, Services and Agents on the World Wide Web, vol.1, issue.1, pp.7-26, 2003.
DOI : 10.1016/j.websem.2003.07.001

J. Hunter, Adding multimedia to the semantic web building an mpeg-7 ontology, 1st Int. Semantic Web Working Symposium SWWS'01, 2001.

M. Jourdan, N. Layaïda, C. Roisin, L. Sabry-ismaïl, and L. Tardif, Madeus, an authoring environment for interactive multimedia documents, ACM Multimedia '98, 1998.

J. Lewis, Automated lip-sync: Backgrounds and techniques. Visualization and Computer Animation, 1991.
DOI : 10.1002/vis.4340020404

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.25.6217

K. Liu and H. Chen, Exploring media correlation and synchronization for navigated hypermedia documents, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, 2005.
DOI : 10.1145/1101149.1101159

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.109.6386

C. L. Madhwacharyula, M. Davis, P. Mulhem, and M. S. Kankanhalli, Metadata handling, ACM Transactions on Multimedia Computing, Communications, and Applications, vol.2, issue.4, 2006.
DOI : 10.1145/1201730.1201736

J. M. Martínez, R. Koenen, and F. Pereira, MPEG-7: the generic multimedia content description standard, part 1, IEEE Multimedia, vol.9, issue.2, pp.78-87, 2002.
DOI : 10.1109/93.998074

J. M. Martínez, R. Koenen, and F. Pereira, MPEG-7: the generic multimedia content description standard, part 1, IEEE Multimedia, vol.9, issue.2, pp.83-93, 2002.
DOI : 10.1109/93.998074

D. L. Mcguinness and F. Van-harmelen, Owl web ontology language overview, 2004.

M. Mohri, F. C. Pereira, and M. Riley, Weighted finite-state transducers in speech recognition, Computer Speech & Language, vol.16, issue.1, pp.69-88, 2002.
DOI : 10.1006/csla.2001.0184

R. Ronfard and T. T. Thuong, A framework for aligning and indexing movies with their script, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698), 2003.
DOI : 10.1109/ICME.2003.1220844

URL : https://hal.archives-ouvertes.fr/inria-00423417

T. K. Shih, L. Hwang, and J. Tsai, Formal model of temporal properties underlying multimedia presentations, Multimedia Modeling, 1996.

R. Troncy, W. Bailer, M. Hausenblas, and R. Schlatte, Enabling Multimedia Metadata Interoperability by Defining Formal Semantics of MPEG-7 Profiles, 1st Int. Conf. on Semantic and Digital Media Technologies (SAMT'06), pp.41-55, 2006.
DOI : 10.1007/11930334_4

R. Troncy, J. Carrive, S. Lalande, and J. Poli, A motivating scenario for designing an extensible audio-visual description language, CORIMEDIA'04, 2004.