An Efficient, Streamable Text Format for Multimedia Captions and Subtitles

Dick C.A. Bulterman 1 Jack Jansen 1 Pablo Cesar 1 Samuel Cruz-Lara 2
2 TALARIS - Natural Language Processing: representation, inference and semantics
Inria Nancy - Grand Est, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: Despite the high profile of media types such as video, audio and images, many multimedia presentations rely extensively on text content. Text can be used for incidental labels, or as subtitles or captions that accompany other media objects. In a multimedia document, text content is constrained not only by the need to support presentation styles and layout, but also by the temporal context of the presentation. This involves intra-text timing as well as extra-text synchronization with other media objects. This paper describes a new timed-text representation language that is intended to be embedded in a non-text host language. Our format, which we call aText (for the Ambulant Text Format), balances the need for text styling with the requirement for an efficient representation that can be easily parsed and scheduled at runtime. aText, which can also be streamed, is defined as an embeddable text format for use within declarative XML languages. The paper discusses the requirements for the format, describes the format itself, and compares it with other existing and emerging text formats. We also provide examples of aText embedded within the SMIL and MLIF languages and discuss our experience implementing aText in the Ambulant Player.
Document type:
Conference paper
ACM Symposium on Document Engineering - DocEng 2007, Aug 2007, Winnipeg, Canada. 2007

Cited literature [8 references]

https://hal.inria.fr/inria-00192467
Contributor: Samuel Cruz-Lara
Submitted on: Wednesday, 28 November 2007 - 10:51:43
Last modified on: Thursday, 11 January 2018 - 06:21:35
Document(s) archived on: Monday, 12 April 2010 - 05:22:12

File

DocEng_aText-Latest.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: inria-00192467, version 1

Citation

Dick C.A. Bulterman, Jack Jansen, Pablo Cesar, Samuel Cruz-Lara. An Efficient, Streamable Text Format for Multimedia Captions and Subtitles. ACM Symposium on Document Engineering - DocEng 2007, Aug 2007, Winnipeg, Canada. 2007. 〈inria-00192467〉

Metrics

Record views

245

File downloads

219