Skip to Main content Skip to Navigation
Conference papers

An Efficient, Streamable Text Format for Multimedia Captions and Subtitles

Dick C.A. Bulterman 1 Jack Jansen 1 Pablo Cesar 1 Samuel Cruz-Lara 2 
2 TALARIS - Natural Language Processing: representation, inference and semantics
Inria Nancy - Grand Est, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In spite of the high profile of media types such as video, audio and images, many multimedia presentations rely extensively on text content. Text can be used for incidental labels, or as subtitles or captions that accompany other media objects. In a multimedia document, text content is not only constrained by the need to support presentation styles and layout, it is also constrained by the temporal context of the presentation. This involves intra-text and extra-text timing synchronization with other media objects. This paper describes a new timed-text representation language that is intended to be embedded in a non-text host language. Our format, which we call aText (for the Ambulant Text Format), balances the need for text styling with the requirement for an efficient representation that can be easily parsed and scheduled at runtime. aText, which can also be streamed, is defined as an embeddable text format for use within declarative XML languages. The paper presents a discussion of the requirements for the format, a description of the format and a comparison with other existing and emerging text formats. We also provide examples for aText when embedded within the SMIL and MLIF languages and discuss our implementation experiences of aText with the Ambulant Player.
Complete list of metadata

Cited literature [8 references]  Display  Hide  Download
Contributor : Samuel Cruz-Lara Connect in order to contact the contributor
Submitted on : Wednesday, November 28, 2007 - 10:51:43 AM
Last modification on : Wednesday, April 6, 2022 - 3:48:36 PM
Long-term archiving on: : Monday, April 12, 2010 - 5:22:12 AM


Files produced by the author(s)


  • HAL Id : inria-00192467, version 1



Dick C.A. Bulterman, Jack Jansen, Pablo Cesar, Samuel Cruz-Lara. An Efficient, Streamable Text Format for Multimedia Captions and Subtitles. ACM Symposium on Document Engineering - DocEng 2007, Aug 2007, Winnipeg, Canada. ⟨inria-00192467⟩



Record views


Files downloads