Conference papers

Standardizing the management and the representation of multilingual data: the MultiLingual Information Framework

Samuel Cruz-Lara (1), Nadia Bellalem (1), Julien Ducret (1), Isabelle Kramer (1)
(1) LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract: The rapid evolution of information and communication technologies, and of natural language processing in particular, makes the question of standardization particularly acute. The issues at stake are industrial, economic, and cultural. This article presents a standardization methodology for harmonizing the management and representation of multilingual data. Interoperability among the industrial standards currently used for localization (XLIFF)[1] and translation memory (TMX)[2], together with recent initiatives such as the Internationalization Tag Set (ITS)[3], is a major objective for the coherent and global management of these data. MLIF (MultiLingual Information Framework)[4] is based on a standardization methodology developed within ISO (sub-committees TC37/SC3 "Computer Applications for Terminology" and SC4 "Language Resources Management"). MLIF should be considered a unified conceptual representation of multilingual content. It is not intended to replace or compete with any existing standard. Rather, it is being designed to provide a common conceptual model and a platform allowing interoperability among several translation and localization standards and, by extension, the tools that support them. The main asset of MLIF is this interoperability, which allows experts to gather various tools and representations related to multilingual data under a single conceptual unit. In addition, MLIF will make it possible to evaluate and compare these multilingual resources and tools.

https://hal.inria.fr/inria-00105653
Contributor : Samuel Cruz-Lara
Submitted on : Wednesday, October 11, 2006 - 5:57:46 PM
Last modification on : Friday, February 26, 2021 - 3:28:03 PM
Long-term archiving on : Tuesday, April 6, 2010 - 7:26:07 PM

Identifiers

  • HAL Id : inria-00105653, version 1

Citation

Samuel Cruz-Lara, Nadia Bellalem, Julien Ducret, Isabelle Kramer. Standardizing the management and the representation of multilingual data: the MultiLingual Information Framework. Third International Workshop on Language Resources for Translation Work, Research & Training, May 2006, Genoa/Italy. ⟨inria-00105653⟩
