East meets West: Producing Multilingual Resources in a European Context

Abstract : The EU concerted action TELRI has released a two-volume CD-ROM, which contains multilingual language resources, namely corpora, lexica, and tools for language engineering. This CD-ROM provides harmonised resources for unprecedented numbers and kinds of languages, mainly from non-EU countries, for which such resources still tend to be scarce. The first volume of the CD includes the aligned text of Plato’s Republic in twenty one languages, while the second volume contains extended results of the EU MULTEXTEast project, including the aligned and tagged novel ’1984’ by Goerge Orwell and accompanying lexica in seven languages. The paper presents the CD-ROM, the methods employed in its creation and its prospective uses.
Document type :
Conference papers
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/inria-00526965
Contributor : Laurent Romary <>
Submitted on : Sunday, January 3, 2016 - 4:10:02 PM
Last modification on : Monday, July 8, 2019 - 3:30:28 PM

File

lrec-cd.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : inria-00526965, version 1

Collections

Citation

Tomaz Erjavec, Ann Lawson, Laurent Romary. East meets West: Producing Multilingual Resources in a European Context. LREC, 1998, Grenade, Spain. ⟨inria-00526965⟩

Share

Metrics

Record views

432

Files downloads

58