YaMTG: An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora

Abstract : This paper describes YaMTG (Yet another Multilingual Translation Graph), a new open-source heavily multilingual translation database (over 664 languages represented) built using several sources, namely various wiktionaries and the OPUS parallel corpora (Tiedemann, 2009). We detail the translation extraction process for 21 wiktionary language editions, and provide an evaluation of the translations contained in YaMTG.
Document type : Conference paper

https://hal.inria.fr/hal-01022306
Contributor : Benoît Sagot
Submitted on : Thursday, July 10, 2014 - 11:38:11 AM
Last modification on : Thursday, August 29, 2019 - 2:24:06 PM
Long-term archiving on : Friday, October 10, 2014 - 11:31:38 AM

Identifiers

  • HAL Id : hal-01022306, version 1

Citation

Valérie Hanoka, Benoît Sagot. YaMTG: An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora. Language Resources and Evaluation Conference, European Language Resources Association, May 2014, Reykjavik, Iceland. ⟨hal-01022306⟩
